Home
Softono
spark-data-pipeline

spark-data-pipeline

Open source Scala
15
Stars
8
Forks
12
Issues
4
Watchers
3 years
Last Commit

About spark-data-pipeline

This project describes how to write full ETL data pipeline using spark.

Platforms

Web Self-hosted

Languages

Scala

spark-data-pipeline

Elasticsearch Setup

i) Download the Elasticsearch 6.3.0 or latest version and unzip it.

ii) Run the following command.

    $ bin/elasticsearch

Getting Started:

Clone and run in local mode:

    $ git clone [email protected]:techmonad/spark-data-pipeline.git
    $ cd spark-data-pipeline
    $ sbt run