Data Engineering
Notes on data engineering, data infrastructure, and data storage, along with an in-depth look at data analytics, data pipelines, and business intelligence through the lens of big data technologies and frameworks such as Apache Spark, Apache Cassandra, and Embulk.
References
The Data Warehouse Toolkit, 3rd Edition - Ralph Kimball, Margy Ross (https://www.amazon.com/Data-Warehouse-Toolkit-Definitive-Dimensional/dp/1118530802/ref=pd_lpo_sbs_14_t_0?_encoding=UTF8&psc=1&refRID=594QZ596BPTX3YC1BPY0)
Essential SQLAlchemy, 2nd Edition - Jason Myers, Rick Copeland (https://www.amazon.com/Essential-SQLAlchemy-Rick-Copeland/dp/0596516142)
Technologies at a Glance
- Apache Spark - https://spark.apache.org/
- Apache Cassandra - http://cassandra.apache.org/
- Embulk - http://www.embulk.org/docs/