Home
Softono
IBM-Data-Engineering-Professional-Specialization-Coursera

IBM-Data-Engineering-Professional-Specialization-Coursera

Open source MIT Python
35
Stars
18
Forks
0
Issues
1
Watchers
1 year
Last Commit

About IBM-Data-Engineering-Professional-Specialization-Coursera

Solution for IBM Data Engineering Professional Certificate

Platforms

Web Self-hosted

Languages

Python

Links

IBM Data Engineering Professional Specialization - Coursera

Description

In this repo, I recap my solutions for the assignments for the 15-month IBM Data Engineering Professional Specialization on Coursera that I have done in less than 3 weeks. The specialization contains:

  • Create, design, and manage relational databases and apply database administration (DBA) concepts to RDBMSes such as MySQL, PostgreSQL, and IBM Db2.
  • Develop and execute SQL queries using SELECT, INSERT, UPDATE, DELETE statements, database functions, stored procedures, Nested Queries, and JOINs.
  • Demonstrate working knowledge of NoSQL & Big Data using MongoDB, Cassandra, Cloudant, Hadoop, Apache Spark, Spark SQL, Spark ML, Spark Streaming.
  • Implement ETL & Data Pipelines with Bash, Airflow & Kafka; architect, populate, deploy Data Warehouses; create BI reports & interactive dashboards.​

There are 13 courses throughout the specialization and a capstone project at the end:

  1. Introduction to Data Engineer
  2. Python for Data Science, AI & Development
  3. Python Project for Data Engineer
  4. Introduction to Relational Databases (RDBMS)
  5. Databases and SQL for Data Science with Python
  6. Hands-on Introduction to Linux Commands and Shell Scripting
  7. Relational Database Administration (DBA)
  8. ETL and Data Pipelines with Shell, Airflow and Kafka
  9. Getting Started with Data Warehousing and BI Analytics
  10. Introduction to NoSQL Databases
  11. Introduction to Big Data with Spark and Hadoop
  12. Data Engineering and Machine Learning using Spark
  13. Data Engineering Capstone Project

Tools and Technologies

  • OLTP database - MySQL
  • NoSql database - MongoDB
  • Production Data warehouse – DB2 on Cloud
  • Staging - Data warehouse – PostgreSQL
  • Big data platform - Hadoop
  • Big data analytics platform – Spark
  • Business Intelligence Dashboard - IBM Cognos Analytics
  • Data Pipelines - Apache Airflow

Certificates

  • IBM Data Engineering Foundations alt text
  • IBM Data Warehouse Engineer Proffesional alt text
  • IBM Data Engineering Proffesional alt text