Home
Softono
data-warehousing

data-warehousing

Open source Jupyter Notebook
12
Stars
3
Forks
0
Issues
5
Watchers
3 weeks
Last Commit

About data-warehousing

This repository is a place for the Data Warehousing course at the Information Systems & Analytics department, Santa Clara University.

Platforms

Web Self-hosted

Languages

Jupyter Notebook

Links

data-warehousing
Sprint Quarter 2026
March 31 - June 6, 2026

What is a Database?

the DATABASE helps to perform the fundamental operation of business:

  • keep customer records
  • product records
  • transactions,
  • patient profile records
  • patient DNA samples,
  • ...

What is a Data Warehouse?

the DATA WAREHOUSE helps you to analyze your business.

What is a Data Warehouse?

    Where do you store a petabyte of data for  
    business intelligence?  A  data warehouse, 
    that’s where. Data spans for 2+ years...

    At  its  core, a   Data  Warehouse  is  a 
    centralized, read-optimized system designed 
    specifically for querying, reporting,  and 
    analytics. It’s where  your business  data 
    goes  after it’s  been cleaned, structured,  
    and denormalized for performance. 

    Unlike operational databases that are built 
    for transactions —  fast  inserts,  updates, 
    deletes — a warehouse is built for OLAP (online 
    analytical processing): aggregations, joins, 
    historical trends, and slow-but-heavy queries.

What is Data Warehousing?

    Data  warehousing (DW) is a technique 
    of gathering and  analyzing data from 
    many sources to get valuable business 
    insights. Typically, a data warehouse 
    integrates and analyzes business data 
    from many sources.  

    The data  warehouse  is the  basis of 
    the business intelligence (BI) system, 
    which can analyze and report on data.

Skills you will develop:

    Derive insights from data by 
    building analytical, managerial 
    and baseline technical skills.

Classic Data Warehousing


Medallion Architecture Data Warehousing


Example Medallion Architecture


1. Git Repository

2. Instructor, Adjunct Professor: Mahmoud Parsian

3. Teaching Assistant (TA): Mohini Yadav

4. Course Information & Description

5. Academic Integrity

6. Class Meeting Dates & Hours

7. Prerequisite

8. Glossary of Big Data, MapReduce, Spark, Data Warehousing

9. Required Books

10. Required Software: MySQL, Python, Tableau

11. Syllabus, Spring Quarter 2026

12. Grading and Class Conduct

13. Python Tutorials

14. SQL Style Guide

15. SQL Tutorials

16. Office Hours

17. Midterm Exam

18. Final Exam

19. Mahmoud Parsian's Published Books