This repository is a place for the Data Warehousing course at the Information Systems & Analytics department, Santa Clara University.
-
Updated
Apr 19, 2026 - Jupyter Notebook
This repository is a place for the Data Warehousing course at the Information Systems & Analytics department, Santa Clara University.
A large-scale data framework that will enable us to store and analyze financial market data and drive future predictions for investment.
This project simulates a real-world enterprise data migration and modernization strategy. It extracts transactional data from a simulated "On-Premise" environment (hosted on AWS EC2), performs heavy distributed processing using a Hadoop/Spark cluster, and ultimately serves the data via a Cloud-Native, serverless architecture to optimize costs .
SQL, Databases, warehouses, Data lake, cloud storage, MYSQL, Data Pipeline
SQL, Databases, warehouses, Data lake, cloud storage, MYSQL, Data Pipeline
This repo contains HR Analytics project to analyze what factors impact employee attrition using dataset for Atlas Labs Company.
YAML format for star schema and snowflake schema
Open-source Supply Chain analytics on Microsoft Fabric: a scalable Bronze-Silver-Gold pipeline with automated CSV ingestion, Delta Lake transforms, semantic modeling (DAX & RLS) and interactive Power BI reports. Join to enhance pipelines, refine models, and build next-gen supply-chain insights!
A data visualization project using Power BI to analyze trends, business metrics, and key insights through interactive dashboards.
This repository contains practical examples of data warehousing concepts, including star schema and ETL processes, all implemented using MySQL.
TIL: Data Warehouse and ETL Architecture
Enterprise-grade Sisense ElastiCube architecture & SQL modeling for AMEX. Features a hybrid ETL pipeline, Snowflake schema design, and 360° operational visibility dashboards.
DW://master is an interactive educational platform for mastering data warehousing concepts — from core architecture to advanced slowly changing dimensions (SCD), schema design, IBM Watsonx.data lakehouse technology, and SQL aggregation analytics. Features an AI tutor powered by Claude Sonnet 4 that answers questions about each topic in real-time📊.
✅ Terraform module to create and manage Snowflake databases and schemas using infrastructure as code.
Complete materials for "Data Warehousing and Modeling" uni course: lectures (Inmon, Kimball, Dimensional Modeling, NoSQL), PostgreSQL labs (Star Schema), exam questions, and practical SQL solutions.
Building a powerbi dashboard by pulling da
Hackolade(https://hackolade.com) plugin for Snowflake
🗄️ IBM Relational Database Administrator with GenAI Certificate Portfolio – A comprehensive collection of projects, labs, and assignments showcasing expertise in relational database administration, 🏘️data warehousing, 🔁ETL pipelines, and 🤖Generative AI integration for modern database management.
A comprehensive 📚portfolio showcasing hands-on projects and skills acquired through the IBM Data Warehouse Engineer certification program. Features real-world data warehousing solutions, ETL pipelines, cloud data platforms, and 📈business intelligence implementations with 🏢enterprise-grade technologies.
Add a description, image, and links to the snowflake-schema topic page so that developers can more easily learn about it.
To associate your repository with the snowflake-schema topic, visit your repo's landing page and select "manage topics."