05696601 Introduction to Big Data Ecosystem 🏖️🌏
Academic Year 2025 | Semester 2
BigGrade | 4rum
Course Description
This course wraps around three main characteristics of Big Data, also known as 3Vs (Volume, Variety, Velocity). Students will learn challenges posed by each characteristic along with techniques to tackle each of them. Those techniques include database partitioning/replication, distributed processing, schemaless data models, object storage, data warehouse, data lake, lakehouse, and stream processing. Modern data platforms (e.g., Spark, Ray, MongoDB, Neo4j, MinIO, dbt, DuckLake, Databricks, Apache Kafka) will be leveraged to demonstrate those techniques.
Course Information
  • Time: Monday 9:00 - 12:00
  • Location: Sc08 Room 714
  • Office Hours: Monday 13:00 - 16:00
Course Evaluation
  • Homework Assignment 20%
  • Lab Assignment 20%
  • Project 40%
  • Final Exam 20%
Course Staff

Yuttapichai Kerdcharoen (Guide)
Instructor
Malapchai Chaisihat (Nueng)
Head Teaching Assistant
Schedule
Date Topic
Nov 24 Lecture #1 - Introduction to Big Data
Online A1 Out
Slides Video
Dec 1 Lecture #2 - Evolution of Data Technologies
Online
Slides Video
Dec 8 Lecture #3 - Distributed Databases (Sharding and Replication)
Online A1 Due
Slides Video
Dec 15 Lab #1 - Building Data Pipelines with dbt Slides
Dec 22 Lecture #4 - Distributed Processing ⚡️ Slides Video
Dec 29
New Year Week
New Year Week (No Class)
Jan 5 Lab #2 - MongoDB 101 / Neo4j 101 Slides
Jan 12 Commencement Day (No Class)
Make-Up Class
Jan 19 Midterm Week (No Class)
Jan 26 Lab #3 - PySpark for Distributed Data Transformation
A2 Out Project Team Registration
Feb 2 Lecture #5 - Modern Data Warehouse, Data Lake, and Lakehouse ⚡️
Feb 9 Lab #4 - Building Data Lakes with DuckLake and MinIO ⚡️
Project Proposal
Feb 16 Lecture #6 - Stream and Incremental Processing ⚡️
A2 Due
Feb 23 Lab #5 - Apache Kafka ⚡️
Project Checkpoint 1 Due
Mar 2 Guest Lecture
Mar 9 Lecture #7 - Emerging Big Data Technologies ⚡️
Project Checkpoint 2 Due
Mar 25 Final Exam
Mar 31 (TBA) Project Presentation
Assignment