Lecture Notes 2024

Week 1:
  • Course Introduction
  • What is Big Data? Where Does Big Data Come From? [PDF]PPTX]
  • What we can do and what should we do with Big Data? [PDF]PPTX]

Week 2:
  • Introduction to Feature Manipulation [PDF]
  • Feature Selection: Wrapper approaches and Sequential Search
    • Reading: Data Preprocessing [PDF]

Week 3:
  • Feature Selection: Filter and Embedded approaches
    • Filter Feature Selection [PDF]
    • Embedded Feature Selection [PDF]
    • Extra Notes --- COMP 307 Decision Tree Learning with An Example [PDF]

Week 4:
  • Feature Construction [PDF]

Week 5:
  • Nonlinear Dimensionality Reduction: Manifold Learning

Mid-Trimester Teaching Break


Week 6:
  • Clustering

Week 7-8:

Week 8-9:
  • Regression 2: Moving Beyond Linearity [PDF][PPTX]

Week 9-10: Hadoop MapReduce [PDF][PPTX]

Week 10-11: Apache Spark [PDF][PPTX]

Week 12: Spark Machine Learning Libraries