Skip to the content.

10718: Machine Learning in Practice

Previous Versions: Fall 2022 Fall 2021 Fall 2020 Spring 2020

Fall 2023: Tues & Thurs, 3:30-4:50 (HOA 160)

Important

Class Description

This is a project-based course designed to provide students training and experience in solving real-world problems using machine learning, while exploring the interface between research and practice with a particular focus on topics in fairness and explainability.

The goal of this course is to give students exposure to the nuance of applying machine learning to the real-world, where common assumptions (like iid and stationarity) break down, and the growing needs for (and limitations of) approaches to improve fairness and explainability of these applications. Through project assignments, lectures, discussions, and readings, students will learn about and experience building machine learning systems for real-world problems and data, as well as applying and evaluating the utility of proposed methods for enhancing the interpretability and fairness of machine learning models. Through the course, students will develop skills in problem formulation, working with messy (aka real) data, making ML design choices appropriate for the problem at hand, model selection, model interpretability, understanding and mitigating bias & disparities, and evaluating the impact of deployed models.

People

Instructor

Rayid Ghani

GHC 8023
Office Hours:
Tue 2-3, Thu 1-2

Teaching Assistant

Catalina Vajiac  
 
Office Hours: Monday 2-3pm, Thursday 2-3pm GHC 8018

Grading

Note that this course is being offered pass/fail. Each time you ask how each action you take will affect your grade will result in lowering of the grade :)

Weekly project update assignments (10%)

Midterm take-home exam (20%)

Write-up on interpretability findings (15%)

Write-up on fairness findings (15%)

Group presentation (5%)

Final reflection write-up (15%)

Quizzes on readings and concepts (5%)

Class attendance and participation (10%)

Submitting weekly check-in and feedback forms (5%)

Schedule

See the detailed syllabus below for much more detail as well, including links to required readings and information about group projects, grading, and helpful optional readings.

Week Dates Topic Assignments
1 Tu: Aug 29 Class Intro and Overview  
1 Th: Aug 31 ML Project Scoping Project Team Selection
2 Tu: Sep 5 Getting, Storing, and Linking Data Individual Assignment: Getting to know the class project (due Monday)
2 Th: Sep 7 Analytical Formulation / Baselines  
3 Tu: Sep 12 Model Selection Methodology Project Assignment 1: Formulation and Baseline (due Monday)
3 Th: Sep 14 Performance Metrics  
4 Tu: Sep 19 Feature Engineering and Imputation Project Assignment 2:
Validation set up
Initial pipeline with train and validation set(s) and baseline implemented (due Monday)
4 Th: Sep 21 ML Pipelines  
5 Tu: Sep 26 Models/hyperparameters in practice Project Assignment 3:
list of features and some subset implemented (due Monday)
5 Th: Sep 28 Temporal Model Selection  
6 Tu: Oct 3 Review of modeling results Project Assignment 4:
modeling results (due Monday)
6 Th: Oct 5 Common ML Pitfall in Practice  
7 Tu: Oct 10 Module 1 Review: Applied ML - End to End Pipelines Updated model results assignment (+ model selection) Due Monday
7 Th: Oct 12 no class for midterm time Take-Home Midterm Available
8 Tu: Oct 17 No Class - Mid-semester break  
8 Th: Oct 19 No Class - Mid-semester break  
9 Tu: Oct 24 ML Ethics Issues Overview Add assignment: importances and cross tabs
9 Th: Oct 26 Interpretability: Intro and Overview, taxonomy  
10 Tu: Oct 31 Understanding the Models importances + cross tabs assignment due
10 Th: Nov 2 Interpretability Methods: Inherently Interpretable (GA2Ms, RiskSLIM, etc.)  
11 Tu: Nov 7 Interpretability Methods:: Post-Hoc Local/Feature-based (LIME, SHAP  
11 Th: Nov 9 Interpretability Methods: Other methods (counterfactual DICE, example-based ProtoDash, etc.) Interpretability Writeup Due on Friday
12 Tu: Nov 14 Fairness in ML Overview  
12 Th:Nov 16 Field Trials and Causality  
13 Tu: Nov 21 Fairness Methods: Pre-processing (removing sensitive attribute, sampling)  
13 Th: Thanksgiving Thanksgiving holiday  
14 Tu: Nov 28 Fairness Methods: In-processing (Zafar, Celis, fairlearn, etc.)  
14 Th: Nov 30 Fairness Methods: Post-Processing (Hardt, LA, etc)  
15 Tu: Dec 5 Module 3 Review: ML Fairness  
15 Th: Dec 7 Wrap-Up Bias Writeup Due on Friday
  Finals Week   Final Reflection Writeup Due (Date TBD)

Projects and Deliverables

Broadly, the course will be divided into three modules: 1) applied end-to-end machine learning pipelines, 2) model interpretability, and 3) fairness in machine learning. Throughout the course, students will work in groups of 4-5 on an applied project based on a real-world problem to explore the ideas and methods covered in each module in detail. During the project, students will be responsible for several key deliverables:

Structure

The course is divided into three modules:

  1. Applying ML to Practical Problems

  2. Understanding ML Models

  3. Fairness in ML

Below is a preliminary schedule of the course, including the readings that will be assigned for that week. Please be sure to have read and be prepared to discuss the readings before the specified class session. Most of these topics can be (and often are) the focus of entire courses and generally, we’ll only scratch the surface, but hopefully inspire you to delve deeper into areas that interest you (and you’ll find plenty of open research questions in each). Optional readings are also listed for most sessions which may be of interest to students who wish to delve deeper into a given area as well as provide additional context for your related project work.

MODULE 1: APPLYING ML TO PRACTICAL PROBLEMS**

MODULE 2: UNDERSTANDING ML MODELS**

MODULE 3: FAIRNESS IN ML**

More Resources

You may find a number of books useful as general background reading on specific topics covered in class, but these are by no means required texts for the course:

Additionally, the Global Communication Center (GCC) can provide assistance with the written or oral communication assignments in this class. The GCC is a free service, open to all students, and located in Hunt Library. You can learn more on the GCC website: cmu.edu/gcc.

Your Responsibilities

Attendance: Because much of this course is focused on discussion with your classmates, attending each session is important to both your ability to learn from the course and to contribute to what others get out of it as well. As such, you’ll be expected to attend every session and your participation will factor into your grade as described above. Should anything come up will require you to miss a class (illness, conferences, etc), please let one of the course staff know in advance.

Academic Integrity: Violations of class and university academic integrity policies will not be tolerated. Any instances of copying, cheating, plagiarism, or other academic integrity violations will be reported to your advisor and the dean of students in addition to resulting in an immediate failure of the course.

Resources

Students with Disabilities: We value inclusion and will work to ensure that all students have the resources they need to fully participate in our course. Please use the Office of Disability Resource’s online system to notify us of any necessary accommodations as early in the semester as possible. If you suspect that you have a disability but are not yet registered with the Office of Disability Resources, you can contact them at access@andrew.cmu.edu

Health and Wellness: As a student, you may experience a range of challenges that can interfere with learning, such as strained relationships, increased anxiety, substance use, feeling down, difficulty concentrating and/or lack of motivation. These mental health concerns or stressful events may diminish your academic performance and/or reduce your ability to participate in daily activities. CMU services are available, and treatment does work.
All of us benefit from support during times of struggle. There are many helpful resources available on campus and an important part of the college experience is learning how to ask for help. Asking for support sooner rather than later is almost always helpful.

If you or anyone you know experiences any academic stress, difficult life events, or feelings like anxiety or depression, we strongly encourage you to seek support. Counseling and Psychological Services (CaPS) is here to help: call 412-268-2922 and visit their website at cmu.edu/counseling/. Consider reaching out to a friend, faculty or family member you trust for help getting connected to the support that can help.

If you or someone you know is feeling suicidal or in danger of self-harm, call someone immediately, day or night:
CaPS: 412-268-2922
Re:solve Crisis Network: 888-796-8226

If the situation is life threatening, call the police
On campus: CMU Police: 412-268-2323
Off campus: 911

Discrimination and Harassment: Everyone has a right to feel safe and respected on campus. If you or someone you know has been impacted by sexual harassment, assault, or discrimination, resources are available to help. You can make a report by contacting the University’s Office of Title IX Initiatives by email (tix@andrew.cmu.edu) or phone (412-268-7125).

Confidential reporting services are available through the Counseling and Psychological Services and University Health Center, as well as the Ethics Reporting Hotline at 877-700-7050 or www.reportit.net (user name: tartans; password: plaid).
You can learn more about these options, policies, and resources by visiting the University’s Title IX Office webpage at https://www.cmu.edu/title-ix/index.html
In case of an emergency, contact University Police 412-268-2323 on campus or call 911 off campus.

Student Academic Success Center (SASC)

SASC focuses on creating spaces for students to engage in their coursework and approach learning through a variety of group and individual tutoring options. They offer many opportunities for students to deepen their understanding of who they are as learners, communicators, and scholars. Their workshops are free to the CMU community and meet the needs of all disciplines and levels of study. SASC programs to support student learning include the following (program titles link to webpages):