This course examines the greater context under which the practice of Data Science exists and explores concrete ways issues of fairness surface in the technical work of a Data Scientist. Much of the work of a Data Scientist contribute to decision making processes, either through algorithmic systems or informing policy. The course will survey frameworks for studying the objectives and impacts of such decisions, paying particular attention to how such decisions affect a diverse population of individuals.
The course will ground, motivate, and contextualize these frameworks in the experiences of individuals and communities impacted most by decision making systems. As participants in the course, we will relate to these individuals through the critical lens of our own experience. Concretely, the course will dedicate significant time to the study of particular histories and contexts of marginalized individuals and communities as a necessary component of any analysis of these decision making systems.
At the end of the course, student will be able to:
Together, these objectives form a toolkit that students can use in critically analyzing the impacts of a decision making system, diagnose potential sources of bias, and constructively discuss the implications of decision choices in terms of values and the harms incurred by others.
The course will broadly cover the following topics:
The methods used in the course regularly incorporate experiential and practical approaches to solidify and expand understanding of the course’s lecture-based, topical material. A few of these methods are outlined below:
This course is interdisciplinary by design. Assignments necessitate reading papers from the humanities and social sciences, deriving probabilistic/statistical models, and writing code. I assume you have some exposure to all of these areas through DSC 80 and standard exposure to critical reasoning typical of an upper-division course. However, much of the material in this course will require pushing yourself to understand topics outside your comfort zone; a curious, open mind, with a healthy amount of perseverance is most important.
The goal in this course is to meaningfully engage with and relate to the material; it’s not necessary to understand every line of your reading. Start by carefully reading the abstract, introduction, and conclusion (skimming the rest). Later, read over the work in its entirety. If you are still finding understanding a topic difficult, you should ask your classmates, then reach out to the instructor or TA!
This course will also approach high-stakes and controversial topics. These discussions may occur during lecture, section, among student conversations in breakout rooms, or in peer reviews. Learning to have respectful discussions about such topics is a valuble life skill for a data scientist, who tends to interact with a wide variety of people.
Here are a few guidelines for respectful conversation we will follow in this course:
(Taken from UMich CRLT).
This course will be in-person, as listed in the course catalog. For those who cannot atttend lecture, podcasts of the lectures will be available after the in-person meeting. If you participate in the class asynchronously, please watch lecture within 24 hours of the scheduled lecture to stay up-to-date with the weekly readings.
Each week, complete the reading assignment found in the schedule and write your response to the given prompt by Mondays at 11:59PM.
Upon submission of the reading response, you must complete your assigned peer review by the following Wednesday at 11:59PM (48 hours later). These assignments are graded on a 0-2 scale.
Grade: 20% of Total
Write a paper that identifies and analyzes a potential inequity in the world, using data. You may either replicate a known publication or find a topic yourself. See the assignment.
This project may be worked on in pairs.
Grade: 40% of Total
In this paper, you will:
The audit may lead to studying different approaches to potential unfair decisions by the system. Your analysis must at least 2 of the topics covered in the second half of the course. See the assignment.
This project may be worked on in pairs.
Grade: 40% of Total
Week | Topic | Assignments |
---|---|---|
Week 1 | Introduction, Frameworks of Distributive Justice | |
Week 2 | Measurement, Data, and Decision Making | |
Week 3 | Measuring Discrepancies at the Group Level and relationships to Fairness | |
Week 4 | Score Functions, Calibration, and Creating ‘Fair’ Classifiers | |
Week 5 | Limits of Parity Measures: intersectionality, infra-marginality | |
Week 6 | Fairness and Discrimination at the Individual Level | Paper 1 Due |
Week 7 | Bias in the ML pipeline: pre-processing, in-processing, post-processing | |
Week 8 | Amplification of Bias and Feedback Loops | |
Week 9 | Representational Harms I: Stereotyping; Fairness in Feature Space | |
Week 10 | Representational Harms II: NLP, 3rd Party APIs | |
Finals Week | Paper 2 Due |
nbconvert
command-line, with code-strip
option.I aim to create an environment in which all students can succeed in this course. If you are experiencing obstacles to learning and engagement in the course at any point during the quarter, don’t hesistate to contact me. If you need an accommodation for whatever reason, I will try to work with you to realize the accommodation in some respect.
If you are requesting accommodations for this course due to a disability, you must provide a current Authorization for Accommodation (AFA) letter issued by the Office for Students with Disabilities (OSD). Students are required to present their AFA letters to Faculty (please make arrangements to contact me privately) and to the DSC Student Advisor in advance so that accommodations may be arranged. Contact the OSD for further information: 858.534.4382 (phone) osd@ucsd.edu (email) http://disabilities.ucsd.edu (website)
If you have feedback on how to make the class more accessible and inclusive, please let me know.
We are committed to fostering a learning environment for this course that supports a diversity of thoughts, perspectives and experiences, and respects your identities (including race, ethnicity, heritage, gender, sex, class, sexuality, religion, ability, age, educational background, etc.). Our goal is to create a diverse and inclusive learning environment where all students feel comfortable and can thrive.
Our instructional staff will make a concerted effort to be welcoming and inclusive to the wide diversity of students in this course. If there is a way we can make you feel more included please let one of the course staff know, either in person, via email/discussion board, or even in a note under the door. Our learning about diverse perspectives and identities is an ongoing process, and we welcome your perspectives and input.
We also expect that you, as a student in this course, will honor and respect your classmates, abiding by the UCSD Principles of Community (https://ucsd.edu/about/principles.html). Please understand that others’ backgrounds, perspectives and experiences may be different than your own, and help us to build an environment where everyone is respected and feels comfortable.
If you experience any sort of harassment or discrimination, please contact the instructor as soon as possible. If you prefer to speak with someone outside of the course, please contact the Office of Prevention of Harassment and Discrimination: https://ophd.ucsd.edu/.
In this course we expect students to adhere to the UC San Diego Integrity of Scholarship Policy. This means that you will complete your work honestly, with integrity. Some examples of specific ways this policy applies to DSC 167 include: