1
“AI systems don’t just reflect our biases — they amplify them at scale.”
- Real-world harms are already documented: Amazon’s hiring AI penalized women, COMPAS scored Black defendants as higher risk, facial recognition fails on dark-skinned women.
- Core ethical principles: fairness, transparency, accountability, privacy, and safety — these apply to every AI system.
- The EU AI Act (2024) is the world’s first comprehensive AI law. AI ethics is no longer optional — it’s becoming law.
2
“Bias doesn’t start with the algorithm — it starts with the world.”
- Seven sources of bias: historical, representation, selection, measurement, label, temporal, and algorithmic.
- Feedback loops amplify bias over time — biased predictions create biased data that trains more biased models.
- Bias detection methods: disaggregated evaluation, disparate impact testing, counterfactual testing, and red teaming.
3
“You cannot satisfy all fairness definitions simultaneously — the impossibility theorem proves it.”
- Three key fairness definitions: demographic parity (equal selection rates), equalized odds (equal error rates), calibration (equal accuracy of scores).
- The impossibility theorem (Chouldechova, 2017): you cannot satisfy all three simultaneously when base rates differ across groups.
- Fairlearn is the leading open-source toolkit for measuring and improving fairness in ML models.
4
“Bias mitigation is not a one-time fix — it’s a continuous process across the entire ML lifecycle.”
- Three intervention points: pre-processing (fix the data), in-processing (constrain the model), post-processing (adjust the outputs).
- LLM debiasing: RLHF, Constitutional AI, prompt engineering, output filtering, and representation engineering.
- The fairness-accuracy trade-off is real but often smaller than expected — typically 1–3% accuracy loss for significant fairness gains.
Bottom line: Bias is systemic, not accidental. It enters AI systems through data, design, and deployment. Fairness definitions conflict (impossibility theorem), so you must choose which definition fits your context. Mitigation is continuous, not one-time.