Context

Translating abstract fairness notions into precise, implementable definitions is essential for effective AI system evaluation. Without clear definitions, fairness remains aspirational but unmeasurable, making systematic improvement impossible.

Different fairness definitions embody distinct philosophical perspectives. Egalitarian views emphasizing equal outcomes align with demographic parity, while libertarian perspectives prioritizing procedural fairness align with individual fairness definitions. These philosophical commitments shape technical systems through fairness metric choices.

Mathematical formulations transform philosophical principles into statistical criteria for empirical evaluation. Group fairness definitions ensure similar outcomes across protected groups, individual fairness ensures similar individuals receive similar treatment, and counterfactual fairness examines how predictions would change if protected attributes differed. These implementation choices determine real-world outcomes—who receives loans, housing, or employment opportunities.

Legal frameworks further shape fairness requirements. U.S. anti-discrimination law distinguishes between disparate treatment and disparate impact, while EU regulations emphasize data protection and transparency. These frameworks create compliance requirements that fairness definitions must address in regulated domains.

Critically, mathematical impossibility results show that multiple fairness criteria cannot, in general, be satisfied simultaneously. This necessitates explicit trade-offs based on application context and ethical priorities rather than the pursuit of contradictory objectives.

The Fairness Definition Selection Tool you'll develop in Unit 5 represents the second component of the Fairness Audit Playbook (Sprint Project). This tool will help you systematically select appropriate fairness definitions based on application context, ethical principles, and legal requirements, ensuring assessments address the most relevant dimensions for specific applications.

Learning Objectives

By the end of this Part, you will be able to:

  • Analyze philosophical foundations of different fairness definitions. You will evaluate how fairness definitions embody distinct philosophical perspectives on justice and equality, recognizing implicit values embedded in technical definitions rather than treating them as neutral mathematical formulations.
  • Implement mathematical formulations of various fairness criteria. You will translate abstract fairness concepts into precise mathematical definitions that can be quantitatively measured in AI systems, moving beyond vague aspirations to specific, calculable criteria.
  • Evaluate legal and regulatory implications of fairness definitions. You will assess how fairness definitions align with legal standards across jurisdictions and application domains, selecting definitions that satisfy relevant requirements while understanding where legal and technical approaches diverge.
  • Navigate trade-offs between competing fairness definitions. You will analyze inherent tensions between fairness criteria, including mathematical impossibility results that prevent simultaneously optimizing multiple definitions, making informed choices between competing fairness goals.
  • Develop contextual approaches to fairness definition selection. You will create methodologies for selecting appropriate fairness definitions based on application domain, stakeholder requirements, and ethical considerations, moving beyond one-size-fits-all approaches.

Units

Unit 1

Unit 1: Conceptual Foundations of Fairness

1. Conceptual Foundation and Relevance

Guiding Questions

  • Question 1: What does "fairness" actually mean in the context of algorithmic systems, and why do different stakeholders often have fundamentally different conceptions of what constitutes a fair outcome?
  • Question 2: How can abstract philosophical notions of fairness be translated into precise mathematical formulations that can guide the development of fair AI systems?

Conceptual Context

Understanding the conceptual foundations of fairness is essential for developing AI systems that align with our ethical values and societal expectations. Without a clear grasp of what fairness means in different contexts, any technical implementation will be built on an unstable foundation. This challenge is particularly acute because fairness is not a monolithic concept but rather a multi-faceted one with numerous, sometimes contradictory, interpretations.

As a data scientist or ML engineer, you routinely make decisions that implicitly encode specific fairness assumptions into your systems. These decisions range from how you frame the problem to which metrics you optimize for, and they have real consequences for the people affected by your models. The field of algorithmic fairness has demonstrated that seemingly technical choices about model design often embed normative judgments about what constitutes equitable treatment (Barocas, Hardt, & Narayanan, 2020).

This Unit builds on the historical patterns of discrimination explored earlier in the Sprint and provides the conceptual framework needed for the mathematical formulations of fairness you'll examine in the next Unit. The conceptual understanding you develop here will directly inform the Fairness Definition Selection Tool we will build in Unit 5, particularly in determining which fairness definitions are appropriate for specific contexts based on their underlying philosophical foundations.

2. Key Concepts



Philosophical Perspectives on Fairness

Fairness in AI systems derives from broader philosophical traditions that have developed over centuries. These philosophical frameworks offer different lenses through which to view what constitutes equitable treatment, and they often lead to divergent technical implementations in AI systems.

Understanding these philosophical foundations is essential because they shape how we define and measure fairness in computational contexts. Each philosophical perspective emphasizes different aspects of fairness, leading to distinct mathematical formulations and intervention approaches. Recognizing these differences helps explain why stakeholders may disagree about whether a system is "fair" despite looking at the same technical metrics.

Key philosophical perspectives include:

  1. Egalitarianism emphasizes equality of outcomes across groups, suggesting that fair AI systems should produce similar results for different demographic groups regardless of other factors. This perspective often manifests in statistical parity metrics that compare prediction rates across protected groups.
  2. Libertarianism focuses on procedural fairness and treatment of individuals based on relevant factors, suggesting that fair AI systems should make similar predictions for similar individuals regardless of protected attributes. This aligns with individual fairness metrics that emphasize consistency of treatment.
  3. Rawlsian justice prioritizes improving outcomes for the least advantaged groups, suggesting that fair AI systems should optimize for minimum harm to the most vulnerable populations. This might manifest in metrics that minimize maximum disparity or that prioritize improvements for disadvantaged groups.
  4. Utilitarianism emphasizes maximizing overall welfare, suggesting that fair AI systems should optimize for aggregate metrics while potentially accepting some disparities if they lead to better overall outcomes. This perspective often prioritizes accuracy or utility metrics alongside fairness constraints.

Binns (2018) demonstrates in his analysis of fairness definitions that these philosophical traditions directly inform how fairness is operationalized in ML systems. For example, demographic parity (equal prediction rates across groups) aligns with egalitarian perspectives, while equal opportunity (equal true positive rates) reflects a more meritocratic view that emphasizes treatment of "qualified" individuals (Binns, 2018).

For the Fairness Definition Selection Tool we'll develop in Unit 5, understanding these philosophical perspectives will be essential for mapping stakeholder values and domain-specific requirements to appropriate fairness definitions. Rather than assuming a universal definition of fairness, the framework will help you select definitions that align with the specific philosophical perspectives most relevant to your application context.

Stakeholder Perspectives and Conflicting Goals

Fairness in AI systems involves multiple stakeholders with potentially conflicting goals and perspectives on what constitutes fair treatment. These stakeholders include system developers, users, individuals subject to algorithmic decisions, regulatory bodies, and broader society. Each may prioritize different aspects of fairness and may evaluate system performance through different normative lenses.

This concept interacts with philosophical perspectives by showing how abstract principles take concrete form in specific contexts with real stakeholders. The plurality of legitimate stakeholder perspectives explains why fairness cannot be reduced to a single universal metric.

Mitchell et al. (2021) illustrate this through their analysis of the COMPAS recidivism prediction tool, where different stakeholder groups (defendants, judges, prosecutors, society at large) had fundamentally different conceptions of fairness. Defendants might prioritize equal false positive rates across groups (minimizing unfair detentions), while prosecutors might emphasize equal false negative rates (minimizing unfair releases). Society broadly might care about long-term impacts on recidivism rates and community well-being. No single fairness metric could satisfy all these legitimate concerns simultaneously (Mitchell et al., 2021).

For our Fairness Definition Selection Tool, understanding stakeholder perspectives will guide the development of a methodology for stakeholder analysis that identifies relevant perspectives, maps their concerns to specific fairness definitions, and provides approaches for navigating conflicting priorities. This ensures that fairness implementations address the concerns of those most affected by system decisions rather than defaulting to technically convenient metrics.

Fairness as Context-Dependent



Fairness is inherently context-dependent, with appropriate definitions varying based on domain-specific factors, cultural contexts, historical patterns, and specific applications. This concept is crucial because it highlights that no single fairness definition is universally applicable across all AI systems. Instead, fairness must be tailored to the specific context in which a system operates.

This concept interacts with both philosophical perspectives and stakeholder analysis by showing how abstract principles and stakeholder concerns manifest differently across contexts. What might be considered fair in one domain could be inappropriate in another due to different historical patterns, social norms, or legal requirements.

Selbst et al. (2019) provide a compelling example in their research on "abstraction traps" in fair ML. They demonstrate how fairness implementations fail when they abstract away critical social and historical contexts. For instance, a "fair" hiring algorithm in the United States might require different considerations than one in India due to different historical discrimination patterns, legal frameworks, and social norms around protected attributes. Similarly, fairness in healthcare prediction has different requirements than fairness in criminal justice due to domain-specific factors like appropriate ground truth definitions and consequence asymmetries (Selbst et al., 2019).

For our Fairness Definition Selection Tool, this context dependency necessitates developing a structured approach for analyzing application domains to identify relevant historical patterns, legal requirements, domain-specific considerations, and cultural factors that should inform fairness definition selection. This ensures that fairness implementations respond to the specific challenges of each application context rather than applying one-size-fits-all solutions.

Impossibility Theorems and Inherent Trade-offs

Mathematical impossibility theorems in fairness demonstrate that multiple desirable fairness criteria cannot be simultaneously satisfied in most real-world scenarios. These formal results establish inherent trade-offs between competing fairness definitions, requiring explicit choices rather than assuming all fairness goals can be achieved simultaneously.

This concept connects directly to the plurality of philosophical perspectives and stakeholder goals, providing mathematical formalization of why these different perspectives cannot be fully reconciled. It shows that the challenge of fairness implementation is not merely technical but requires normative judgments about which fairness properties to prioritize in specific contexts.

Kleinberg, Mullainathan, and Raghavan (2016) proved that three desirable fairness properties—calibration, balance for the positive class, and balance for the negative class—cannot all be simultaneously satisfied except in trivial or exceptional cases. This means that system designers must inevitably prioritize some fairness properties over others, making choices that align with application-specific priorities (Kleinberg, Mullainathan, & Raghavan, 2016).

For our Fairness Definition Selection Tool, these impossibility results necessitate developing explicit approaches for navigating trade-offs between competing fairness definitions. The framework will need to help practitioners identify which combinations of fairness properties are mathematically incompatible, evaluate the relative importance of these properties in specific contexts, and document the rationale for prioritization decisions. This ensures that fairness implementations make deliberate, informed choices about inevitable trade-offs rather than pursuing contradictory objectives.

Domain Modeling Perspective



From a domain modeling perspective, fairness concepts map to different components of ML systems:

  • Problem Formulation: Philosophical perspectives influence how problems are framed and what is considered the ideal target outcome for prediction.
  • Data Representation: Context-specific fairness considerations determine which variables are appropriate to include and how they should be encoded.
  • Algorithm Selection: Different fairness definitions require different algorithmic approaches, from pre-processing to in-processing to post-processing.
  • Evaluation Framework: Stakeholder perspectives inform which metrics are prioritized and how different fairness measures are weighted.
  • Deployment Context: Cultural and domain-specific factors shape how systems are integrated into broader sociotechnical environments.

These domain components represent decision points where conceptual fairness considerations must be translated into technical implementation choices. The Fairness Definition Selection Tool will need to provide guidance for each of these components, ensuring that fairness considerations are integrated throughout the ML lifecycle rather than treated as an afterthought.

Conceptual Clarification

To clarify how these abstract fairness concepts apply in practice, consider these analogies:

  • Fairness definitions are like navigational instruments – a compass points to magnetic north, a GPS uses true north, and stellar navigation uses celestial positioning. Each provides valid directional guidance but might lead you to slightly different destinations. Similarly, different fairness definitions offer valid but potentially conflicting guidance on what constitutes a "fair" outcome, requiring context-specific selection rather than universal application.
  • Navigating fairness trade-offs is like managing an investment portfolio, where you cannot simultaneously maximize returns, minimize risk, and maintain perfect liquidity. Just as financial advisors help clients balance these competing objectives based on their specific goals and risk tolerance, fairness frameworks help practitioners balance competing fairness definitions based on application context and stakeholder priorities.

Intersectionality Consideration

Traditional fairness definitions often examine protected attributes independently, failing to capture how multiple forms of discrimination interact at demographic intersections. Intersectionality, a term coined by legal scholar Kimberlé Crenshaw (1989), emphasizes that individuals at the intersection of multiple marginalized identities often face unique forms of discrimination that single-axis analysis misses.

Implementing intersectional fairness considerations presents challenges including:

  1. Methodological complexity in modeling multiple, interacting protected attributes;
  2. Statistical challenges with smaller sample sizes at demographic intersections; and
  3. Computational difficulties in analyzing all possible demographic subgroups.

However, as Buolamwini and Gebru (2018) demonstrated in their Gender Shades research, systems that appear fair when analyzed along single axes (e.g., gender or skin tone separately) may show significant disparities at intersections (e.g., dark-skinned women). Their work found that commercial facial analysis algorithms had error rates of up to 34.7% for dark-skinned women compared to 0.8% for light-skinned men – a disparity that would remain hidden without intersectional analysis (Buolamwini & Gebru, 2018).

For our Fairness Definition Selection Tool, incorporating intersectionality requires developing approaches that extend fairness definitions to address multiple, overlapping protected attributes simultaneously. This includes methods for managing statistical challenges with smaller intersection sample sizes and strategies for prioritizing which intersections to focus on when comprehensive analysis is computationally infeasible.

3. Practical Considerations



Implementation Framework

To systematically apply these conceptual fairness foundations to ML development, follow this structured methodology:

  1. Context Analysis:
     • Document the specific domain, application, and deployment context.
     • Identify historical discrimination patterns relevant to your application.
     • Map relevant legal and regulatory requirements.
     • Analyze cultural contexts that might affect fairness expectations.
  2. Stakeholder Mapping:
     • Identify all stakeholders affected by or involved with the system.
     • Document their perspectives on fairness and potential metrics they might prioritize.
     • Analyze power dynamics between stakeholders to identify whose perspectives might be underrepresented.
     • Develop engagement strategies for incorporating diverse viewpoints.
  3. Fairness Definition Exploration:
     • Enumerate potential fairness definitions relevant to your context.
     • Map each definition to its philosophical foundations.
     • Identify mathematical relationships and potential trade-offs between definitions.
     • Assess alignment between definitions and stakeholder priorities.
  4. Contextual Prioritization:
     • Develop explicit criteria for prioritizing among competing fairness definitions.
     • Document the rationale for selected priorities.
     • Create a decision framework for navigating identified trade-offs.
     • Establish processes for revisiting prioritization as context evolves.

This methodology integrates with standard ML workflows by extending requirements gathering and problem formulation to explicitly incorporate fairness considerations before technical implementation begins. The approach ensures that subsequent technical choices are guided by clear conceptual foundations rather than implicit assumptions.
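
Teams that want to make this documentation machine-readable sometimes capture it in a lightweight structured record. The sketch below is one hypothetical way to do so with Python dataclasses; the field names simply mirror the four steps above and are not a required schema.

```python
from dataclasses import dataclass, field

@dataclass
class FairnessContextRecord:
    """Structured documentation of the four-step methodology above (illustrative only)."""
    domain: str
    historical_patterns: list[str] = field(default_factory=list)    # Context Analysis
    legal_requirements: list[str] = field(default_factory=list)
    stakeholders: dict[str, str] = field(default_factory=dict)      # Stakeholder Mapping
    candidate_definitions: list[str] = field(default_factory=list)  # Definition Exploration
    prioritization_rationale: str = ""                              # Contextual Prioritization

# Hypothetical example entry for a lending application
record = FairnessContextRecord(
    domain="consumer lending",
    historical_patterns=["redlining in mortgage markets"],
    legal_requirements=["ECOA disparate impact standards"],
    stakeholders={"applicants": "equal opportunity", "regulator": "disparate impact"},
    candidate_definitions=["demographic parity", "equal opportunity"],
    prioritization_rationale="Equal opportunity prioritized; parity monitored as a guardrail.",
)
print(record)
```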

Implementation Challenges

When applying these conceptual frameworks, practitioners commonly encounter the following challenges:

  1. Stakeholder Disagreement: Different stakeholders often have fundamentally different perspectives on what constitutes fairness in a specific context. Address this by:
     • Creating structured processes for surfacing and documenting different perspectives.
     • Developing clear communication frameworks for explaining trade-offs to non-technical stakeholders.
     • Establishing decision frameworks for prioritizing competing concerns when consensus is not possible.
  2. Translating Concepts to Metrics: Abstract fairness concepts must be translated into specific, measurable properties. Address this challenge by:
     • Creating explicit mappings between conceptual principles and mathematical definitions.
     • Developing validation approaches to verify that metrics actually capture intended concepts.
     • Establishing contextual thresholds for what level of disparity is acceptable.

Successfully navigating these challenges requires both technical expertise in fairness metrics and domain knowledge about the specific context of application. It also requires strong communication skills to explain complex trade-offs to diverse stakeholders and an organizational commitment to deliberate fairness implementation rather than defaulting to technically convenient approaches.

Evaluation Approach

To assess whether your conceptual fairness approach is effective, implement these evaluation strategies:

  1. Stakeholder Satisfaction Assessment:
     • Engage diverse stakeholders to evaluate whether selected fairness definitions align with their concerns.
     • Document areas of agreement and persistent tensions.
     • Establish acceptable thresholds for stakeholder alignment.
  2. Context Alignment Evaluation:
     • Assess whether selected fairness definitions address identified historical patterns.
     • Verify compliance with relevant legal and regulatory requirements.
     • Evaluate compatibility with domain-specific constraints and objectives.
  3. Trade-off Documentation:
     • Explicitly document identified trade-offs between competing fairness definitions.
     • Quantify impacts of prioritization decisions on different stakeholder groups.
     • Create visual representations of the fairness-performance frontier to illustrate trade-offs.

These evaluation approaches should be integrated into your organization's broader fairness assessment framework, providing the conceptual foundation for more technical evaluations in subsequent development stages.

4. Case Study: College Admissions Algorithm



Scenario Context

A prestigious university is developing a machine learning algorithm to assist in undergraduate admissions decisions. The system will analyze applicant data—including academic performance, extracurricular activities, recommendation letters, and demographic information—to predict "success potential," a composite metric combining expected graduation rates, academic performance, and post-graduation outcomes.

Key stakeholders include university administrators concerned with institutional outcomes and reputation, admissions officers who will use the system alongside human judgment, prospective students from diverse backgrounds, and regulatory bodies focused on educational equity. Fairness is particularly critical in this domain due to historical patterns of educational discrimination and the life-altering impact of admissions decisions on individual applicants.

Problem Analysis

Applying core fairness concepts to this scenario reveals several conceptual challenges:

  1. Philosophical Tensions: Different stakeholders bring distinct philosophical perspectives to the admissions process. University administrators may emphasize utilitarian goals of maximizing overall student success and institutional outcomes. Prospective students may prioritize procedural fairness and equal opportunity based on relevant qualifications. Community advocates might focus on egalitarian outcomes that increase representation of historically marginalized groups.
  2. Contextual Complexities: The admissions context includes specific historical patterns of discrimination in education, legal frameworks such as affirmative action policies and anti-discrimination laws, and domain-specific considerations about what constitutes relevant qualification factors versus irrelevant biasing influences.
  3. Stakeholder Conflicts: Tension exists between current applicants who want decisions based solely on individual merit, community advocates concerned with historical exclusion of certain groups, and institutional interests in both diversity and academic excellence. No single fairness definition can fully satisfy all these stakeholder perspectives.
  4. Intersectional Considerations: Applicants at intersections of multiple identity dimensions (e.g., low-income students of color from rural areas) may face unique barriers that single-axis fairness analyses would miss. The admissions algorithm must consider how different factors interact rather than treating demographic attributes independently.
  5. Impossibility Constraints: Mathematical impossibility theorems demonstrate that the algorithm cannot simultaneously achieve perfect representation parity across all demographic groups, identical true positive rates for qualified applicants, and equal calibration of success predictions—forcing explicit prioritization decisions.

Solution Implementation



To address these conceptual challenges, the university implemented a structured approach:

  1. For Philosophical Tensions, they:
     • Conducted a philosophical analysis of different fairness conceptions in educational contexts.
     • Documented explicit values statements about the university's commitments to both excellence and equity.
     • Developed a hybrid framework that incorporated elements of multiple philosophical traditions—emphasizing equal opportunity for similarly qualified applicants while also considering representational goals.
  2. For Contextual Complexities, they:
     • Analyzed historical admissions data to identify patterns of advantage and disadvantage.
     • Mapped relevant legal requirements, including specific guidance on the permissible consideration of protected attributes.
     • Developed context-specific fairness definitions that reflected educational domain knowledge about relevant qualification factors.
  3. For Stakeholder Conflicts, they:
     • Conducted extensive stakeholder engagement through focus groups, surveys, and deliberative processes.
     • Created a multi-stakeholder advisory board with representatives from diverse perspectives.
     • Developed a weighted framework that balanced different stakeholder priorities while giving special consideration to those most affected by potential biases.
  4. For Intersectional Considerations, they:
     • Conducted specific analyses of outcomes for applicants at the intersection of multiple marginalized identities.
     • Implemented specialized review processes for applicants from intersectional backgrounds with limited historical representation.
     • Developed composite features that captured how multiple disadvantage factors might interact.
  5. For Impossibility Constraints, they:
     • Created explicit documentation of identified trade-offs between competing fairness definitions.
     • Established a contextual prioritization that emphasized equal opportunity metrics while setting minimum thresholds for representation metrics.
     • Implemented a monitoring system that tracked multiple fairness metrics to ensure that no single dimension was severely compromised.

Throughout implementation, they maintained clear documentation of their conceptual framework, the rationale behind prioritization decisions, and the processes for revisiting these decisions as contexts evolved.

Outcomes and Lessons

The implementation resulted in several measurable improvements:

  • Stakeholder satisfaction increased by 45% compared to the previous admissions process, with particularly significant improvements among historically underrepresented applicant groups.
  • The explicit documentation of trade-offs reduced internal disputes about fairness approaches by 65%, creating more productive conversations about prioritization.
  • The admissions committee reported that the conceptual clarity about different fairness definitions improved their ability to explain decisions to applicants by 78%.

Key challenges remained, including persistent tensions between individual and group conceptions of fairness and the difficulty of establishing ground truth for the "success potential" target variable without perpetuating historical biases.

The most generalizable lessons included:

  1. The critical importance of conducting philosophical and stakeholder analysis before implementing technical fairness measures.
  2. The value of explicit documentation of trade-offs and prioritization decisions in navigating contentious fairness questions.
  3. The effectiveness of multi-metric evaluation frameworks that track multiple fairness dimensions rather than optimizing for a single definition.

These insights directly informed the development of the Fairness Definition Selection Tool, particularly in creating structured approaches for stakeholder analysis, contextual prioritization, and trade-off documentation.

5. Frequently Asked Questions

FAQ 1: Balancing Different Stakeholder Perspectives

Q: How should we navigate situations where different stakeholders have fundamentally incompatible conceptions of fairness?
A: When stakeholders have incompatible fairness definitions, implement a structured prioritization process rather than seeking perfect consensus. First, clearly document each stakeholder's perspective and map them to specific fairness definitions. Then, analyze power dynamics to ensure that historically marginalized voices are not overlooked. Next, identify any minimal requirements that all perspectives consider necessary, even if insufficient. Finally, make explicit prioritization decisions based on application-specific factors such as legal requirements, ethical principles relevant to your domain, and the comparative impacts of different approaches on affected groups. Document your reasoning transparently so stakeholders understand why certain perspectives were given greater weight, and implement monitoring across multiple fairness metrics to ensure that deprioritized concerns do not fall below acceptable thresholds.

FAQ 2: Determining Which Fairness Definition Is "Right"

Q: Is there a way to determine which fairness definition is objectively "right" for a specific application?
A: No single fairness definition is objectively "right" across all contexts. The appropriate definition depends on domain-specific factors, historical patterns, stakeholder perspectives, and legal requirements. Rather than seeking a universally correct definition, focus on a context-appropriate selection process. Analyze your specific application domain, historical discrimination patterns, and stakeholder priorities. Map these considerations to philosophical fairness traditions and their corresponding mathematical definitions. Document the inevitable trade-offs between competing definitions and make explicit, reasoned choices about which aspects of fairness to prioritize in your specific context. The "right" definition is one that (1) addresses the specific fairness challenges most relevant to your application, (2) aligns with stakeholder values and legal requirements, and (3) acknowledges and mitigates the most significant potential harms to affected individuals.

FAQ 3: Intersectional Fairness Analysis in Loan Approval Systems

Q: In developing a loan approval system, stakeholders disagree about appropriate fairness metrics. The development team proposes implementing intersectional fairness analysis. Which statement most accurately describes the impact of this approach according to current research?
A: Intersectional analysis will reveal potentially hidden fairness disparities at demographic intersections that single-attribute analysis might miss, while still requiring explicit trade-off decisions between competing fairness definitions.

  • Option 1 is incorrect because intersectional analysis does not resolve stakeholder disagreements by identifying a universal fairness definition.
  • Option 2 is incorrect because intersectional analysis does not automatically ensure that the model satisfies demographic parity across all subgroup combinations.
  • Option 4 is incorrect because removing all protected attributes and their proxies does not eliminate bias but rather masks it.
  • Option 3, the statement given above, correctly characterizes intersectional analysis as a comprehensive evaluation approach that uncovers important disparities while acknowledging that explicit trade-off decisions between fairness definitions remain necessary (Buolamwini & Gebru, 2018; Kearns et al., 2018; Foulds et al., 2020).

6. Summary and Next Steps



Key Takeaways

The conceptual foundations of fairness provide the essential basis for all subsequent technical implementations. The key concepts from this Unit include:

  • Philosophical perspectives on fairness derive from different ethical traditions and directly inform how fairness is operationalized in AI systems.
  • Stakeholder perspectives often differ fundamentally on what constitutes fair treatment, requiring explicit analysis and prioritization.
  • Fairness is context-dependent, varying based on domain-specific factors, cultural contexts, and historical patterns.
  • Impossibility theorems demonstrate that multiple desirable fairness criteria cannot be simultaneously satisfied, requiring explicit trade-off decisions.

These concepts directly address our guiding questions by explaining why different stakeholders have divergent fairness conceptions and by providing a structured approach for translating philosophical principles into specific fairness definitions appropriate for particular contexts.

Application Guidance

To apply these concepts in your practical work:

  1. Begin any fairness implementation with explicit context analysis and stakeholder mapping before selecting specific metrics.
  2. Document the philosophical foundations of the different fairness definitions you are considering and their alignment with your application context.
  3. Identify and explicitly acknowledge trade-offs between competing fairness definitions rather than assuming that all desired properties can be achieved simultaneously.
  4. Implement structured decision processes for navigating these trade-offs based on contextual priorities.

For organizations new to fairness considerations, start by focusing on comprehensive stakeholder engagement and clear documentation of different perspectives before attempting technical implementations. This foundation will inform all subsequent technical choices and help avoid costly rework when implicit assumptions about fairness prove problematic.

Looking Ahead

In the next Unit, we will build on this conceptual foundation by examining the mathematical formulations of fairness. You will learn how abstract philosophical principles translate into precise mathematical definitions that can be empirically measured and optimized. These formulations will provide the technical framework needed to implement the conceptual principles we have explored here.

The conceptual foundations we have established will guide which mathematical formulations are appropriate for specific contexts and how to navigate the inevitable trade-offs between competing fairness definitions. This connection between philosophical principles and mathematical implementation is essential for developing AI systems that achieve their intended fairness goals rather than optimizing for misaligned metrics.


References

Barocas, S., Hardt, M., & Narayanan, A. (2020). Fairness and machine learning: Limitations and opportunities. Retrieved from https://fairmlbook.org/

Binns, R. (2018). Fairness in machine learning: Lessons from political philosophy. In Proceedings of the 1st Conference on Fairness, Accountability, and Transparency (pp. 149–159). PMLR.

Buolamwini, J., & Gebru, T. (2018). Gender shades: Intersectional accuracy disparities in commercial gender classification. In Proceedings of the 1st Conference on Fairness, Accountability, and Transparency (pp. 77–91). PMLR.

Chouldechova, A. (2017). Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data, 5(2), 153–163.

Crenshaw, K. (1989). Demarginalizing the intersection of race and sex: A Black feminist critique of antidiscrimination doctrine, feminist theory and antiracist politics. University of Chicago Legal Forum, 1989(1), 139–167.

Foulds, J. R., Islam, R., Keya, K. N., & Pan, S. (2020). An intersectional definition of fairness. In 2020 IEEE 36th International Conference on Data Engineering (ICDE) (pp. 1918–1921). IEEE.

Kearns, M., Neel, S., Roth, A., & Wu, Z. S. (2018). Preventing fairness gerrymandering: Auditing and learning for subgroup fairness. In International Conference on Machine Learning (pp. 2564–2572). PMLR.

Kleinberg, J., Mullainathan, S., & Raghavan, M. (2016). Inherent trade-offs in the fair determination of risk scores. arXiv preprint arXiv:1609.05807.

Mitchell, S., Potash, E., Barocas, S., D'Amour, A., & Lum, K. (2021). Algorithmic fairness: Choices, assumptions, and definitions. Annual Review of Statistics and Its Application, 8, 141–163.

Selbst, A. D., Boyd, D., Friedler, S. A., Venkatasubramanian, S., & Vertesi, J. (2019). Fairness and abstraction in sociotechnical systems. In Proceedings of the Conference on Fairness, Accountability, and Transparency (pp. 59–68).

Unit 2

Unit 2: Mathematical Formulations of Fairness

1. Conceptual Foundation and Relevance

Guiding Questions

  • Question 1: How can we translate abstract notions of fairness into precise mathematical definitions that can be measured and optimized in AI systems?
  • Question 2: Which mathematical fairness definitions are most appropriate for different application contexts, and what are their implications for model development and evaluation?

Conceptual Context

The translation of fairness concepts into mathematical formulations represents a critical bridge between abstract ethical principles and concrete implementation in AI systems. While philosophical discussions of fairness provide essential normative foundations, mathematical definitions enable precise measurement, evaluation, and optimization of fairness properties in practical machine learning applications.

This mathematical precision is vital because it transforms fairness from an aspirational goal into a quantifiable property that can be systematically incorporated into model development and evaluation. Without precise definitions, fairness remains subjective and difficult to verify, creating ambiguity about whether systems actually achieve intended fairness objectives rather than merely claiming to do so.

Building directly on the conceptual foundations established in Unit 1, this Unit examines how different ethical perspectives on fairness translate into distinct mathematical formulations with specific technical properties and limitations. The mathematical definitions you will learn here will directly inform the Fairness Definition Selection Tool we will develop in Unit 5, providing the formal foundation for matching specific fairness criteria to appropriate application contexts.

2. Key Concepts

Group Fairness Metrics

Group fairness metrics evaluate whether an AI system treats different demographic groups similarly by comparing statistical properties of model predictions across these groups. These metrics are crucial for AI fairness because they directly address potential disparities that could affect entire communities, providing quantifiable measures of demographic fairness that align with anti-discrimination laws and many ethical frameworks.

Group fairness interacts with other fairness concepts through inherent tensions and trade-offs. Most notably, as we will explore later, group-level parity often conflicts with individual fairness notions, creating fundamental tensions in fairness implementation. Additionally, different group fairness metrics themselves can conflict with each other, requiring contextual prioritization based on specific application needs.

A concrete application comes from hiring algorithms, where demographic parity would require equal selection rates across demographic groups regardless of differences in underlying qualification rates. Hardt, Price, and Srebro (2016) demonstrated that for FICO credit scores, enforcing equal error rates across racial groups (for example, equal true positive rates under equal opportunity) would require different score thresholds for different groups—a counterintuitive finding that highlights the complex technical requirements for achieving certain fairness definitions (Hardt, Price, & Srebro, 2016).

For the Fairness Definition Selection Tool we will develop in Unit 5, understanding group fairness metrics is essential because they provide the most widely implemented fairness criteria across industries. These metrics directly inform which mathematical properties should be measured for specific fairness objectives, guiding both metric implementation and potential mitigation strategies.

The primary group fairness metrics include:

  1. Demographic Parity (Statistical Parity): This metric requires that the probability of a positive prediction is equal across all demographic groups: P(Ŷ = 1 | A = a) = P(Ŷ = 1 | A = b), where Ŷ is the predicted outcome and A represents the protected attribute.
  2. Equal Opportunity: This requires equal true positive rates across groups: P(Ŷ = 1 | Y = 1, A = a) = P(Ŷ = 1 | Y = 1, A = b), where Y represents the true outcome.
  3. Equalized Odds: This extends equal opportunity by requiring equal true positive rates and equal false positive rates across groups: P(Ŷ = 1 | Y = y, A = a) = P(Ŷ = 1 | Y = y, A = b) for y ∈ {0, 1}.
  4. Predictive Parity: This requires equal positive predictive values across groups: P(Y = 1 | Ŷ = 1, A = a) = P(Y = 1 | Ŷ = 1, A = b).

Each of these metrics embodies different fairness principles. Demographic parity ensures representation regardless of qualification; equal opportunity focuses on giving qualified individuals similar chances; equalized odds prevents both types of errors from disproportionately affecting certain groups; and predictive parity ensures that predictions have consistent meaning across groups.
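
To make these formulations concrete, the following sketch computes the quantities behind each metric from binary predictions, true outcomes, and a protected attribute. It is a minimal illustration using NumPy; the array names (y_true, y_pred, group) are placeholders rather than part of any particular library.

```python
import numpy as np

def group_fairness_report(y_true, y_pred, group):
    """Compute common group fairness quantities for each protected group.

    y_true, y_pred : binary arrays (0/1) of true and predicted labels
    group          : array of group identifiers (e.g., 'a', 'b')
    """
    report = {}
    for g in np.unique(group):
        m = group == g
        yt, yp = y_true[m], y_pred[m]
        report[g] = {
            # Demographic parity compares this selection rate across groups
            "selection_rate": yp.mean(),
            # Equal opportunity compares this true positive rate
            "tpr": yp[yt == 1].mean() if (yt == 1).any() else np.nan,
            # Equalized odds additionally compares this false positive rate
            "fpr": yp[yt == 0].mean() if (yt == 0).any() else np.nan,
            # Predictive parity compares this positive predictive value
            "ppv": yt[yp == 1].mean() if (yp == 1).any() else np.nan,
        }
    return report

# Toy example: fairness is assessed by comparing the rates between groups
y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])
y_pred = np.array([1, 0, 1, 0, 1, 0, 1, 0])
group  = np.array(["a", "a", "a", "a", "b", "b", "b", "b"])
print(group_fairness_report(y_true, y_pred, group))
```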

Individual Fairness Metrics

Individual fairness metrics evaluate whether an AI system treats similar individuals similarly, regardless of their demographic group membership. This concept is fundamental to AI fairness because it addresses the core ethical principle that people who are similar in relevant aspects deserve similar treatment, regardless of protected attributes like race or gender.

Individual fairness connects to group fairness through a complex relationship—while they share the goal of preventing discrimination, they often suggest different, sometimes contradictory, approaches. As Dwork et al. (2012) established in their seminal work, individual fairness can be satisfied while group fairness is violated, and vice versa, highlighting the need for careful consideration of which notion best fits specific contexts (Dwork et al., 2012).

In practical applications, individual fairness might require that loan applicants with similar financial profiles receive similar credit decisions regardless of demographic attributes. For example, if two individuals have nearly identical income, credit history, and debt-to-income ratios, an individually fair algorithm would give them similar loan terms even if they belong to different demographic groups.

For our Fairness Definition Selection Tool, understanding individual fairness is critical because it provides an alternative approach when group fairness definitions might be inappropriate or insufficient. Some applications may prioritize consistency across similar cases rather than statistical parity across groups, particularly in scenarios where treating individuals based on their unique profiles is ethically appropriate.

The primary individual fairness formulations include:

  1. Similarity-Based Fairness (Dwork et al., 2012): This formulation requires that similar individuals receive similar predictions: dᵧ(Ŷ(xᵢ), Ŷ(xⱼ)) ≤ L · dₓ(xᵢ, xⱼ), where dₓ is a similarity metric in the input space, dᵧ is a similarity metric in the output space, and L is a Lipschitz constant.
  2. Fairness Through Awareness: This approach involves explicitly defining a task-specific similarity metric that captures which features should be considered for determining similarity while being "blind" to protected attributes.
  3. Counterfactual Fairness: While covered more extensively in the next section, this approach bridges individual and group perspectives by asking whether predictions would change if an individual's protected attribute were different.

The key challenge with individual fairness lies in defining appropriate similarity metrics—determining what makes individuals "similar" for a specific task is often context dependent and normatively loaded, requiring domain knowledge and ethical reasoning rather than purely technical solutions.
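
As a rough illustration of the similarity-based formulation, the sketch below audits a scoring function against the Lipschitz condition on randomly sampled pairs of individuals. The Euclidean distance and the constant L are stand-in choices for this example; in practice the similarity metric dₓ must be designed with domain knowledge, which is exactly the normative challenge noted above.

```python
import numpy as np

def lipschitz_violations(score_fn, X, L=1.0, n_pairs=1000, rng=None):
    """Estimate how often |score(x_i) - score(x_j)| > L * d(x_i, x_j)
    on randomly sampled pairs, using Euclidean distance as a stand-in
    similarity metric (a real audit would use a task-specific d_X)."""
    rng = rng or np.random.default_rng(0)
    i = rng.integers(len(X), size=n_pairs)
    j = rng.integers(len(X), size=n_pairs)
    d_x = np.linalg.norm(X[i] - X[j], axis=1)          # input-space distances
    d_y = np.abs(score_fn(X[i]) - score_fn(X[j]))      # output-space distances
    return np.mean(d_y > L * d_x + 1e-12)              # fraction of violating pairs

# Example with a hypothetical linear scoring function
X = np.random.default_rng(1).normal(size=(500, 4))
w = np.array([0.5, -0.2, 0.1, 0.3])
score = lambda x: x @ w
print("violation rate:", lipschitz_violations(score, X, L=0.5))
```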

Counterfactual Fairness

Counterfactual fairness asks whether an AI system would make the same prediction for an individual in a hypothetical world where their protected attribute were different but all causally independent characteristics remained the same. This approach is crucial for AI fairness because it addresses the fundamental question: "Would this person receive the same treatment if they belonged to a different demographic group, all else being equal?"

Counterfactual fairness connects group and individual perspectives by examining how protected attributes influence predictions at the individual level while accounting for causal relationships that might justify some group differences. This bridge between perspectives makes it particularly valuable for comprehensive fairness analysis.

Kusner, Loftus, Russell, and Silva (2017) provide a concrete application in their seminal paper, examining how gender influences college admissions. They demonstrated that a naively "fair" model might still perpetuate historical biases if it does not account for causal relationships—for instance, if historical gender discrimination affected which extracurricular activities students participated in, and those activities influence admissions decisions (Kusner, Loftus, Russell, & Silva, 2017).

For our Fairness Definition Selection Tool, counterfactual fairness provides a powerful perspective that aligns with many intuitive notions of fairness while addressing limitations of both group and individual approaches. It enables more nuanced fairness assessments that consider causal mechanisms rather than just statistical patterns.

Formally, counterfactual fairness requires that: P(Ŷ₍A←a₎(U) = y | X = x, A = a) = P(Ŷ₍A←a′₎(U) = y | X = x, A = a), where:

  • Ŷ₍A←a₎(U) represents the prediction in a counterfactual world where A is set to value a
  • U represents exogenous variables (background factors)
  • X represents observed variables
  • A is the protected attribute

This definition requires that the distribution of the prediction Ŷ for an individual with features X = x and protected attribute A = a be identical to the distribution it would have in a counterfactual world where their protected attribute is changed to A = a′ but all causally independent factors remain the same.

Implementing counterfactual fairness requires:

  1. Developing a causal model of how protected attributes influence other variables.
  2. Identifying which causal pathways are legitimate versus problematic.
  3. Creating predictions that are invariant to changes in protected attributes through problematic pathways.
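
The sketch below illustrates these steps on a deliberately simple, hypothetical linear causal model in which a protected attribute A influences an observed feature X. It follows the abduction–action–prediction recipe: infer the individual's background noise, intervene on A, regenerate X, and compare predictions. The structural equations and coefficients are assumptions for illustration, not a general-purpose implementation.

```python
# Assumed structural causal model (illustrative only):
#   X = 2.0 * A + U        (U is exogenous background noise)
#   score = 1.5 * X        (a predictor trained on X alone)
ALPHA, BETA = 2.0, 1.5

def predict(x):
    return BETA * x

def counterfactual_prediction(x_obs, a_obs, a_new):
    """Abduction: recover U from the observed X and A.
    Action: set A to a_new. Prediction: regenerate X and re-score."""
    u = x_obs - ALPHA * a_obs          # abduction
    x_cf = ALPHA * a_new + u           # action + regeneration of X
    return predict(x_cf)               # prediction under the intervention

# An individual observed with A = 1 and X = 3.1
x_obs, a_obs = 3.1, 1
factual = predict(x_obs)
counterfactual = counterfactual_prediction(x_obs, a_obs, a_new=0)
print(factual, counterfactual)
# Unequal outputs => this predictor is not counterfactually fair, because X lies
# on a pathway from A; a counterfactually fair alternative could score on U alone.
```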

Impossibility Theorems and Fairness Trade-offs

Impossibility theorems demonstrate that multiple desirable fairness criteria cannot be simultaneously satisfied except in highly restrictive or trivial scenarios. These theorems are fundamental to AI fairness because they establish that fairness involves inherent trade-offs rather than perfect solutions, requiring context-specific prioritization of competing fairness objectives.

These impossibility results interact with all previously discussed fairness notions by establishing their fundamental incompatibility. Understanding these limitations prevents the pursuit of unachievable "perfect fairness" and redirects focus toward appropriate trade-offs based on application-specific priorities.

Kleinberg, Mullainathan, and Raghavan (2016) provided a landmark impossibility result, proving that three desirable fairness properties cannot be simultaneously satisfied: calibration within groups, balance for the positive class, and balance for the negative class. Their work demonstrates that, except in special cases where features perfectly predict outcomes or protected attributes provide no predictive value, these fairness criteria will conflict (Kleinberg, Mullainathan, & Raghavan, 2016).

For our Fairness Definition Selection Tool, these impossibility theorems are essential because they establish that selecting fairness definitions involves fundamental trade-offs rather than finding a universally "best" definition. The framework must help practitioners navigate these trade-offs based on domain-specific priorities rather than suggesting that all fairness criteria can be simultaneously maximized.

The primary impossibility theorems include:

  1. Kleinberg et al. (2016): They proved that the following three criteria cannot be simultaneously satisfied (except in trivial cases):
     • Calibration: The probability estimates mean the same thing regardless of group.
     • Balance for the positive class: People who get positive outcomes have similar average predicted scores regardless of group.
     • Balance for the negative class: People who get negative outcomes have similar average predicted scores regardless of group.
  2. Chouldechova (2017): This study demonstrated that when base rates differ between groups, it is impossible to simultaneously achieve:
     • Equal false positive rates,
     • Equal false negative rates, and
     • Equal positive predictive values.

These results establish that fairness involves fundamental value judgments about which criteria to prioritize in specific contexts. Technical solutions cannot eliminate these normative choices but can help make them explicit and rigorous.
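
The incompatibility identified by Chouldechova (2017) follows from the relationship FPR = p/(1−p) · (1−PPV)/PPV · (1−FNR), where p is a group's base rate. The short sketch below plugs in two hypothetical base rates to show that, with PPV and FNR held equal across groups, the false positive rates are forced apart; the specific numbers are illustrative only.

```python
def implied_fpr(base_rate, ppv, fnr):
    """FPR implied by the relationship in Chouldechova (2017):
    FPR = p/(1-p) * (1-PPV)/PPV * (1-FNR)."""
    p = base_rate
    return (p / (1 - p)) * ((1 - ppv) / ppv) * (1 - fnr)

# Hypothetical groups with different base rates but identical PPV and FNR
for group, p in [("group_a", 0.3), ("group_b", 0.5)]:
    print(group, round(implied_fpr(p, ppv=0.7, fnr=0.2), 3))
# Different outputs => equal PPV, equal FNR, and equal FPR cannot all hold
```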

Domain Modeling Perspective

From a domain modeling perspective, mathematical fairness definitions map to specific components of ML systems:

  • Problem Definition: Fairness definitions establish which properties the system should satisfy, directly influencing how the ML problem is framed.
  • Data Requirements: Different fairness definitions require specific data attributes to be measured and tracked, shaping data collection and preparation.
  • Algorithm Selection: Some fairness definitions are more easily implemented with certain algorithms, influencing model architecture choices.
  • Constraint Formulation: Fairness definitions translate into explicit constraints or regularization terms in optimization problems.
  • Evaluation Framework: Fairness definitions determine which metrics must be measured to assess system performance beyond accuracy.

This domain mapping helps to understand how fairness definitions integrate with different stages of ML development rather than viewing them as abstract mathematical concepts. The Fairness Definition Selection Tool will leverage this mapping to guide appropriate definition selection based on Project requirements and technical constraints.

Conceptual Clarification

To clarify these abstract mathematical concepts, consider the following analogies:

  • Group fairness metrics are similar to health inspection standards for restaurants in different neighborhoods. Just as health departments might check whether restaurants in all neighborhoods maintain similar hygiene standards regardless of neighborhood demographics, group fairness ensures that algorithmic systems maintain similar error rates or prediction distributions across demographic groups. The key insight is that we are examining aggregate performance at the group level rather than individual cases.
  • Individual fairness is similar to a manager evaluating employees based on a standardized rubric. The manager aims to give similar ratings to employees who demonstrate similar performance according to predefined criteria, regardless of their background. The challenge lies in creating a truly fair "rubric" (similarity metric) that captures relevant characteristics while excluding irrelevant ones—a task that involves both technical and normative judgments.
  • Impossibility theorems function like Project management constraints where you cannot simultaneously maximize speed, quality, and cost-efficiency. Just as a project manager must decide which constraints to prioritize based on Project goals, fairness implementation requires explicit choices about which fairness criteria to optimize based on application context. These theorems establish that trade-offs are inherent rather than reflections of inadequate implementation.

Intersectionality Consideration

Mathematical fairness definitions present unique challenges for intersectional analysis, where multiple protected attributes interact to create distinct patterns of advantage or disadvantage. Traditional fairness metrics often examine protected attributes independently, potentially masking issues that affect specific intersectional subgroups.

For example, a loan approval algorithm might appear fair when evaluated separately for gender and race (e.g., equal false negative rates across genders and across racial groups) but still discriminate against specific intersections, such as women of color. Buolamwini and Gebru (2018) demonstrated this in their landmark Gender Shades paper, showing that facial recognition systems achieved much lower accuracy for darker-skinned women than for other groups, even when aggregate performance across gender or across skin tone appeared acceptable (Buolamwini & Gebru, 2018).

To address these intersectional challenges in mathematical fairness definitions:

  1. Extend group fairness metrics to examine multiple protected attributes simultaneously rather than individually.
  2. Develop similarity metrics for individual fairness that capture intersectional effects.
  3. Create counterfactual models that can reason about multiple protected attributes changing simultaneously.
  4. Acknowledge that impossibility theorems become even more constraining when multiple protected attributes are considered.

The Fairness Definition Selection Tool must incorporate these intersectional considerations by guiding users toward definitions that preserve multidimensional demographic analysis rather than flattening to single-attribute evaluations.
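
One way to operationalize the first recommendation above is to compute metrics over the cross-product of protected attributes rather than each attribute alone. The pandas sketch below does this for selection rate and true positive rate; the column names are hypothetical, and in practice the small intersectional subgroups it surfaces call for confidence intervals or smoothing rather than raw point estimates.

```python
import pandas as pd

def intersectional_rates(df, protected_cols, y_true="y_true", y_pred="y_pred"):
    """Selection rate and TPR for every combination of protected attributes.
    Small subgroup counts ('n') signal where estimates are unreliable."""
    def summarize(g):
        positives = g[g[y_true] == 1]
        return pd.Series({
            "n": len(g),
            "selection_rate": g[y_pred].mean(),
            "tpr": positives[y_pred].mean() if len(positives) else float("nan"),
        })
    return df.groupby(protected_cols).apply(summarize)

# Hypothetical usage with assumed column names
df = pd.DataFrame({
    "gender": ["f", "f", "m", "m", "f", "m"],
    "race":   ["a", "b", "a", "b", "b", "a"],
    "y_true": [1, 0, 1, 1, 1, 0],
    "y_pred": [1, 0, 1, 0, 0, 1],
})
print(intersectional_rates(df, ["gender", "race"]))
```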

3. Practical Considerations

Implementation Framework

To effectively implement mathematical fairness definitions in practice, follow this structured methodology:

  1. Definition Selection:
     • Identify the fairness principle most appropriate for your application based on ethical requirements, legal constraints, and stakeholder priorities.
     • Determine whether group, individual, or counterfactual fairness (or a combination) best aligns with your fairness objectives.
     • Document your reasoning for selecting specific definitions to ensure transparency.
  2. Metric Translation:
     • Convert your selected fairness definitions into precise mathematical metrics.
     • For group metrics, determine which conditional probabilities to equalize across groups.
     • For individual metrics, define appropriate similarity measures in both input and output spaces.
     • For counterfactual metrics, develop causal models specifying how protected attributes influence other variables.
  3. Implementation Strategy:
     • Decide whether to implement fairness as constraints, regularization terms, or post-processing adjustments.
     • For group fairness, consider techniques like constraint-based optimization or threshold adjustments.
     • For individual fairness, explore representation learning approaches that preserve similarity relationships.
     • For counterfactual fairness, implement causal modeling techniques that remove problematic pathways.
  4. Measurement and Validation:
     • Establish thresholds for acceptable disparities based on application requirements.
     • Calculate confidence intervals to account for statistical uncertainty in fairness metrics.
     • Validate fairness properties on held-out data to ensure generalization.
     • Examine trade-offs between different fairness criteria and other performance objectives.
This framework integrates with standard ML workflows by extending model evaluation to explicitly include fairness metrics alongside traditional performance measures. While adding complexity to the development process, these steps ensure that fairness considerations are systematically addressed rather than treated as secondary concerns.
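To make the Metric Translation and Measurement steps above concrete, the sketch below turns one selected definition (equal opportunity) into a measurable quantity and attaches a bootstrap confidence interval to it. It assumes NumPy arrays of binary labels, binary predictions, and a 0/1 group indicator; the 95% interval and 1,000 resamples are illustrative defaults rather than prescribed values.

```python
import numpy as np

def equal_opportunity_gap(y_true, y_pred, group):
    """Difference in true positive rates between groups coded 0 and 1."""
    tprs = []
    for g in (0, 1):
        qualified = (group == g) & (y_true == 1)
        tprs.append(y_pred[qualified].mean())
    return tprs[1] - tprs[0]

def bootstrap_gap_interval(y_true, y_pred, group, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the equal opportunity gap.

    Small samples may need stratified resampling so that no resample
    ends up with an empty qualified subgroup.
    """
    rng = np.random.default_rng(seed)
    n = len(y_true)
    gaps = []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)
        gaps.append(equal_opportunity_gap(y_true[idx], y_pred[idx], group[idx]))
    return np.quantile(gaps, [alpha / 2, 1 - alpha / 2])
```

Reporting the interval alongside the point estimate keeps the validation step honest about statistical uncertainty rather than over-interpreting small observed disparities.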

Implementation Challenges

When implementing mathematical fairness definitions, practitioners commonly face these challenges:

  1. Definition Selection Complexity: Selecting appropriate fairness definitions requires balancing technical, ethical, and legal considerations. Address this challenge by:
     • Creating explicit documentation of priorities and constraints for your specific application.
     • Engaging diverse stakeholders to understand different perspectives on fairness requirements.
     • Developing scenario analyses that examine the implications of different fairness definitions.
  2. Communicating Mathematical Concepts to Non-Technical Stakeholders: Mathematical formulations can be difficult for decision-makers to understand. Address this by:
     • Developing intuitive visualizations that illustrate fairness properties without requiring a mathematical background.
     • Creating concrete examples showing how different definitions would affect real cases.
     • Framing fairness trade-offs in terms of business risks and values rather than technical terms.

Successfully implementing fairness definitions requires resources, including:

  • Data with protected attribute information for evaluation (potentially requiring additional collection or synthetic approaches if unavailable).
  • Computational resources for more complex optimization problems when implementing constraints.
  • Interdisciplinary expertise spanning technical implementation, legal requirements, and domain knowledge about potential bias patterns.

Evaluation Approach

To assess whether your fairness implementation is effective, apply these evaluation strategies:

  1. Disparity Metrics:
     • Calculate disparities between groups for your chosen fairness metrics (e.g., the difference in false positive rates).
     • Establish acceptable thresholds based on domain-specific requirements.
     • Compute statistical significance tests to determine whether observed disparities are meaningful.
  2. Trade-off Analysis:
     • Measure how optimizing for fairness affects other performance criteria.
     • Create Pareto curves showing the frontier of possible fairness-performance combinations.
     • Document explicit trade-off decisions and their rationales.
  3. Generalization Testing:
     • Evaluate fairness properties on multiple data splits to assess stability.
     • Test how fairness metrics change when evaluated on different subpopulations.
     • Examine robustness to dataset shifts or distribution changes.

These evaluation approaches should be integrated with your organization's broader model assessment framework, ensuring that fairness is evaluated with the same rigor as traditional performance metrics like accuracy or precision.
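One way to carry out the trade-off analysis described above is to sweep a single decision threshold and record both overall accuracy and a fairness disparity at each point, tracing an approximate fairness-performance frontier. The sketch below assumes model scores in [0, 1], binary labels, and a 0/1 group indicator; the threshold grid and metric choices are illustrative.

```python
import numpy as np

def fairness_performance_frontier(scores, y_true, group, thresholds=None):
    """Accuracy and demographic parity gap at each candidate threshold."""
    if thresholds is None:
        thresholds = np.linspace(0.05, 0.95, 19)
    frontier = []
    for t in thresholds:
        y_pred = (scores >= t).astype(int)
        accuracy = (y_pred == y_true).mean()
        gap = abs(y_pred[group == 1].mean() - y_pred[group == 0].mean())
        frontier.append((t, accuracy, gap))
    return frontier  # plot gap vs. accuracy to visualize the trade-off
```

Plotting the resulting points makes the Pareto frontier visible to stakeholders and turns threshold selection into a documented, explicit decision.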

4. Case Study: College Admissions Decision Support System

Scenario Context

A large public university is developing a machine learning–based decision support system to help admissions officers review applications more efficiently. The system will analyze application components—including GPA, standardized test scores, extracurricular activities, and essays—to predict student success metrics such as freshman-year GPA and graduation likelihood.

Key stakeholders include the admissions department seeking efficient application review, university leadership concerned about maintaining diversity, prospective students from various backgrounds, and regulatory bodies monitoring educational equity. The university has a strong commitment to increasing representation of underrepresented groups while maintaining academic standards, creating a challenging fairness context where different fairness definitions could lead to substantially different outcomes.

Problem Analysis

Applying core concepts from this Unit reveals several challenges in selecting appropriate fairness definitions for this admissions system:

  1. Group Fairness Considerations: Historical data show differential standardized test score distributions across racial and socioeconomic groups, reflecting systemic educational inequities rather than differences in student potential. Different group fairness metrics would have distinct implications:
     • Demographic parity would ensure similar admission rates across groups regardless of score distributions.
     • Equal opportunity would ensure that high-potential students have similar admission chances regardless of background.
     • Equalized odds would protect against both false positives and false negatives affecting groups differently.
  2. Individual Fairness Challenges: Defining similarity appropriately for admissions is complex—should two students with identical GPAs but different extracurricular opportunities be considered "similar"? The similarity metric must account for educational access disparities without completely discounting meaningful qualification differences.
  3. Counterfactual Fairness Analysis: Historical admissions data reveal causal relationships between socioeconomic status, access to test preparation resources, and standardized test scores. A counterfactual approach would need to model how these causal relationships operate to assess whether predictions would change if an applicant's background were different.
  4. Impossibility Trade-offs: The university cannot simultaneously achieve fully representative demographics, identical qualification thresholds across groups, and identical success prediction accuracy. Explicit choices about which fairness criteria to prioritize must be made.

From an intersectional perspective, the data show particularly complex patterns at the intersections of gender, race, and socioeconomic status. For example, low-income women from certain racial backgrounds show high graduation rates despite lower standardized scores, creating challenges for fairness definitions that treat these attributes independently.

Solution Implementation

To address these mathematical fairness challenges, the university implemented a structured approach:

  1. For Group Fairness, they:
     • Selected equal opportunity as their primary metric, focusing on ensuring similar admission rates for equally qualified students across demographic groups.
     • Implemented this through a constraint-based optimization approach that maintains error rate parity while maximizing predicted student success.
     • Explicitly rejected demographic parity as potentially conflicting with merit-based admissions principles established in case law.
  2. For Individual Fairness, they:
     • Developed a context-aware similarity metric that weights features differently based on educational access indicators.
     • Gave greater weight to achievements accomplished despite limited resources, effectively considering "distance traveled" rather than absolute position.
     • Implemented a fair representation approach that learned embeddings satisfying their contextual similarity requirements.
  3. For Counterfactual Fairness, they:
     • Created a causal model identifying which relationships between background and performance metrics were legitimate versus those reflecting structural barriers.
     • Applied this model to generate counterfactual predictions of success for applicants if their demographic backgrounds were different.
     • Used these counterfactual predictions as supplementary information for admissions officers rather than as automated decisions.
  4. For Navigating Impossibility Trade-offs, they:
     • Created an explicit prioritization of fairness definitions based on institutional values and legal requirements.
     • Documented where trade-offs were necessary and the rationale for specific compromises.
     • Developed different evaluation dashboards for different stakeholders, highlighting metrics most relevant to their concerns.

Throughout implementation, they maintained explicit focus on intersectional effects, ensuring that their fairness approaches addressed the specific challenges faced by applicants at the intersection of multiple marginalized identities.
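A common way to operationalize the constraint-based group fairness choice described above is post-processing with group-specific thresholds chosen so that true positive rates are approximately equal, in the spirit of Hardt, Price, and Srebro (2016). The sketch below is a simplified illustration of that idea under assumed score, label, and group arrays, not a description of the university's actual system; the target rate of 0.8 is arbitrary.

```python
import numpy as np

def thresholds_for_equal_opportunity(scores, y_true, group, target_tpr=0.8):
    """Choose one threshold per group so each group's TPR is close to target_tpr."""
    chosen = {}
    for g in np.unique(group):
        qualified_scores = scores[(group == g) & (y_true == 1)]
        # Thresholding at the (1 - target_tpr) quantile of qualified applicants'
        # scores admits roughly target_tpr of that group's qualified applicants.
        chosen[g] = np.quantile(qualified_scores, 1 - target_tpr)
    return chosen
```

Whether group-specific thresholds are acceptable is itself context dependent, which is why the case study pairs the technical adjustment with documented institutional reasoning about the trade-offs involved.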

Outcomes and Lessons

The implementation resulted in significant improvements across multiple dimensions:

  • Equal opportunity disparities decreased by 65% while predictive performance was maintained.
  • Admissions yield (the percentage of admitted students who enrolled) increased among underrepresented groups, suggesting improved targeting of qualified candidates.
  • Qualitative feedback from admissions officers indicated that the system provided valuable insights while respecting their judgment in complex cases.

Key challenges remained, including difficulties in explaining complex fairness trade-offs to some stakeholders and ongoing debates about the appropriate balance between different fairness definitions.

The most generalizable lessons included:

  1. The importance of explicitly selecting fairness definitions based on institutional values and legal requirements rather than defaulting to the most easily implemented metrics.
  2. The value of a mixed approach incorporating elements of group, individual, and counterfactual fairness rather than treating them as mutually exclusive.
  3. The critical role of transparent documentation of fairness trade-offs for building stakeholder trust.

These insights directly inform the development of the Fairness Definition Selection Tool, particularly in creating decision trees that guide contextually appropriate definition selection based on application requirements and constraints.

5. Frequently Asked Questions

FAQ 1: Selecting Appropriate Fairness Definitions

Q: How do I determine which mathematical fairness definition is most appropriate for my specific application? A: First, identify the fundamental fairness principle most relevant to your context. If equal treatment of demographic groups is paramount (especially in regulated domains like lending or hiring), group fairness metrics such as equal opportunity or equalized odds are typically appropriate. If treating similar individuals similarly is the priority, individual fairness with a carefully designed similarity metric may be more suitable. For causal understanding of bias mechanisms, counterfactual fairness provides deeper insights. Analyze potential harms in your application—would false positives or false negatives cause greater harm to vulnerable groups? This analysis should guide which specific metrics to prioritize. Finally, consider practical constraints such as data availability, computational resources, and explainability requirements, as these may limit which definitions are feasible to implement.

FAQ 2: Handling Impossibility Theorems

Q: If multiple fairness definitions cannot be simultaneously satisfied, how should I navigate these inherent trade-offs in practice? A: First, acknowledge that trade-offs are unavoidable rather than evidence of inadequate implementation. Identify the relative ethical priority of different fairness criteria in your specific context through stakeholder consultation and analysis of potential harms. Quantify the trade-off frontier by measuring how optimizing for one fairness definition affects others, creating a clear picture of the available options. Document your reasoning for prioritizing certain definitions over others, ensuring transparency about normative choices. Consider implementing "soft" versions of multiple definitions through regularization rather than strict constraints, allowing for balanced optimization. Finally, establish ongoing monitoring to reassess these trade-offs as societal norms, legal requirements, and technical capabilities evolve.

6. Summary and Next Steps

Key Takeaways

This Unit has explored how abstract fairness concepts translate into precise mathematical definitions that enable concrete measurement and implementation. The key concepts include:

  • Group fairness metrics such as demographic parity, equal opportunity, and equalized odds that evaluate whether different demographic groups receive similar treatment.
  • Individual fairness definitions that require similar treatment for similar individuals based on application-specific similarity metrics.
  • Counterfactual fairness approaches that examine whether predictions would change if protected attributes were different.
  • Impossibility theorems demonstrating that multiple fairness criteria cannot be simultaneously satisfied, requiring contextual prioritization.

These concepts directly address our guiding questions by showing how abstract fairness principles translate into specific mathematical properties that can be measured and optimized in AI systems and by establishing the context-specific nature of appropriate fairness definitions.

Application Guidance

To apply these concepts in your practical work:

  1. Begin by explicitly selecting mathematical fairness definitions appropriate for your application context before implementation begins.
  2. Document the rationale for your definition choices, including ethical principles, legal requirements, and practical constraints.
  3. Implement measurement approaches for multiple fairness definitions rather than focusing exclusively on a single metric.
  4. Create visualizations that illustrate trade-offs between different fairness definitions and traditional performance metrics.

For organizations new to fairness considerations, start by implementing basic group fairness metrics such as demographic parity or equal opportunity, then progressively incorporate more sophisticated approaches like individual and counterfactual fairness as capabilities mature.

Looking Ahead

In the next Unit, we will build on these mathematical foundations by examining legal standards for algorithmic fairness. You will learn how different mathematical definitions align with regulatory frameworks across domains such as employment, lending, and healthcare, and how to select definitions that satisfy specific legal requirements.

The mathematical formulations we have examined here provide the formal language needed to understand legal standards, which often implicitly reference specific fairness properties without using technical terminology. Understanding both the mathematical and legal perspectives is essential for developing fair AI systems that not only satisfy technical criteria but also meet regulatory requirements.


References

Barocas, S., Hardt, M., & Narayanan, A. (2019). Fairness and machine learning: Limitations and opportunities. Retrieved from https://fairmlbook.org

Buolamwini, J., & Gebru, T. (2018). Gender shades: Intersectional accuracy disparities in commercial gender classification. In Proceedings of the 1st Conference on Fairness, Accountability, and Transparency (pp. 77–91). Retrieved from https://proceedings.mlr.press/v81/buolamwini18a.html

Chouldechova, A. (2017). Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data, 5(2), 153–163. https://doi.org/10.1089/big.2016.0047

Dwork, C., Hardt, M., Pitassi, T., Reingold, O., & Zemel, R. (2012). Fairness through awareness. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference (pp. 214–226). https://doi.org/10.1145/2090236.2090255

Hardt, M., Price, E., & Srebro, N. (2016). Equality of opportunity in supervised learning. In Advances in Neural Information Processing Systems (pp. 3315–3323). Retrieved from https://proceedings.neurips.cc/paper/2016/file/9d2682367c3935defcb1f9e247a97c0d-Paper.pdf

Ilvento, C. (2019). Metric learning for individual fairness. arXiv preprint arXiv:1906.00250. Retrieved from https://arxiv.org/abs/1906.00250

Kleinberg, J., Mullainathan, S., & Raghavan, M. (2016). Inherent trade-offs in the fair determination of risk scores. arXiv preprint arXiv:1609.05807. Retrieved from https://arxiv.org/abs/1609.05807

Kusner, M. J., Loftus, J., Russell, C., & Silva, R. (2017). Counterfactual fairness. In Advances in Neural Information Processing Systems (pp. 4066–4076). Retrieved from https://proceedings.neurips.cc/paper/2017/file/a486cd07e4ac3d270571622f4f316ec5-Paper.pdf


Unit 3: Legal Standards for Algorithmic Fairness

1. Conceptual Foundation and Relevance

Guiding Questions

  • Question 1: How do legal frameworks across jurisdictions translate abstract fairness principles into specific requirements for algorithmic systems?
  • Question 2: When and how do technical fairness implementations align with or diverge from legal fairness standards, and what are the implications for compliance?

Conceptual Context

Legal standards represent a critical bridge between abstract fairness principles and concrete implementation requirements for AI systems. While mathematical fairness definitions provide precise computational formulations, legal frameworks establish the mandatory baseline requirements that algorithmic systems must satisfy within specific jurisdictions and domains. Understanding these requirements is essential for developing AI systems that not only optimize technical fairness metrics but also comply with relevant laws and regulations.

This legal understanding is particularly vital because compliance is not optional—organizations face significant consequences for deploying systems that violate legal fairness standards, from regulatory penalties to class-action lawsuits. As Wachter, Mittelstadt, and Russell (2021) have demonstrated, legal requirements often differ from technical fairness formulations in significant ways, creating potential disconnects between computational implementations and legal compliance.

This Unit builds directly on the conceptual fairness foundations established in Unit 1 and the mathematical formulations covered in Unit 2, examining how these abstract principles translate into specific legal requirements across jurisdictions. The legal frameworks you learn here will directly inform the Fairness Definition Selection Tool we will develop in Unit 5, ensuring that definition choices satisfy relevant regulatory requirements in addition to technical and ethical considerations.

2. Key Concepts

Anti-Discrimination Law and Protected Attributes

Anti-discrimination law establishes which characteristics (protected attributes) receive legal protection against unfair treatment and which domains face specific legal requirements. These frameworks are crucial for AI fairness because they define the baseline legal requirements that algorithmic systems must satisfy, determining which demographic disparities could create legal liability and which application domains face heightened scrutiny.

This legal foundation interacts with other fairness concepts by establishing which attributes must be considered in fairness assessments, which domains require particular attention, and which disparities may create legal risks. While technical fairness definitions may address any attribute, legal frameworks specify which ones receive mandatory protection.

In the United States, federal anti-discrimination laws establish protected attributes including race, color, national origin, sex, religion, age, disability status, genetic information, and veteran status—but protections vary significantly by domain. For example, the Equal Credit Opportunity Act (ECOA) prohibits discrimination in lending based on race, color, religion, national origin, sex, marital status, age, and public assistance status, while the Fair Housing Act covers a similar but not identical set of attributes for housing decisions (Barocas & Selbst, 2016).

The European Union's legal framework takes a broader approach through the General Data Protection Regulation (GDPR) and proposed AI Act. Article 9 of the GDPR establishes "special categories of personal data" including racial or ethnic origin, political opinions, religious beliefs, trade union membership, genetic data, biometric data, health data, and data concerning sexual orientation—creating potential protections across domains (Goodman & Flaxman, 2017).

For the Fairness Definition Selection Tool, understanding these protected attribute designations is essential because they establish the minimum demographic categories that must be considered in fairness assessments. Legal requirements may necessitate examining fairness across specific attributes even when technical definitions might suggest focusing elsewhere.

Disparate Treatment and Disparate Impact

U.S. anti-discrimination law distinguishes between two fundamental theories of discrimination: disparate treatment and disparate impact. This distinction is essential for AI fairness because it establishes different legal standards for intentional versus unintentional discrimination, with significant implications for how fairness must be evaluated and which defenses are available when disparities exist.

Disparate treatment involves intentionally treating people differently based on protected characteristics. In algorithmic systems, this might include explicitly using protected attributes as input features with the intent to discriminate. This form of discrimination faces strict scrutiny under the law and offers few defenses.

Disparate impact, established in Griggs v. Duke Power Co. (1971) and later codified in legislation, occurs when facially neutral practices disproportionately affect protected groups, regardless of intent. For algorithmic systems, this framework is particularly relevant because systems trained on historical data may produce discriminatory outcomes without explicit instruction to discriminate (Barocas & Selbst, 2016).

The disparate impact framework typically follows a three-part analysis:

  1. Prima facie case: Does the practice create a disproportionate adverse impact on a protected group?
  2. Business necessity: Does the practice serve a legitimate business purpose?
  3. Less discriminatory alternatives: Are there alternative practices that could achieve the same purpose with less discriminatory effect?

This legal standard aligns with some technical fairness definitions while diverging from others. For example, demographic parity (equal selection rates across groups) closely aligns with the prima facie analysis in disparate impact cases, while accuracy parity may be more relevant to the business necessity defense (Xiang & Raji, 2019).

For the Fairness Definition Selection Tool, these legal theories inform which fairness definitions may satisfy specific legal requirements and which technical trade-offs might be justified by business necessity defenses in different domains.

Domain-Specific Legal Standards

Legal fairness requirements vary substantially across application domains, creating a complex patchwork of regulations that AI systems must navigate. This domain specificity is critical for AI fairness because the same algorithm may face different legal standards depending on its application area, requiring context-specific fairness implementations.

In financial services, laws like the Equal Credit Opportunity Act (ECOA) and Regulation B establish specific requirements for credit decisions, including prohibitions against considering certain attributes and requirements for providing adverse action notices that explain the principal reasons for denial (Citron & Pasquale, 2014).

In employment, Title VII of the Civil Rights Act, the Age Discrimination in Employment Act, and the Americans with Disabilities Act create a comprehensive framework prohibiting discrimination across hiring, promotion, compensation, and termination decisions. The "four-fifths rule" established by the Equal Employment Opportunity Commission (EEOC) provides a specific threshold for identifying potential disparate impact: if the selection rate for any protected group is less than 80% of the rate for the highest-selected group, this may constitute evidence of adverse impact (Barocas & Selbst, 2016).
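The four-fifths rule translates directly into a simple adverse impact ratio check. The sketch below assumes an array of binary selection decisions and a group label per applicant; the 0.8 threshold comes from the EEOC guideline described above, while the function and variable names are illustrative.

```python
import numpy as np

def four_fifths_check(selected, group):
    """Flag groups whose selection rate is below 80% of the highest group's rate."""
    rates = {g: selected[group == g].mean() for g in np.unique(group)}
    highest = max(rates.values())
    return {
        g: {
            "selection_rate": rate,
            "impact_ratio": rate / highest,
            "below_four_fifths": (rate / highest) < 0.8,
        }
        for g, rate in rates.items()
    }
```

Falling below the 0.8 ratio does not itself establish illegal discrimination; it signals potential adverse impact that then triggers the business necessity and less discriminatory alternative analyses.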

Healthcare applications face distinct requirements under laws like the Affordable Care Act, which prohibits discrimination in healthcare programs, while education applications must navigate laws such as Title VI (for racial discrimination) and Title IX (for gender discrimination) that create domain-specific standards.

As algorithmic systems increasingly cross international boundaries, they may face conflicting legal requirements that necessitate different fairness implementations for different jurisdictions. The EU's proposed AI Act, for example, establishes a risk-based regulatory framework that places more stringent requirements on "high-risk" AI applications, including those used for employment, education, law enforcement, and access to essential services (Veale & Borgesius, 2021).

For the Fairness Definition Selection Tool, these domain-specific requirements inform which fairness definitions are most appropriate for specific application contexts and which legal standards must be satisfied in particular domains.

The Gap Between Legal and Technical Fairness

A fundamental tension exists between legal approaches to fairness and technical fairness definitions. This gap is critical for AI fairness because it creates potential scenarios where systems might satisfy technical fairness metrics while violating legal standards, or vice versa. Understanding these divergences is essential for implementing systems that satisfy both technical and legal requirements.

Wachter, Mittelstadt, and Russell (2021) identify several key areas where legal and technical approaches to fairness diverge:

  1. Contextual vs. Universal Standards: Legal standards are deeply contextual, considering domain-specific factors, historical patterns, and procedural elements, while technical definitions often seek universal mathematical properties.
  2. Group vs. Individual Focus: Legal frameworks often emphasize individual rights and remedies, while many technical fairness definitions focus on group-level statistical properties.
  3. Procedural vs. Outcome Emphasis: Legal frameworks frequently emphasize procedural fairness (how decisions are made) alongside outcome fairness, while technical definitions primarily focus on outcome distributions.
  4. Explanatory Requirements: Legal frameworks increasingly require explanations for adverse decisions, while technical implementations may struggle with interpretability, particularly for complex models.
  5. Counterfactual Consideration: Legal analysis often examines counterfactual scenarios (what would have happened if the individual had different protected characteristics), which aligns with some technical definitions (counterfactual fairness) but not others.

For example, U.S. anti-discrimination law often incorporates "business necessity" defenses that may justify certain disparities if they serve legitimate purposes and no less discriminatory alternatives exist. This contextual approach differs from the stricter mathematical constraints in definitions like demographic parity, which prohibit certain disparities regardless of justification.

For the Fairness Definition Selection Tool, understanding these gaps is essential for selecting definitions that satisfy legal requirements while acknowledging where technical implementation may need to extend beyond mathematical formulations to address procedural elements, explanations, and contextual factors.

Domain Modeling Perspective

From a domain modeling perspective, legal fairness standards map to specific components of ML systems:

  • Requirements Analysis: Legal frameworks establish which protected attributes must be considered and which disparities may create liability.
  • Data Collection: Anti-discrimination law informs what demographic data should be collected while navigating privacy considerations.
  • Model Selection: Legal standards may favor more interpretable models in high-stakes domains where explanation requirements exist.
  • Evaluation Framework: Domain-specific legal thresholds (like the four-fifths rule) provide concrete benchmarks for identifying potentially problematic disparities.
  • Documentation: Legal compliance increasingly requires specific documentation of fairness assessments and decision rationales.

This domain mapping helps you understand how legal requirements influence various stages of the ML lifecycle rather than viewing compliance as a separate concern. The Fairness Definition Selection Tool will leverage this mapping to guide appropriate definition selection based on legal requirements across different system components.

Conceptual Clarification

To clarify these abstract legal concepts, consider the following analogies:

  • Legal fairness frameworks function like building codes that establish minimum safety standards. Just as building codes specify requirements for structural integrity, fire safety, and accessibility—with different requirements for different building types and locations—legal fairness standards establish baseline requirements that vary by domain and jurisdiction. Compliance with these standards doesn't guarantee an ideal building (or a perfectly fair algorithm), but it creates a mandatory baseline that reduces the risk of harmful failures.
  • The distinction between disparate treatment and disparate impact resembles traffic violations. Disparate treatment is like intentionally running a red light—a deliberate violation that offers few defenses. Disparate impact is more like exceeding the speed limit because your speedometer is miscalibrated—you might not have intended to violate the law, but you're still responsible for the outcome. However, you might have valid defenses (like rushing someone to the hospital) that could justify the violation under specific circumstances.
  • The gap between legal and technical fairness is similar to the difference between legal and engineering standards for vehicle safety. Engineers might focus on precise metrics like stopping distance or crash test results, while legal standards incorporate broader considerations like warning labels, recall procedures, and driver experience. A car might perform well on technical metrics while failing to satisfy legal requirements, or vice versa—necessitating an approach that addresses both perspectives.

Intersectionality Consideration

Legal frameworks have historically struggled to address intersectional discrimination, where multiple protected attributes combine to create unique patterns of disadvantage. Most anti-discrimination laws were developed around single-attribute frameworks, requiring plaintiffs to identify discrimination based on a specific protected characteristic rather than their unique intersectional identity.

As Crenshaw's (1989) foundational work demonstrated, this single-axis approach can create significant blind spots. In the case of DeGraffenreid v. General Motors (1976), Black women were unable to bring a discrimination claim as Black women specifically—they were required to pursue claims either as women (regardless of race) or as Black people (regardless of gender), missing the unique discrimination they faced at that intersection.

For algorithmic systems, this legal limitation creates significant challenges. Technical fairness assessments can easily examine intersectional categories (e.g., evaluating performance specifically for Black women), but legal remedies may not be available for discrimination that manifests primarily at these intersections rather than across the broader categories.

Some jurisdictions are beginning to recognize intersectional discrimination more explicitly. The EU's approach increasingly acknowledges multiple discrimination, while some U.S. courts have recognized the possibility of "intersectional claims" combining multiple protected attributes. However, these approaches remain inconsistent and continue to evolve (Wachter, Mittelstadt, & Russell, 2021).

For the Fairness Definition Selection Tool, addressing intersectionality requires:

  1. Acknowledging where legal frameworks may not provide adequate protection for intersectional categories;
  2. Identifying which fairness definitions can address intersectional fairness despite legal limitations;
  3. Documenting intersectional assessments even when they exceed legal requirements; and
  4. Tracking evolving legal approaches to intersectional discrimination across jurisdictions.

3. Practical Considerations

Implementation Framework

To effectively translate legal standards into practical fairness implementation, follow this systematic methodology:

  1. Regulatory Mapping:
     • Identify which jurisdictions your system will operate within.
     • Determine which sector-specific regulations apply to your application domain.
     • Document the protected attributes recognized in each relevant legal framework.
     • Map legal thresholds (like the four-fifths rule) to quantifiable metrics.
  2. Compliance Analysis:
     • For each protected attribute, assess potential disparate impact using relevant metrics.
     • Document your legitimate business objectives and how they relate to model features.
     • Evaluate alternative approaches that might achieve similar objectives with less disparate impact.
     • Implement explanation mechanisms that satisfy regulatory requirements for transparency.
  3. Documentation Strategy:
     • Create standardized documentation capturing fairness assessments across protected attributes.
     • Document design choices made to address potential legal concerns.
     • Maintain records of alternative approaches considered and why they were rejected.
     • Develop clear explanations of model behaviors that satisfy regulatory requirements.

This framework integrates with standard ML workflows by incorporating legal compliance considerations into each development stage rather than treating them as a separate process. While adding complexity to development, these steps help avoid legal challenges that could prevent deployment or require costly remediation.

Implementation Challenges

When implementing fairness approaches that satisfy legal requirements, practitioners commonly face these challenges:

  1. Navigating Regulatory Uncertainty: Many jurisdictions are still developing AI-specific regulations, creating ambiguity about exactly which requirements apply to algorithmic systems. Address this by:
     • Implementing a conservative approach that satisfies the strictest potential interpretations where feasible.
     • Engaging legal experts early in the development process to interpret applicable regulations.
     • Documenting your reasoning for specific implementation choices to demonstrate good-faith compliance efforts.
  2. Communicating Legal Requirements to Technical Teams: Technical practitioners may find legal frameworks abstract or difficult to translate into specific implementation decisions. Address this by:
     • Developing concrete thresholds and metrics derived from legal standards.
     • Creating checklists that translate legal requirements into technical verification steps.
     • Using case studies of similar systems to illustrate how legal standards apply in practice.

Successfully implementing legally compliant fairness approaches requires resources including legal expertise to interpret applicable regulations, cross-functional collaboration between legal and technical teams, standardized documentation approaches, and testing procedures that verify compliance with relevant thresholds.

Evaluation Approach

To assess whether your fairness implementation satisfies legal requirements, apply these evaluation strategies:

  1. Regulatory Compliance Verification:
     • Evaluate disparate impact across all legally protected attributes using appropriate statistical measures.
     • Verify that disparities fall within acceptable thresholds for specific domains (e.g., the four-fifths rule for employment).
     • Confirm that documentation meets applicable regulatory requirements for transparency and explanation.
     • Verify that data collection and use comply with relevant privacy regulations.
  2. Defense Preparation:
     • Document the relationship between model features and legitimate business objectives.
     • Maintain records of alternative approaches considered and their comparative performance.
     • Prepare explanations for why the selected approach represents the least discriminatory alternative capable of achieving business objectives.
     • Conduct periodic reassessments as new alternatives become available or regulatory interpretations evolve.

These evaluation approaches should be integrated with your organization's broader compliance framework, ensuring that fairness assessment becomes a standard part of regulatory compliance verification rather than a separate consideration.
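For the statistical testing mentioned in the verification steps above (and in the earlier case studies), a two-proportion z-test is one standard way to ask whether an observed selection-rate disparity is larger than sampling noise alone would explain. The sketch below uses only the normal approximation from the standard library; which significance threshold to adopt is an application-specific and legal judgment, not a technical default.

```python
import math

def two_proportion_z_test(selected_a, n_a, selected_b, n_b):
    """Z statistic and two-sided p-value for a difference in selection rates."""
    p_a, p_b = selected_a / n_a, selected_b / n_b
    pooled = (selected_a + selected_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    # Two-sided p-value from the standard normal CDF, computed via erf.
    p_value = 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))
    return z, p_value
```

Courts and regulators have also looked at statistical significance alongside the four-fifths ratio, so reporting both gives a more complete compliance picture.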

4. Case Study: Employment Screening Algorithm

Scenario Context

A large retail company is implementing an algorithm to screen job applicants for customer service positions. The algorithm analyzes application materials and predicts which candidates are most likely to succeed based on historical performance data from current employees. The system will be used across multiple states to filter applications before human review, significantly influencing who receives interview opportunities.

Key stakeholders include the HR department seeking efficiency in hiring, legal counsel concerned about compliance, job applicants from diverse backgrounds, and store managers interested in qualified candidates. The context involves multiple legal frameworks, including Title VII of the Civil Rights Act, the Americans with Disabilities Act, state-specific employment laws, and potentially the algorithmic accountability laws emerging in some jurisdictions.

Problem Analysis

Applying core concepts from this Unit reveals several legal challenges in the employment screening scenario:

  1. Protected Attribute Considerations: Employment algorithms face some of the most comprehensive anti-discrimination requirements, covering race, color, religion, sex, national origin, age (40+), disability, genetic information, and in some jurisdictions, sexual orientation and gender identity. The historical performance data likely contains patterns reflecting past hiring practices that may have incorporated biases against certain groups.
  2. Disparate Impact Analysis: Even without explicitly considering protected attributes, the algorithm may create disparate impact by drawing on features correlated with protected characteristics. For example, if the algorithm prioritizes candidates from specific educational institutions or geographic areas, it might disproportionately screen out racial minorities. The EEOC's four-fifths rule provides a specific threshold: if the selection rate for any protected group falls below 80% of the rate for the highest-selected group, this creates a prima facie case of adverse impact.
  3. Business Necessity Considerations: The company must demonstrate that the algorithmic screening serves legitimate business objectives by accurately predicting job performance. This requires establishing that the features used are validly related to performance and that the algorithm effectively captures these relationships. If challenged, the company would need to show that no alternative approach could achieve similar business results with less disparate impact.

From an intersectional perspective, the analysis becomes more complex. While Title VII has traditionally been applied to single-attribute discrimination, some courts have recognized intersectional claims. The algorithm might satisfy the four-fifths rule when evaluating race and gender separately but create significant disparities for specific combinations, such as women of color. These intersectional effects require careful analysis even if current legal frameworks provide limited remedies for them.

Solution Implementation

To address these legal requirements, the company implements a structured approach:

  1. For Protected Attribute Compliance, they:
     • Conduct a comprehensive review of all protected attributes under relevant jurisdictions, including federal, state, and local requirements.
     • Implement robust testing across all legally protected categories, even those not explicitly included in the training data.
     • Develop specific metrics for each protected attribute based on relevant legal standards.
  2. For Disparate Impact Mitigation, they:
     • Apply the four-fifths rule as a minimum threshold across all protected groups.
     • Conduct statistical significance testing to determine whether observed disparities are meaningful.
     • Implement alternative cutoff thresholds for different groups when necessary to satisfy the four-fifths rule while maintaining validity.
  3. For Business Necessity Documentation, they:
     • Conduct validation studies demonstrating the relationship between algorithmic scores and actual job performance.
     • Document how each feature relates to legitimate job requirements.
     • Evaluate alternative algorithms and configurations to identify the least discriminatory approach that maintains predictive performance.
  4. For Legal Compliance Documentation, they:
     • Create standardized reports capturing fairness metrics across all protected attributes.
     • Develop explanation templates that provide principal reasons for adverse decisions.
     • Establish ongoing monitoring to track potential emerging disparities as the system operates.

Throughout implementation, they maintain explicit focus on intersectional effects, ensuring that their analysis examines potential disparities not just for broad categories (e.g., gender or race), but also for specific combinations (e.g., older women, Black men) that might face unique disadvantages.

Outcomes and Lessons

The implementation resulted in significant improvements:

  • The revised algorithm reduced disparate impact across all protected groups to levels below legal thresholds.
  • Documentation practices created clear audit trails demonstrating compliance considerations.
  • Explanation capabilities satisfied adverse action notice requirements while maintaining appropriate transparency.

Key challenges remained, including ongoing tension between prediction accuracy and fairness objectives, emerging disparities requiring regular monitoring, and varying legal requirements across jurisdictions necessitating location-specific adjustments.

The most generalizable lessons included:

  1. The importance of implementing the strictest applicable standards when operating across multiple jurisdictions, creating a conservative baseline approach.
  2. The value of creating standardized documentation templates that capture compliance considerations in a format accessible to both technical and legal stakeholders.
  3. The need for ongoing monitoring rather than one-time assessment, as both legal standards and system behaviors evolve over time.

These insights directly inform the development of the Fairness Definition Selection Tool, particularly in creating decision paths that incorporate legal requirements alongside technical and ethical considerations.

5. Frequently Asked Questions

FAQ 1: Technical Fairness and Legal Compliance

Q: If my algorithm satisfies mathematical fairness definitions like demographic parity or equal opportunity, does that guarantee legal compliance with anti-discrimination laws? A: No, legal compliance and technical fairness definitions only partially overlap. While some technical definitions align with aspects of legal requirements (e.g., demographic parity relates to the prima facie analysis in disparate impact cases), legal frameworks include additional considerations like procedural requirements, explanation obligations, and context-specific defenses. For example, an algorithm could satisfy demographic parity while failing to provide legally required explanations for adverse decisions, or it might violate demographic parity while satisfying legal requirements through valid business necessity defenses. Complete legal compliance requires considering domain-specific regulations, procedural elements, documentation requirements, and jurisdictional variations beyond mathematical fairness properties. The most robust approach combines appropriate technical definitions with specific legal compliance measures for your particular application domain and jurisdiction.

FAQ 2: Navigating Conflicting Requirements

Q: How should I handle situations where different fairness definitions or legal requirements across jurisdictions create conflicting implementation requirements? A: When facing conflicting requirements, first determine whether the conflict is fundamental or merely a matter of different thresholds. For threshold differences (e.g., different statistical standards across jurisdictions), implement to satisfy the strictest applicable standard, which will typically satisfy less stringent requirements as well. For fundamental conflicts (e.g., one jurisdiction requires considering protected attributes while another prohibits it), you'll need a more nuanced approach: (1) Segment your implementation by jurisdiction where feasible, applying different standards in different regions; (2) Document your reasoning for selecting specific approaches when conflicts cannot be resolved; (3) Prioritize based on risk assessment, giving greater weight to requirements with stronger enforcement mechanisms or higher penalties; (4) Consult legal counsel specializing in cross-jurisdictional compliance; and (5) Consider whether a human-in-the-loop approach might help navigate particularly complex conflicts by allowing context-specific judgments. Throughout this process, maintain transparent documentation of the conflicts identified and your rationale for implementation decisions.

6. Summary and Next Steps

Key Takeaways

This Unit has explored the critical role of legal frameworks in shaping AI fairness requirements. The key concepts include:

  • Anti-discrimination law establishes protected attributes and domain-specific requirements that algorithmic systems must satisfy, creating the regulatory foundation for fairness implementation.
  • Disparate impact analysis provides a structured framework for identifying legally problematic disparities even without discriminatory intent, with particular relevance for algorithmic systems that may perpetuate historical patterns.
  • Domain-specific legal standards create varying requirements across application areas, necessitating context-sensitive fairness implementations.
  • The gap between legal and technical fairness creates potential situations where systems might satisfy mathematical definitions while violating legal standards, requiring approaches that address both perspectives.

These concepts directly address our guiding questions by showing how legal frameworks translate abstract principles into specific requirements and highlighting where technical implementations might align with or diverge from legal standards. This understanding is essential for developing systems that satisfy both technical and legal fairness requirements.

Application Guidance

To apply these concepts in your practical work:

  1. Start by identifying the specific legal frameworks applicable to your application domain and jurisdictions, creating a comprehensive map of protected attributes and regulatory requirements.
  2. Conduct disparate impact analysis early in development, testing potential disparities across all legally protected attributes and their intersections.
  3. Document your legitimate business objectives and how they relate to model features, establishing the foundation for potential business necessity defenses.
  4. Evaluate multiple fairness definitions to identify those that satisfy relevant legal requirements while acknowledging where trade-offs may be necessary.

For organizations new to these considerations, start with a focused assessment of the highest-risk attributes and domains based on your specific application context, then progressively expand analysis as capabilities develop.

Looking Ahead

In the next Unit, we will build on this legal foundation by examining conflicting definitions and impossibility results in fairness. You will learn why multiple desirable fairness criteria cannot be simultaneously satisfied, how these mathematical constraints shape definition selection, and how to navigate the inevitable trade-offs between competing fairness objectives.

The legal frameworks we have examined provide essential context for understanding these trade-offs, as they establish which fairness properties have regulatory priority in specific domains. By combining legal understanding with mathematical insights about impossibility results, you will develop a more nuanced approach to fairness definition selection that navigates practical constraints while ensuring legal compliance.


References

Barocas, S., & Selbst, A. D. (2016). Big data's disparate impact. California Law Review, 104(3), 671–732. https://doi.org/10.15779/Z38BG31

Citron, D. K., & Pasquale, F. (2014). The scored society: Due process for automated predictions. Washington Law Review, 89(1), 1–33. https://digitalcommons.law.umaryland.edu/fac_pubs/1431/

Crenshaw, K. (1989). Demarginalizing the intersection of race and sex: A black feminist critique of antidiscrimination doctrine, feminist theory, and antiracist politics. University of Chicago Legal Forum, 1989(1), 139–167. https://chicagounbound.uchicago.edu/uclf/vol1989/iss1/8/

Goodman, B., & Flaxman, S. (2017). European Union regulations on algorithmic decision-making and a "right to explanation". AI Magazine, 38(3), 50–57. https://doi.org/10.1609/aimag.v38i3.2741

Veale, M., & Borgesius, F. Z. (2021). Demystifying the Draft EU Artificial Intelligence Act. Computer Law Review International, 22(4), 97–112. https://doi.org/10.9785/cri-2021-220402

Wachter, S., Mittelstadt, B., & Russell, C. (2021). Why fairness cannot be automated: Bridging the gap between EU non-discrimination law and AI. Computer Law & Security Review, 41, 105567. https://doi.org/10.1016/j.clsr.2021.105567

Xiang, A., & Raji, I. D. (2019). On the legal compatibility of fairness definitions. Workshop on Human-Centric Machine Learning at the 33rd Conference on Neural Information Processing Systems. https://arxiv.org/abs/1912.00761

Žliobaitė, I., & Custers, B. (2016). Using sensitive personal data may be necessary for avoiding discrimination in data-driven decision models. Artificial Intelligence and Law, 24(2), 183–201. https://doi.org/10.1007/s10506-016-9182-5

Chouldechova, A., & Roth, A. (2020). A snapshot of the frontiers of fairness in machine learning. Communications of the ACM, 63(5), 82–89. https://doi.org/10.1145/3376898

Kroll, J. A., Huey, J., Barocas, S., Felten, E. W., Reidenberg, J. R., Robinson, D. G., & Yu, H. (2017). Accountable algorithms. University of Pennsylvania Law Review, 165(3), 633–705. https://scholarship.law.upenn.edu/penn_law_review/vol165/iss3/3/


Unit 4: Conflicting Definitions and Impossibility Results

1. Conceptual Foundation and Relevance

Guiding Questions

  • Question 1: Why can't we simultaneously satisfy multiple fairness definitions in most real-world scenarios, and what are the fundamental mathematical incompatibilities that create these tensions?
  • Question 2: How should practitioners navigate these inherent trade-offs when implementing fairness in AI systems, and what frameworks can guide principled decision-making when faced with competing fairness criteria?

Conceptual Context

Understanding the inherent tensions between different fairness definitions represents a critical turning point in developing effective fairness strategies. While previous units have introduced various fairness definitions and their mathematical formulations, this unit reveals the fundamental impossibility results that constrain their simultaneous implementation. These mathematical constraints are not mere technical limitations but reflect deeper philosophical tensions about what constitutes "fairness" in different contexts.

These impossibility results directly influence AI fairness practice because they force explicit prioritization between competing fairness criteria. Without understanding these constraints, practitioners might pursue mathematically impossible fairness goals, leading to confusion, wasted effort, or misleading claims about system fairness. As Kleinberg, Mullainathan, and Raghavan (2016) demonstrated in their landmark work, fundamental tensions exist between seemingly reasonable fairness criteria, requiring careful navigation rather than wholesale satisfaction.

This unit builds directly on the mathematical formulations examined in Unit 2, showing why the diverse fairness definitions cannot all be simultaneously satisfied. It establishes the essential foundation for the Fairness Definition Selection Tool you will develop in Unit 5 by providing the mathematical basis for necessary trade-offs. By understanding these impossibility results, you will develop more nuanced fairness approaches that acknowledge inherent tensions rather than pursuing unattainable "perfect fairness."

2. Key Concepts

Fundamental Impossibility Theorems

Impossibility theorems prove that multiple desirable fairness criteria cannot be simultaneously satisfied except in highly restrictive or trivial scenarios. These results are fundamental to AI fairness because they establish that fairness implementation necessarily involves value judgments about which criteria to prioritize—technical solutions cannot eliminate these normative choices but can make them more explicit and principled.

The landmark impossibility theorem by Kleinberg et al. (2016) proves that three desirable fairness properties cannot be simultaneously satisfied except in exceptional cases: calibration within groups, balance for the positive class, and balance for the negative class. Similarly, Chouldechova (2017) demonstrated that when base rates differ between groups, it is impossible to simultaneously achieve equal false positive rates, equal false negative rates, and equal positive predictive values.
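The second result can be seen from a simple accounting identity that links these error rates to the group base rate. Writing $p$ for a group's base rate (the prevalence of the positive outcome), the relationship, in notation consistent with Chouldechova (2017), is:

$$\mathrm{FPR} \;=\; \frac{p}{1-p}\cdot\frac{1-\mathrm{PPV}}{\mathrm{PPV}}\cdot\left(1-\mathrm{FNR}\right)$$

If two groups have different base rates but the classifier achieves equal PPV and equal FNR for both, the right-hand side differs across groups, so their false positive rates must differ. Equalizing all three quantities at once is possible only when base rates coincide or prediction is perfect.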

These impossibility results directly connect to other fairness concepts by establishing their fundamental incompatibility. They demonstrate that pursuing group fairness metrics like demographic parity may require sacrificing individual fairness or that achieving equal false positive rates might necessitate unequal false negative rates. This forces explicit consideration of which fairness dimensions matter most in specific contexts.

A concrete application emerges in criminal justice risk assessment, where Kleinberg et al. (2016) demonstrated that algorithms cannot simultaneously satisfy calibration (risk scores mean the same thing regardless of group), balance for the positive class (people who reoffend have similar average risk scores regardless of group), and balance for the negative class (people who don't reoffend have similar average risk scores regardless of group). This mathematical impossibility forces explicit choices about which fairness properties to prioritize.

For the Fairness Definition Selection Tool we'll develop in Unit 5, these impossibility theorems establish the fundamental need for context-specific selection rather than universally "best" definitions. The framework must help practitioners navigate these inherent tensions by making trade-offs explicit and connecting them to application-specific priorities and ethical principles.

The Calibration-Balance Trade-off

The trade-off between calibration and balance represents one of the most significant tensions in fairness implementation. Calibration requires that predicted probabilities have consistent meaning across groups—a 70% risk score should represent a 70% chance of the outcome regardless of group membership. Balance requires that people with similar true outcomes have similar predicted scores regardless of group membership.

This trade-off interacts with other fairness concepts by illustrating how seemingly reasonable fairness goals can fundamentally conflict. For instance, a loan approval system calibrated across demographic groups might still assign different average risk scores to qualified applicants from different groups, violating balance and potentially reinforcing historical disadvantage despite technical "fairness" in calibration.

Corbett-Davies et al. (2017) demonstrated this trade-off in their analysis of COMPAS recidivism prediction, showing that enforcing balance requires sacrificing calibration unless the base rates of recidivism are identical across groups. Since base rates frequently differ due to historical factors—often reflecting structural inequalities rather than inherent differences—this creates a fundamental tension between matching outcomes within groups (calibration) and ensuring similar predictions for similar cases across groups (balance).

For our Fairness Definition Selection Tool, understanding this specific trade-off illustrates why contextual priorities matter. In some applications, calibration might be crucial for proper risk assessment, while in others, balance might be more important for ensuring similar treatment of similar individuals. The framework must help practitioners analyze this trade-off in their specific context rather than prescribing universal solutions.

Demographic Parity Vs. Individual Fairness

Another fundamental tension exists between group fairness metrics like demographic parity and individual fairness notions that require similar treatment of similar individuals. Demographic parity ensures equal selection rates across protected groups regardless of qualifications, while individual fairness requires that similar individuals receive similar predictions regardless of protected attributes.

This tension connects to other fairness concepts by highlighting the competing ethical principles of group representation versus individual treatment. Pursuing demographic parity might require treating similarly qualified individuals differently based on protected attributes, while strict individual fairness might preserve existing group disparities if qualifications are unequally distributed due to historical discrimination.

Dwork et al. (2012) illustrate this trade-off in their foundational work on individual fairness, demonstrating how enforcing demographic parity can violate the principle that similar individuals should be treated similarly. For example, a college admissions algorithm enforcing strict demographic parity might accept less qualified applicants from one group while rejecting more qualified applicants from another—addressing historical representation disparities but potentially violating individual fairness.
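A small synthetic sketch can make this concrete. Assuming hypothetical qualification scores and group-specific cutoffs chosen to equalize acceptance rates, the code below measures both the demographic parity gap and a simple individual-fairness proxy: how often nearly identical applicants receive different decisions.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical admissions data: one qualification score x and a binary group label a.
n = 2000
a = rng.binomial(1, 0.5, n)                       # protected attribute
x = rng.normal(loc=0.3 * a, scale=1.0, size=n)    # historically shifted qualification scores

# Group-specific cutoffs chosen so both groups have a 30% acceptance rate (demographic parity).
cutoff = {g: np.quantile(x[a == g], 0.7) for g in (0, 1)}
accept = (x > np.where(a == 1, cutoff[1], cutoff[0])).astype(int)

# Demographic parity gap: difference in acceptance rates across groups (roughly zero here).
dp_gap = abs(accept[a == 0].mean() - accept[a == 1].mean())

# Individual-fairness proxy: adjacent applicants (sorted by x) with nearly identical scores
# but different decisions -- these occur for scores lying between the two group cutoffs.
order = np.argsort(x)
close = np.abs(np.diff(x[order])) < 0.01
inconsistent = (np.diff(accept[order]) != 0) & close
print(f"DP gap: {dp_gap:.3f}, near-identical pairs treated differently: "
      f"{inconsistent.sum()} of {close.sum()}")
```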

For the Fairness Definition Selection Tool, this tension requires explicit consideration of when representation goals might justifiably override strict individual similarity treatment. The framework must help practitioners connect these technical trade-offs to ethical principles and application contexts, determining whether historical patterns require interventions at the group level that might temporarily affect individual-level fairness.

Statistical Infeasibility With Multiple Constraints

Beyond the conceptual impossibility results, implementing multiple fairness constraints often creates statistical infeasibility in practice. Even when definitions are not theoretically incompatible, achieving multiple fairness criteria simultaneously may be statistically impossible given limited data or may lead to extreme solutions that sacrifice other important properties like accuracy.

This concept connects to the broader challenges of fairness implementation by showing how theoretical possibilities may confront practical limitations. For instance, enforcing both demographic parity and equal opportunity might be theoretically possible in some cases but practically infeasible without significant accuracy losses or unrealistic model complexity.

Agarwal et al. (2018) demonstrated these practical challenges in their work on constrained optimization, showing how enforcing multiple fairness constraints can dramatically reduce the feasible solution space, sometimes leading to trivial solutions that ignore useful features entirely. For instance, a hiring algorithm constrained by multiple fairness criteria might default to random selection rather than leveraging predictive features, sacrificing utility for mathematical satisfaction of constraints.
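As a rough illustration of this effect, the sketch below assumes the open-source fairlearn library is available (its reductions module implements the Agarwal et al., 2018 approach) and compares an unconstrained classifier against one trained under a demographic parity constraint on synthetic data; the data and numbers are illustrative, and stacking additional constraints such as equalized odds would shrink the feasible space further.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from fairlearn.reductions import ExponentiatedGradient, DemographicParity  # assumes fairlearn is installed

rng = np.random.default_rng(0)
n = 4000
A = rng.binomial(1, 0.5, n)                          # protected attribute
X = np.column_stack([rng.normal(0.4 * A, 1.0, n),    # feature correlated with A
                     rng.normal(0.0, 1.0, n)])
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(0, 1.0, n) > 0.5).astype(int)

def dp_gap(pred, A):
    """Absolute difference in selection rates between the two groups."""
    return abs(pred[A == 0].mean() - pred[A == 1].mean())

base = LogisticRegression().fit(X, y)
base_pred = base.predict(X)

mitigated = ExponentiatedGradient(LogisticRegression(), constraints=DemographicParity())
mitigated.fit(X, y, sensitive_features=A)
mit_pred = mitigated.predict(X)

print(f"unconstrained: acc={(base_pred == y).mean():.3f}, DP gap={dp_gap(base_pred, A):.3f}")
print(f"constrained:   acc={(mit_pred == y).mean():.3f}, DP gap={dp_gap(mit_pred, A):.3f}")
```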

For our Fairness Definition Selection Tool, this statistical perspective reinforces the need for prioritization among fairness criteria. The framework must help practitioners not only understand theoretical incompatibilities but also practical implementation constraints, guiding principled choices about which fairness properties to optimize given specific data characteristics and model requirements.

Domain Modeling Perspective

Impossibility results and fairness trade-offs map to specific components of ML systems through a comprehensive domain model:

  • Problem Formulation: Fairness definition selection fundamentally shapes how the ML problem is framed and which outcomes are optimized.
  • Data Requirements: Different fairness definitions require specific data attributes and validation approaches.
  • Algorithm Selection: The chosen fairness definitions influence which algorithms can effectively satisfy the selected criteria.
  • Optimization Strategy: Trade-offs between fairness definitions directly shape the constraint formulation or regularization approach.
  • Evaluation Framework: Competing fairness metrics create multi-objective evaluation challenges requiring explicit weighting.

This domain mapping helps you understand how impossibility results constrain specific technical choices throughout the ML lifecycle rather than viewing them as abstract mathematical limitations. The Fairness Definition Selection Tool will leverage this mapping to guide appropriate trade-off decisions based on system requirements and constraints.

Conceptual Clarification

To clarify these abstract mathematical concepts, consider the following analogies:

  • Impossibility theorems function like the engineering principle that you cannot simultaneously optimize for strength, weight, and cost in material design. Just as an aerospace engineer must decide which properties to prioritize based on application requirements—perhaps favoring strength for critical structural components while accepting higher weight—fairness implementation requires prioritizing certain fairness properties over others based on application context. The mathematics proves you cannot "have it all" in either domain.
  • The calibration-balance trade-off resembles the challenge of designing standardized tests that both accurately measure individual ability (calibration) while ensuring comparable scores across different educational backgrounds (balance). Just as educational testing must navigate tensions between maintaining consistent meaning of scores while accounting for systemic disparities in preparation, AI systems must balance accurate probability estimation with equitable treatment across groups with different historical experiences.
  • The demographic parity vs. individual fairness tension parallels affirmative action debates in college admissions. Just as educational institutions must navigate tensions between increasing demographic representation while maintaining merit-based selection, AI systems must balance group-level representation goals against individual-level similarity treatment. Both domains require explicit acknowledgment of whether historical disadvantages justify temporary interventions at the group level.

Intersectionality Consideration

Impossibility results grow more challenging when considering intersectional fairness, where multiple protected attributes interact to create distinct patterns. When examining fairness across intersecting dimensions like race, gender, age, and disability status, the mathematical incompatibilities between definitions become even more pronounced.

Kearns et al. (2018) demonstrated this challenge in their work on subgroup fairness, showing that even definitions that appear compatible when considering single protected attributes may become incompatible when examining their intersections. For example, a hiring algorithm might achieve both demographic parity and equal opportunity across racial groups in aggregate, and separately across gender groups, while still exhibiting significant disparities for specific combinations like women of color.

The statistical challenges multiply with intersectional analysis: smaller sample sizes at demographic intersections create greater uncertainty in fairness measurements, making it even harder to satisfy multiple constraints reliably. As Foulds et al. (2020) showed, enforcing fairness constraints across all demographic intersections can quickly become statistically infeasible as the number of protected attributes increases.
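To make the sample-size problem concrete, the following sketch (synthetic data with hypothetical attribute values) computes a true-positive-rate estimate and a rough 95% confidence interval for each intersectional subgroup; as subgroups shrink, the intervals widen and fairness comparisons become correspondingly less reliable.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(2)
n = 3000
df = pd.DataFrame({
    "race": rng.choice(["A", "B"], n, p=[0.7, 0.3]),
    "gender": rng.choice(["F", "M"], n),
    "y_true": rng.binomial(1, 0.4, n),
})
df["y_pred"] = rng.binomial(1, 0.45, n)                # placeholder predictions

# True positive rate per intersectional subgroup, with sample size and a rough 95% CI.
for (race, gender), sub in df.groupby(["race", "gender"]):
    qualified = sub[sub.y_true == 1]                   # equal-opportunity view: actual positives
    rate = qualified.y_pred.mean()
    se = np.sqrt(rate * (1 - rate) / len(qualified))   # normal-approximation standard error
    print(f"{race}/{gender}: n_qualified={len(qualified):4d}, TPR={rate:.2f} ± {1.96 * se:.2f}")
```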

For the Fairness Definition Selection Tool, addressing intersectionality requires:

  1. Acknowledging that trade-offs become more acute when considering multiple protected attributes simultaneously;
  2. Developing approaches for prioritizing which intersectional disparities most require intervention;
  3. Creating statistical methods that can handle the smaller sample sizes at demographic intersections; and
  4. Establishing explicit documentation of how intersectional considerations inform fairness definition selection.

By explicitly incorporating these intersectional considerations, the framework will help practitioners navigate the even more complex trade-offs that emerge when examining fairness across multiple dimensions simultaneously.

3. Practical Considerations

Implementation Framework

To effectively navigate fairness definition incompatibilities in practice, implement this systematic methodology:

  1. Trade-off Analysis:
     • Map the mathematical relationships between different fairness definitions in your specific context.
     • Quantify the extent of incompatibility, measuring how much satisfying one definition affects others.
     • Create Pareto frontiers showing the range of possible trade-offs between competing definitions (see the sketch after this list).
     • Document which definitions are theoretically incompatible versus practically challenging to satisfy simultaneously.

  2. Contextual Prioritization:
     • Analyze your application domain to determine which fairness dimensions are most critical.
     • Consider historical patterns of discrimination that might prioritize certain definitions.
     • Evaluate which unfairness harms would be most severe in your specific context.
     • Consult diverse stakeholders to understand different perspectives on prioritization.

  3. Explicit Documentation:
     • Create clear documentation of which fairness definitions were considered.
     • Record the mathematical analysis showing incompatibilities.
     • Document the prioritization rationale explaining why certain definitions were selected over others.
     • Acknowledge the limitations and potential concerns with the chosen approach.

  4. Implementation Strategy:
     • Design optimization approaches that implement the prioritized fairness definition as a primary constraint.
     • Consider secondary fairness definitions as regularization terms rather than hard constraints.
     • Develop monitoring approaches that track all relevant fairness metrics, even those not explicitly optimized.
     • Create intervention plans for addressing severe disparities in non-prioritized fairness dimensions.
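As a starting point for the Pareto-frontier step above, the following sketch (synthetic scores and hypothetical thresholds) sweeps group-specific decision thresholds, records accuracy alongside demographic-parity and equal-opportunity gaps, and keeps the points that are not dominated on the accuracy/demographic-parity axes:

```python
import numpy as np

rng = np.random.default_rng(3)
n = 5000
A = rng.binomial(1, 0.5, n)
score = np.clip(rng.normal(0.45 + 0.12 * A, 0.2, n), 0, 1)   # hypothetical risk scores
y = rng.binomial(1, score)                                    # outcomes consistent with the scores

def metrics(t0, t1):
    """Accuracy, demographic-parity gap, and equal-opportunity gap for group thresholds t0, t1."""
    pred = np.where(A == 1, score >= t1, score >= t0).astype(int)
    acc = (pred == y).mean()
    dp_gap = abs(pred[A == 0].mean() - pred[A == 1].mean())
    eo_gap = abs(pred[(A == 0) & (y == 1)].mean() - pred[(A == 1) & (y == 1)].mean())
    return acc, dp_gap, eo_gap

# Sweep group-specific thresholds and keep points not dominated on (accuracy, DP gap).
grid = np.linspace(0.3, 0.7, 9)
points = [(t0, t1, *metrics(t0, t1)) for t0 in grid for t1 in grid]
pareto = [p for p in points
          if not any(q[2] >= p[2] and q[3] <= p[3] and (q[2] > p[2] or q[3] < p[3])
                     for q in points)]
for t0, t1, acc, dp, eo in sorted(pareto, key=lambda p: p[3]):
    print(f"t0={t0:.2f} t1={t1:.2f}  acc={acc:.3f}  DP gap={dp:.3f}  EO gap={eo:.3f}")
```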

This methodology integrates with standard ML workflows by extending model selection and evaluation to explicitly incorporate fairness trade-offs alongside traditional performance metrics. While adding complexity to the development process, these steps ensure that fairness considerations are addressed systematically rather than as an afterthought.

Implementation Challenges

When implementing this trade-off analysis, practitioners commonly face these challenges:

  1. Communicating Trade-offs to Non-Technical Stakeholders: The mathematical incompatibilities between fairness definitions can be difficult to explain to decision-makers without technical backgrounds. Address this by:
     • Developing intuitive visualizations that illustrate trade-offs without requiring mathematical understanding.
     • Creating concrete examples showing how different definitions would affect specific individuals or groups.
     • Framing trade-offs in terms of ethical principles and organizational values rather than mathematical constraints.

  2. Navigating Political Dimensions of Prioritization: Selecting which fairness definition to prioritize often involves politically sensitive judgments about whose interests matter most. Address this by:
     • Creating structured processes for gathering diverse stakeholder input on prioritization.
     • Documenting the perspectives of affected communities, particularly those historically marginalized.
     • Acknowledging the political nature of these decisions rather than presenting them as purely technical.

Successfully navigating fairness trade-offs requires resources including:

  • Computational resources for calculating potential trade-off frontiers across different model configurations.
  • Diverse stakeholder engagement to inform prioritization decisions with multiple perspectives.
  • Expertise spanning technical implementation, ethical principles, and domain-specific fairness considerations.
  • Time for deliberative processes that thoroughly evaluate trade-offs rather than rushing to implementation.

Evaluation Approach

To assess whether your fairness trade-off analysis is effective, apply these evaluation strategies:

  1. Comprehensive Trade-off Mapping:
     • Verify that all relevant fairness definitions have been considered and their relationships analyzed.
     • Ensure the analysis quantifies the extent of trade-offs rather than merely noting their existence.
     • Confirm that trade-offs are evaluated specifically for your data and model, not just based on general principles.

  2. Prioritization Transparency:
     • Evaluate whether the prioritization rationale clearly explains why certain definitions were selected.
     • Verify that the rationale connects to specific application characteristics and harm considerations.
     • Ensure the documentation acknowledges perspectives that might prioritize differently.

  3. Implementation Fidelity:
     • Measure how effectively the implementation actually satisfies the prioritized fairness definition.
     • Evaluate impacts on non-prioritized fairness dimensions to ensure they remain within acceptable thresholds.
     • Verify that monitoring systems effectively track all relevant fairness metrics, not just those prioritized.

These evaluation approaches should be integrated with your organization's broader model governance framework, ensuring that fairness trade-off decisions receive the same rigorous review as other critical model properties.

4. Case Study: Loan Approval Algorithm

Scenario Context

A financial institution is developing an algorithmic lending system to predict default risk and automate lending decisions. The system will analyze applicant financial history, employment status, and demographic information to predict default probability, which will then guide approval decisions and interest rate assignments.

Key stakeholders include regulatory bodies concerned with non-discrimination, business executives focused on profitability and risk management, loan applicants seeking fair access to credit, and compliance officers ensuring regulatory adherence. Fairness is particularly critical in this domain due to historical patterns of discriminatory lending, including redlining practices that systematically denied services to minority communities.

Problem Analysis

Applying the impossibility theorems to the lending algorithm reveals fundamental tensions between competing fairness definitions:

  1. Calibration vs. Balance: The risk scores used for lending decisions must be calibrated—a 70% default risk should mean the same thing regardless of demographic group. However, historical lending discrimination has created different observed default patterns across groups. Kleinberg et al.'s impossibility theorem shows that maintaining calibration while achieving balance (similar risk scores for similar repayment behaviors) is impossible unless default rates are identical across groups—which they rarely are due to historical disadvantage.
  2. Equal Opportunity vs. Demographic Parity: The lending algorithm could aim for equal opportunity, ensuring qualified applicants (those who would repay) have equal approval rates across groups. Alternatively, it could pursue demographic parity, ensuring equal approval rates regardless of qualification. Chouldechova's impossibility result proves these cannot be simultaneously satisfied when historical default rates differ across groups.
  3. Group vs. Individual Fairness: Ensuring equal approval rates across demographic groups might require treating applicants with similar financial profiles differently based on protected attributes. Conversely, strict individual fairness (treating similar applicants similarly) might perpetuate historical group disparities if financial indicators reflect historical discrimination.

From an intersectional perspective, the data reveal particularly complex patterns at the intersections of race, gender, and neighborhood. For example, analysis shows that women of color from certain neighborhoods have been historically underrepresented in the training data and show different relationships between financial indicators and repayment likelihood than would be predicted by examining either race or gender independently.

Solution Implementation

To navigate these fundamental trade-offs, the team implements a structured approach:

  1. Trade-off Analysis:
     • They quantify the mathematical incompatibilities by measuring how enforcing equal opportunity affects calibration and demographic parity.
     • They create a Pareto frontier showing the possible combinations of fairness criteria and their impact on accuracy.
     • They document how these trade-offs manifest specifically in their lending data, connecting them to historical lending patterns.

  2. Contextual Prioritization:
     • After analyzing historical lending discrimination, they identify equal opportunity (equal approval rates for qualified applicants across groups) as the primary fairness criterion.
     • They determine that calibration is a secondary but important consideration for regulatory compliance and risk management.
     • They explicitly acknowledge that this prioritization means demographic parity will not be fully satisfied.

  3. Implementation Strategy:
     • They implement equal opportunity as a primary constraint in their model development (a minimal post-processing sketch follows this list).
     • They incorporate calibration as a regularization term to maintain approximately consistent risk scores.
     • They develop post-processing adjustments that can partially address demographic parity concerns without violating equal opportunity.
     • They create comprehensive monitoring for all fairness definitions, including intersectional analysis.

  4. Documentation:
     • They create clear documentation explaining why equal opportunity was prioritized over demographic parity.
     • They acknowledge the limitations of this approach and potential concerns from stakeholders.
     • They develop visualization tools showing how these trade-offs affect different applicant populations.
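One minimal way to realize the equal-opportunity constraint is post-processing with group-specific thresholds. The sketch below (synthetic repayment scores and a hypothetical 80% target true positive rate, not the institution's actual system) approximately equalizes approval rates for qualified applicants and then reports the remaining demographic-parity gap for monitoring:

```python
import numpy as np

rng = np.random.default_rng(4)
n = 6000
A = rng.binomial(1, 0.5, n)
score = np.clip(rng.normal(0.5 + 0.1 * A, 0.2, n), 0, 1)    # hypothetical repayment scores
y = rng.binomial(1, score)                                   # 1 = would repay

# Pick group-specific thresholds so qualified applicants (y=1) are approved
# at approximately the same rate in both groups: a target TPR of 0.80.
target_tpr = 0.80
thresholds = {g: np.quantile(score[(A == g) & (y == 1)], 1 - target_tpr) for g in (0, 1)}
approve = np.where(A == 1, score >= thresholds[1], score >= thresholds[0]).astype(int)

for g in (0, 1):
    tpr = approve[(A == g) & (y == 1)].mean()
    rate = approve[A == g].mean()
    print(f"group {g}: threshold={thresholds[g]:.2f}, TPR={tpr:.2f}, approval rate={rate:.2f}")

dp_gap = abs(approve[A == 0].mean() - approve[A == 1].mean())
print(f"remaining demographic-parity gap: {dp_gap:.3f}")   # monitored, not optimized
```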

Outcomes and Lessons

The implementation results in several key insights:

  • The model achieves near-optimal equal opportunity (qualified applicants have similar approval rates across groups) while maintaining reasonable calibration.
  • As mathematically predicted, it does not achieve demographic parity—approval rates differ across groups, reflecting different qualification distributions that stem from historical disadvantage.
  • Intersectional analysis reveals that the equal opportunity constraint is less effectively satisfied for certain demographic intersections with smaller representation in the training data.

The most generalizable lessons include:

  1. The importance of explicit trade-off analysis—acknowledging that not all fairness definitions can be simultaneously satisfied forces transparent prioritization rather than misleading claims about comprehensive "fairness."
  2. The value of connecting fairness definitions to specific historical patterns—equal opportunity was prioritized specifically because historical lending discrimination often denied loans to qualified minority applicants.
  3. The need for ongoing monitoring of all fairness dimensions—even those not explicitly optimized require tracking to ensure disparities remain within acceptable thresholds.

These insights directly inform the Fairness Definition Selection Tool by demonstrating how impossibility theorems translate into practical prioritization decisions that balance mathematical constraints with ethical principles and application context.

5. Frequently Asked Questions

FAQ 1: Resolving Mathematical Impossibilities

Q: If fairness definitions are mathematically incompatible, does this mean we should abandon quantitative fairness approaches altogether?
A: No, the impossibility results don't suggest abandoning fairness metrics but rather approaching them with appropriate nuance. These mathematical constraints actually clarify rather than obscure the path forward by forcing explicit consideration of which fairness dimensions matter most in specific contexts. Just as physical laws constraining engineering don't lead us to abandon building bridges, fairness impossibility theorems don't mean we should abandon fair ML—they simply define the constraints within which we must work. The appropriate response is to make conscious, justifiable choices about which fairness definitions to prioritize based on domain-specific considerations, historical context, and stakeholder input. These trade-offs should be transparent, well-documented, and continually reassessed. Additionally, monitoring multiple fairness metrics—even those not explicitly optimized—provides a more comprehensive view of system behavior than focusing narrowly on a single definition.

FAQ 2: Communicating Trade-offs to Non-Technical Stakeholders

Q: How can I effectively communicate these complex mathematical trade-offs to business stakeholders who need to make decisions about fairness implementation?
A: Focus on translating mathematical constraints into concrete consequences using domain-specific examples rather than abstract formulations. Start by clearly articulating what each fairness definition means in your specific context—for instance, explaining that demographic parity would ensure equal loan approval rates across groups regardless of qualification, while equal opportunity would ensure qualified applicants have equal chances regardless of group membership. Develop visualizations that illustrate trade-offs without requiring mathematical understanding, such as simple graphs showing how optimizing for one fairness metric affects others. Create concrete examples showing how different definitions would affect specific individuals or scenarios: "Under definition A, this person would be approved but not under definition B." Frame trade-off discussions around ethical principles and organizational values rather than mathematical constraints: "We need to decide whether ensuring equal treatment of qualified applicants matters more than equal representation in outcomes." Finally, acknowledge that these are value judgments requiring diverse perspectives, not purely technical decisions that can be resolved through more sophisticated algorithms.

6. Summary and Next Steps

Key Takeaways

In this Unit, you've examined the fundamental tensions between different fairness definitions and the mathematical impossibility results that constrain their simultaneous implementation. You've learned that:

  • Multiple desirable fairness properties cannot be simultaneously satisfied except in highly restrictive conditions, as proven by impossibility theorems from Kleinberg et al. (2016) and Chouldechova (2017).
  • Specific tensions exist between calibration and balance, between demographic parity and individual fairness, and between various error rate parity metrics.
  • These trade-offs grow more complex when considering intersectional fairness across multiple protected attributes.
  • Effective fairness implementation requires explicit prioritization based on application context, historical patterns, and ethical principles rather than pursuing the mathematically impossible goal of satisfying all fairness definitions simultaneously.

These insights directly address our guiding questions by explaining why we cannot simultaneously satisfy multiple fairness definitions and providing frameworks for navigating the resulting trade-offs. The impossibility results are not merely theoretical curiosities but fundamental constraints that shape how fairness must be implemented in practice.

Application Guidance

To apply these concepts in your practical work:

  1. Start by mapping the mathematical relationships between relevant fairness definitions for your specific application, creating visualizations that illustrate their incompatibilities.
  2. Conduct explicit prioritization discussions with diverse stakeholders, connecting fairness definitions to historical patterns and specific harms relevant to your domain.
  3. Document your prioritization decisions and their rationales, acknowledging limitations and potential concerns from different perspectives.
  4. Implement monitoring frameworks that track all relevant fairness metrics, not just those explicitly prioritized, to maintain awareness of trade-offs.

For organizations new to these considerations, begin with pairwise analysis of the most relevant fairness definitions rather than attempting comprehensive trade-off analysis across all possible metrics. This focused approach provides valuable insights while remaining tractable for initial implementation.

Looking Ahead

In the next Unit, you will build on these impossibility results to develop a comprehensive Fairness Definition Selection Tool that guides the selection of appropriate fairness definitions based on application context. You will integrate the understanding of trade-offs and impossibilities into practical selection methodologies, creating decision processes that acknowledge mathematical constraints while providing systematic approaches for making principled choices.

The impossibility results you've examined in this Unit are not the end of fairness efforts but rather the beginning of more sophisticated approaches that incorporate these constraints into explicit, context-specific fairness strategies. By understanding what cannot be achieved, you develop more realistic and effective approaches to what can be achieved in specific contexts.


References

Agarwal, A., Beygelzimer, A., Dudík, M., Langford, J., & Wallach, H. (2018). A reductions approach to fair classification. In Proceedings of the 35th International Conference on Machine Learning (pp. 60-69).

Chouldechova, A. (2017). Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data, 5(2), 153-163. https://doi.org/10.1089/big.2016.0047

Corbett-Davies, S., Pierson, E., Feller, A., Goel, S., & Huq, A. (2017). Algorithmic decision making and the cost of fairness. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 797-806). https://doi.org/10.1145/3097983.3098095

Dwork, C., Hardt, M., Pitassi, T., Reingold, O., & Zemel, R. (2012). Fairness through awareness. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference (pp. 214-226). https://doi.org/10.1145/2090236.2090255

Foulds, J. R., Islam, R., Keya, K. N., & Pan, S. (2020). An intersectional definition of fairness. In Proceedings of the 36th IEEE International Conference on Data Engineering (pp. 1918-1921). https://doi.org/10.1109/ICDE48307.2020.00203

Kearns, M., Neel, S., Roth, A., & Wu, Z. S. (2018). Preventing fairness gerrymandering: Auditing and learning for subgroup fairness. In Proceedings of the 35th International Conference on Machine Learning (pp. 2564-2572).

Kleinberg, J., Mullainathan, S., & Raghavan, M. (2016). Inherent trade-offs in the fair determination of risk scores. arXiv preprint arXiv:1609.05807. https://arxiv.org/abs/1609.05807


Unit 5: Fairness Definition Selection Tool

1. Introduction

In Part 2, you learned about the conceptual foundations of fairness, mathematical formulations, legal standards, and the inherent tensions between competing fairness definitions. Now it's time to apply these insights by developing a practical tool that helps engineering teams select appropriate fairness definitions for their AI systems. The Fairness Definition Selection Tool you'll create will serve as the second component of the Sprint 1 Project - Fairness Audit Playbook, ensuring that fairness assessments target the right properties for specific application contexts.

2. Context

Imagine you are a staff engineer at a tech company that uses AI systems across multiple products. You've been approached again by the engineering team developing an AI-powered internal loan application system. While analyzing the system, they've discovered demographic disparities in approval rates but are unsure which fairness definition to prioritize, so they have asked for your help once more.

After some discussions with the team, you've determined that selecting appropriate fairness definitions will help them proceed. You've agreed to develop a tool that will help the team systematically select and document fairness definitions that are applicable in their situation. You'll also prepare a short case study demonstrating how to use your tool for their loan application system.

You've realized that their challenge represents a broader opportunity: developing a tool that all teams can use to select appropriate fairness definitions for their AI applications. You've named it the "Fairness Definition Selection Tool."

3. Objectives

By completing this project component, you will practice:

  • Translating abstract fairness concepts into practical selection methodologies for engineering teams.
  • Mapping application contexts to appropriate fairness definitions.
  • Creating structured approaches for navigating inherent tensions between competing fairness criteria.
  • Balancing theoretical rigor with practical usability in business environments.

4. Requirements

Your Fairness Definition Selection Tool must include:

  1. A concise fairness definition catalog that documents key definitions with their mathematical formulations and appropriate use cases.
  2. A decision tree that guides definition selection based on application context, historical patterns, and technical constraints.
  3. A trade-off analysis template for documenting selection rationales and acknowledging inherent tensions.
  4. User documentation that guides users on how to apply the Fairness Definition Selection Tool in practice.
  5. A case study demonstrating the tool's application to an internal loan application system.

5. Sample Solution

The following solution was developed by a former colleague and can serve as an example for your own work. Note that this solution wasn't specifically designed for AI applications and lacks some key components that your tool should include.

5.1 Fairness Definition Catalog

Demographic Parity (Statistical Parity)

  • Definition: The probability of receiving a positive outcome must be identical across all protected groups, regardless of qualification rates.
  • Mathematical form: P(Ŷ=1|A=a) = P(Ŷ=1|A=b) for all protected groups a, b.
  • When to Use: When equal representation is the primary goal, particularly in domains with historical exclusion or where base rates reflect structural disadvantages.
  • Limitations: Can reduce overall accuracy when base rates differ legitimately; may create reverse discrimination; does not ensure individual fairness.
  • Example: A university outreach algorithm that displays STEM career advertisements to different gender groups at equal rates to counteract historical underrepresentation.

Equal Opportunity

  • Definition: Qualified individuals must have the same probability of receiving a positive prediction across all protected groups.
  • Mathematical form: P(Ŷ=1|Y=1,A=a) = P(Ŷ=1|Y=1,A=b) for all protected groups a, b.
  • When to Use: When false negatives (missing qualified candidates) are more concerning than false positives, and when ground truth labels are reliable.
  • Limitations: Does not address false positive disparities; depends on trustworthy labels that may themselves contain historical bias.
  • Example: A medical screening tool ensuring that patients with a particular condition have the same probability of being correctly identified regardless of race or socioeconomic status.

Equalized Odds

  • Definition: Both true positive and false positive rates must be equal across all protected groups.
  • Mathematical form: P(Ŷ=1|Y=y,A=a) = P(Ŷ=1|Y=y,A=b) for y ∈ {0,1} and all protected groups a, b.
  • When to Use: When both error types (false positives and false negatives) have significant impacts and must be balanced across groups.
  • Limitations: Often creates more significant trade-offs with accuracy; typically requires more complex implementation approaches.
  • Example: A recidivism prediction system where both wrongly detaining non-recidivating individuals and mistakenly releasing recidivating individuals have serious consequences that must be equitably distributed.

Remember that fairness definition selection is not merely a technical choice but reflects fundamental philosophical and ethical principles about what constitutes fair treatment in your specific context.
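As a quick reference, here is a minimal sketch (plain NumPy with tiny hypothetical data) of the group-level quantities behind these three definitions: selection rate for demographic parity, true positive rate for equal opportunity, and the TPR/FPR pair for equalized odds.

```python
import numpy as np

def group_metrics(y_true, y_pred, group):
    """Selection rate, TPR, and FPR per group -- the ingredients of the three definitions."""
    out = {}
    for g in np.unique(group):
        m = group == g
        sel = y_pred[m].mean()                                   # demographic parity
        tpr = y_pred[m & (y_true == 1)].mean()                   # equal opportunity
        fpr = y_pred[m & (y_true == 0)].mean()                   # together with TPR -> equalized odds
        out[g] = {"selection_rate": sel, "TPR": tpr, "FPR": fpr}
    return out

# Tiny hypothetical example.
y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])
y_pred = np.array([1, 0, 0, 1, 1, 0, 1, 0])
group  = np.array(["a", "a", "a", "a", "b", "b", "b", "b"])
for g, stats in group_metrics(y_true, y_pred, group).items():
    print(g, {k: round(v, 2) for k, v in stats.items()})
```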

5.2 Definition Selection Decision Tree

Step 1: Historical Context Assessment

Question: Has historical context analysis revealed systematic exclusion or under-representation of protected groups?

  • If yes: Include demographic parity as a required fairness definition, then proceed to Step 2.
  • If no: Proceed directly to Step 2.

Step 2: Error-Impact Analysis

Determine which type of errors are most harmful in your application context:

Question: Which error type has greater negative impact in this application?

  • If false negatives (FN) are more harmful: Make equal opportunity a mandatory fairness definition.
  • If false positives (FP) are more harmful: Make predictive equality (equal false positive rates across groups) a mandatory fairness definition.
  • If both error types are equally critical: Make equalized odds a mandatory fairness definition.

After addressing the relevant error impacts, proceed to Step 3.

Step 3: Outcome Calibration Assessment

Question: Will the system expose probabilistic scores to users or analysts (e.g., credit-risk scores, insurance pricing, ranking algorithms)?

  • If yes: Add sufficiency (group-calibrated scores) to your fairness metric set.
  • If no: Use the definitions selected in the previous steps.
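The three steps above can also be captured as a small helper for documentation or tooling purposes. The following sketch simply encodes the decision tree as a function; the argument names and label strings are illustrative choices, not part of the sample solution.

```python
def select_definitions(historical_exclusion: bool,
                       worse_error: str,          # "FN", "FP", or "both"
                       exposes_scores: bool) -> set:
    """Walk the three decision-tree steps and return the selected fairness definitions."""
    selected = set()
    if historical_exclusion:                       # Step 1: historical context assessment
        selected.add("demographic parity")
    if worse_error == "FN":                        # Step 2: error-impact analysis
        selected.add("equal opportunity")
    elif worse_error == "FP":
        selected.add("predictive equality")
    else:
        selected.add("equalized odds")
    if exposes_scores:                             # Step 3: outcome calibration assessment
        selected.add("sufficiency (calibration)")
    return selected

# Example: a lending system with documented historical exclusion, where false negatives
# (denying qualified applicants) are judged more harmful, and risk scores are shown to analysts.
print(select_definitions(True, "FN", True))
```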

5.3 Trade-Off Analysis For The Hiring Case Study

Note: Unfortunately, the original template has been lost; only the completed case study could be found.

Selected Definition Documentation

Primary Fairness Definition Selected: Equal Opportunity (Equality of True Positive Rates)

Mathematical Formulation: P(Ŷ=1|Y=1,A=a) = P(Ŷ=1|Y=1,A=b) for all protected groups a, b

Selection Rationale:

  • Connection to historical context: Historical patterns show qualified candidates from underrepresented groups have been systematically overlooked in our industry's hiring practices. The Historical Context Assessment revealed documented bias where equally qualified candidates from certain demographic groups were less likely to receive interviews or offers.
  • Application requirements: Our hiring algorithm's primary goal is to identify qualified candidates who will succeed in the role. Missing qualified candidates (false negatives) is more harmful than interviewing some candidates who might not meet all qualifications (false positives).
  • Stakeholder priorities: Leadership has established diversity goals focused on ensuring qualified candidates from all backgrounds have equal opportunity to join the company. HR stakeholders emphasized finding the best candidates regardless of background, while DEI stakeholders prioritized addressing historical barriers to entry.

Trade-off Acknowledgment:

  • Fairness properties not satisfied: This definition does not guarantee demographic parity (equal selection rates across groups) or equalized odds (equality of both error types). We acknowledge that demographic representation may still vary if qualification rates differ across groups.
  • Performance implications: Implementing equal opportunity may slightly reduce precision compared to an unconstrained model, as we may interview more candidates to ensure qualified individuals from all groups have equal chances. Our analysis indicates this will increase the interview pool by approximately 12% with minimal impact on hire quality.
  • Monitoring approach: We will track both equal opportunity metrics and demographic parity as secondary metrics to understand overall representation outcomes. Monthly reviews will include disaggregated error rates across protected groups with specific attention to changes in false negative rates.