Context
Understanding where and how bias enters AI systems is crucial for effective fairness assessment and intervention. While Part 1 established the historical patterns of discrimination that persist in technology and Part 2 provided precise fairness definitions, Part 3 examines the specific mechanisms through which bias manifests throughout the machine learning lifecycle.
Bias in AI systems emerges through multiple pathways. These include historical bias (reflecting existing inequities), representation bias (uneven sampling across groups), measurement bias (flawed operationalization of concepts), and deployment bias (misalignment between development and application contexts). These categories help practitioners move beyond treating bias as a monolithic issue to identifying specific mechanisms requiring targeted interventions.
Data serves as the foundation for ML systems, making data-level biases particularly influential. Sampling procedures that underrepresent marginalized groups, measurement approaches that embed problematic assumptions, and feature engineering decisions that prioritize certain characteristics all introduce bias before model training begins.
Beyond data issues, bias emerges through algorithmic design choices and system dynamics. Optimization objectives that prioritize overall accuracy often underserve minority groups, while feedback loops can amplify small initial disparities into significant fairness concerns over time, particularly in systems where predictions influence future data collection.
The Bias Source Identification Tool you'll develop in Unit 5 represents the third component of the Fairness Audit Playbook (Sprint Project). This tool will help you systematically identify potential bias sources at different stages of the ML lifecycle, ensuring that your fairness assessments and interventions address root causes rather than merely symptoms.

Learning Objectives
By the end of this Part, you will be able to:
- Classify different types of bias using taxonomic frameworks. You will apply systematic frameworks to categorize biases by type, source, and lifecycle stage, moving beyond vague assessments of "unfairness" to precisely identify specific bias mechanisms.
- Analyze how data collection and representation choices introduce bias. You will examine how sampling procedures, measurement approaches, and feature engineering decisions embed biases in training data, identifying potential fairness issues at the data foundation.
- Evaluate how algorithm design and implementation choices affect fairness. You will assess how model architecture, optimization objectives, and hyperparameter choices can amplify or mitigate biases, recognizing how technical decisions impact fairness outcomes.
- Identify feedback loops and system dynamics that amplify biases. You will analyze how system interactions and deployment contexts create self-reinforcing cycles that magnify biases over time, addressing dynamic fairness concerns rather than viewing bias as static.
- Develop systematic methodologies for tracing unfairness to specific sources. You will create structured approaches for connecting observed fairness disparities to their underlying causes in complex systems, enabling targeted interventions that address fundamental issues rather than symptoms.
Units
Unit 1: Data Collection and Representation Biases
1. Conceptual Foundation and Relevance
Guiding Questions
- Question 1: How do sampling procedures, feature selection, and measurement choices in data collection embed or amplify existing biases in AI systems?
- Question 2: What systematic approaches can data scientists implement to identify, quantify, and mitigate representation biases before they become encoded in model parameters?
Conceptual Context
Understanding data collection and representation biases forms the critical first step in addressing fairness in AI systems. These biases represent the foundation upon which all subsequent modeling decisions rest. If biased data enters your pipeline, even the most sophisticated fairness interventions at later stages may prove insufficient to create truly fair outcomes.
Data collection and representation biases are particularly insidious because they often appear as technical or methodological decisions rather than explicit fairness concerns. Choices about which features to measure, how to operationalize concepts, where to gather samples, and how to encode categorical variables can embed historical patterns of discrimination into seemingly objective datasets. As Obermeyer, Powers, Vogeli, and Mullainathan (2019) demonstrated in their analysis of healthcare algorithms, even when protected attributes are excluded, the selection of proxy variables and measurement approaches can perpetuate significant biases that directly impact vulnerable populations (Obermeyer et al., 2019).
This Unit builds on the historical foundations established at the beginning of this Sprint and will serve as the basis for exploring algorithm design biases in Unit 2 and feedback loop amplification in Unit 3. The insights you develop here will directly inform the Bias Source Identification Tool we will develop in Unit 5, particularly in identifying data-level entry points where bias can infiltrate ML systems.
2. Key Concepts
Historical Bias in Data Collection
Historical bias occurs when data reflect existing prejudices, inequalities, or discriminatory practices in society—even when the data collection process itself appears statistically sound. This concept is crucial for AI fairness because machine learning systems trained on such data will inevitably learn and potentially amplify these historical patterns unless specific interventions are implemented.
Historical bias interacts with other forms of data bias by creating the underlying conditions in which they operate. For instance, sampling bias (discussed below) becomes particularly problematic when it intersects with historical bias, as the underrepresentation of certain groups compounds with historically biased measurements to create multiple layers of disadvantage.
Research by Buolamwini and Gebru (2018) provides a concrete application of this concept in facial recognition systems. They found that commercial gender classification systems exhibited error-rate gaps of up to 34.4 percentage points between lighter-skinned males and darker-skinned females. These disparities stemmed directly from historical biases in benchmark datasets that severely underrepresented darker-skinned individuals, particularly women (Buolamwini & Gebru, 2018). The practical implication is that systems built on such algorithms would systematically provide worse service to already marginalized groups.
For the Bias Source Identification Tool we will develop, understanding historical bias will be essential for distinguishing between bias patterns that emerge from data collection practices versus those introduced during model development. This distinction directs where in the ML pipeline interventions should be targeted and what forms they should take.
Sampling and Selection Bias
Sampling bias occurs when the process of data collection results in a dataset that does not accurately represent the population on which the model will ultimately be deployed. This concept is fundamental to AI fairness because models generalize based on the patterns present in their training data; if certain groups are underrepresented or overrepresented, the model will perform disproportionately well or poorly on those groups.
Sampling bias often interacts with measurement bias (discussed next) by influencing not just who appears in datasets but how their characteristics are measured, potentially creating compounding effects where marginalized groups are both underrepresented and less accurately characterized.
A powerful application example comes from Larson, Mattu, Kirchner, and Angwin’s (2016) investigation of COMPAS recidivism prediction algorithms, which demonstrated how sampling bias in criminal justice data led to significantly higher false positive rates for Black defendants compared to White defendants. The data reflected historical patterns of over-policing in certain communities, creating a feedback loop in which predictions based on biased samples reinforced discriminatory practices (Larson et al., 2016).
For our Bias Source Identification Tool, identifying sampling bias will require examining both the demographic distribution of datasets and the processes by which those datasets were constructed. This analysis will guide recommendations for data augmentation, reweighting, or the collection of additional samples to address representation disparities before model development begins.
Measurement Bias
Measurement bias emerges when the features selected, the variables operationalized, or the metrics chosen for a machine learning task systematically disadvantage certain groups. This concept is critical for AI fairness because seemingly technical choices about what to measure and how to measure it embed assumptions that can create or reinforce disparities.
Measurement bias connects deeply with both historical and sampling biases, as measurement choices often reflect historical practices and are constrained by available samples. The interplay between these biases creates complex patterns that require multifaceted analysis and intervention.
Research by Obermeyer et al. (2019) provides a striking example of measurement bias in healthcare. They discovered that an algorithm widely used to identify patients for additional care resources systematically discriminated against Black patients. The bias stemmed from using healthcare costs as a proxy for healthcare needs—a measurement choice that failed to account for historical inequities in healthcare access. Although Black patients had the same level of illness as White patients, they generated lower costs due to structural barriers to healthcare access, resulting in the algorithm systematically underrating their need for additional care (Obermeyer et al., 2019).
For the Bias Source Identification Tool component of our Sprint Project, understanding measurement bias will guide the development of systematic questionnaires and analysis approaches to examine how feature selection, variable operationalization, and metric definition might introduce fairness issues across different application domains. This will ensure the framework can identify bias sources across diverse data types and measurement approaches.
Feature Representation and Encoding Bias
Feature representation and encoding bias occurs when the way features are transformed, normalized, categorized, or encoded systematically disadvantages certain groups. This concept is essential for AI fairness because technical choices about data representation that appear neutral can actually embed or amplify biases when they interact with group differences in feature distributions or semantics.
This form of bias interacts with measurement bias but focuses specifically on how measurements are represented in the final dataset rather than on what is being measured. Both aspects require careful examination to identify potential fairness issues.
As an application example, consider research by Bolukbasi, Chang, Zou, Saligrama, and Kalai (2016) on bias in word embeddings, which demonstrated how standard encoding methods for text data captured and amplified gender stereotypes present in training corpora and then propagated those stereotypes to downstream applications that used the embeddings as feature representations. Their work showed that the analogy "man is to computer programmer as woman is to homemaker" emerged in standard word embeddings, demonstrating how encoding choices embedded historical gender disparities (Bolukbasi et al., 2016).
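To make the mechanics concrete, the short sketch below probes gendered associations by projecting word vectors onto a gender direction, in the spirit of Bolukbasi et al.'s analysis. The four-dimensional vectors are made-up placeholders, not real embeddings; in practice you would load vectors from your own trained embedding model.

```python
import numpy as np

# Toy 4-dimensional "embeddings" -- placeholder values for illustration only,
# not real trained vectors. Replace with embeddings from your own model.
vectors = {
    "he":         np.array([ 0.9,  0.1,  0.3,  0.0]),
    "she":        np.array([-0.9,  0.1,  0.3,  0.0]),
    "programmer": np.array([ 0.4,  0.8,  0.1,  0.2]),
    "homemaker":  np.array([-0.5,  0.7,  0.2,  0.1]),
}

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

# A simple gender direction: the difference between the "he" and "she" vectors.
gender_direction = vectors["he"] - vectors["she"]
gender_direction /= np.linalg.norm(gender_direction)

# Project occupation words onto the gender direction; large positive or negative
# projections suggest the encoding has absorbed gendered associations.
for word in ("programmer", "homemaker"):
    projection = cosine(vectors[word], gender_direction)
    print(f"{word:>11s}: projection onto gender direction = {projection:+.2f}")
```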
For our Bias Source Identification Tool, analyzing feature representation and encoding bias will require a systematic examination of data transformation pipelines, normalization procedures, and encoding schemes to identify potential disparate impacts across groups. This analysis will inform recommendations for alternative representation approaches that minimize bias while preserving necessary information content.
Domain Modeling Perspective
From a domain modeling perspective, data collection and representation biases map directly to specific components of ML systems:
- Data Collection Processes: Sampling procedures, inclusion/exclusion criteria, and data gathering methodologies all present entry points for bias.
- Feature Definition: Operationalizing real-world concepts into measurable features involves decisions that can embed unfair assumptions.
- Data Transformation Pipeline: Preprocessing steps—including normalization, binning, encoding, and imputation—can amplify or introduce biases.
- Dataset Documentation: Metadata about how data were collected and transformed provides crucial context for identifying potential bias sources.
These domain components form the earliest stages in the ML lifecycle where bias can enter, making them critical control points for fairness interventions. The Bias Source Identification Tool will need to provide systematic approaches for analyzing each of these components to identify specific mechanisms through which bias enters training data.

Intersectionality Consideration
Data collection and representation biases present unique challenges for intersectional fairness analysis, where multiple protected attributes interact to create distinct patterns of advantage or disadvantage. Datasets often have particularly poor representation at demographic intersections, creating amplified bias effects for individuals with multiple marginalized identities.
For example, as demonstrated by Buolamwini and Gebru (2018), facial recognition systems may show acceptable aggregate performance across gender (combining all races) and across race (combining all genders), while exhibiting significant accuracy disparities at specific intersections such as "dark-skinned women." These intersectional effects remain hidden unless explicitly analyzed (Buolamwini & Gebru, 2018).
In practical implementation, addressing intersectional considerations in data collection requires:
- Intentional sampling strategies that ensure adequate representation across demographic intersections, not just primary groups;
- Measurement approaches that are validated across intersectional subgroups to ensure consistent quality;
- Encoding methods that preserve intersectional information rather than flattening to single-attribute categories; and
- Analysis frameworks that explicitly examine bias patterns at demographic intersections rather than treating protected attributes independently.
The Bias Source Identification Tool must incorporate these intersectional considerations by developing analysis approaches that systematically examine how bias manifests across demographic intersections, even when sample sizes at those intersections are limited.
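As a starting point for such intersectional analysis, the sketch below counts and flags representation at every observed intersection of protected attributes using pandas. The column names, values, and minimum-count threshold are illustrative assumptions, not prescriptions; in practice the threshold would come from statistical power requirements.

```python
import pandas as pd

# Illustrative records -- column names and values are assumed for the sketch.
df = pd.DataFrame({
    "gender":   ["F", "F", "M", "M", "F", "M", "F", "M"],
    "race":     ["B", "W", "B", "W", "W", "W", "B", "W"],
    "age_band": ["<30", "30+", "<30", "30+", "<30", "<30", "30+", "30+"],
})

# Counts and shares for every observed intersection of the protected attributes.
intersections = (
    df.groupby(["gender", "race", "age_band"])
      .size()
      .rename("count")
      .reset_index()
)
intersections["share"] = intersections["count"] / len(df)

# Flag intersections too small to support reliable per-group evaluation.
MIN_COUNT = 2  # placeholder; derive from statistical power requirements
intersections["underrepresented"] = intersections["count"] < MIN_COUNT
print(intersections)
```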
3. Practical Considerations
Implementation Framework
To systematically identify and address data collection and representation biases, implement the following structured methodology:
1. Dataset Demographic Audit:
   - Analyze the demographic distribution of your dataset across protected attributes and their intersections.
   - Compare this distribution to relevant population benchmarks to identify representation disparities.
   - Calculate representation ratios and statistical significance of observed disparities.
2. Collection Process Analysis:
   - Document how samples were selected and what inclusion/exclusion criteria were applied.
   - Identify potential selection mechanisms that might create systematic under- or overrepresentation.
   - Analyze geographic, temporal, and contextual factors that influenced data collection.
3. Feature Construction Examination:
   - For each feature, document how it was operationalized and measured.
   - Analyze whether measurement approaches have been validated across demographic groups.
   - Identify potential proxies for protected attributes that might enable indirect discrimination.
4. Transformation Pipeline Audit:
   - Review normalization, encoding, and imputation procedures for potential disparate impacts.
   - Test alternative encoding methods and evaluate differences in resulting distributions.
   - Analyze how missing data patterns vary across groups and how imputation might affect fairness.
These methodologies integrate with standard ML workflows by extending data profiling and exploratory data analysis to explicitly incorporate fairness considerations. While they add analysis steps, they leverage many existing data science practices while reorienting them toward fairness evaluation.
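A minimal sketch of the dataset demographic audit step follows: it compares dataset group shares against an external population benchmark, computes representation ratios, and applies a chi-square test to the overall disparity. The group names, shares, dataset size, and 0.8 flagging threshold are placeholder assumptions for illustration.

```python
import pandas as pd
from scipy.stats import chisquare

# Dataset group shares versus an external population benchmark (e.g., census
# figures). Group names, shares, and the dataset size are placeholders.
dataset_shares = pd.Series({"group_a": 0.62, "group_b": 0.30, "group_c": 0.08})
benchmark_shares = pd.Series({"group_a": 0.50, "group_b": 0.35, "group_c": 0.15})
n = 10_000  # assumed number of records in the dataset

audit = pd.DataFrame({
    "dataset_share": dataset_shares,
    "benchmark_share": benchmark_shares,
})
# Representation ratio: 1.0 means parity with the benchmark; values well
# below 1.0 indicate underrepresentation relative to the population.
audit["representation_ratio"] = audit["dataset_share"] / audit["benchmark_share"]
audit["flagged"] = audit["representation_ratio"] < 0.8  # example threshold
print(audit.round(2))

# Chi-square test of whether observed group counts deviate from the benchmark.
observed = (audit["dataset_share"] * n).round()
expected = audit["benchmark_share"] * n
chi2, p_value = chisquare(f_obs=observed, f_exp=expected)
print(f"chi-square = {chi2:.1f}, p = {p_value:.3g}")
```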
Implementation Challenges
When implementing these approaches, practitioners commonly encounter the following challenges:
- Limited Demographic Information: Many datasets lack protected attribute information, making direct bias assessment difficult. Address this by:
  - Using validated proxy variables when appropriate (with careful documentation of limitations);
  - Performing sensitivity analysis to estimate potential bias ranges under different assumptions; and
  - Collecting additional demographic data when possible, with appropriate privacy protections.
- Stakeholder Alignment on Fairness Definitions: Different organizational stakeholders may have conflicting fairness priorities. Address this by:
  - Documenting explicit fairness definitions and metrics before beginning analysis;
  - Creating visualizations that illustrate trade-offs between different fairness definitions; and
  - Developing clear communication frameworks for explaining technical bias concepts to nontechnical stakeholders.
Successfully implementing data bias analysis requires computational resources for detailed distribution analysis, expertise in both statistical methods and domain knowledge of how bias manifests in specific contexts, and organizational commitment to addressing identified issues—even when they require additional data collection or preparation efforts.
Evaluation Approach
To assess whether your bias identification and mitigation approaches are effective, implement these evaluation strategies:
1. Comparative Distribution Analysis:
   - Calculate statistical distance metrics (e.g., Kullback–Leibler divergence, Earth Mover's distance) between distributions of features across demographic groups.
   - Set acceptable thresholds based on domain-specific fairness requirements.
   - Document distribution changes after bias mitigation interventions.
2. Representation Metrics:
   - Calculate representation disparity metrics showing how sample proportions deviate from population benchmarks.
   - Establish minimum representation thresholds for demographic intersections based on statistical power requirements.
   - Track improvements in representation through data augmentation or reweighting.
3. Measurement Validation:
   - Assess feature validity across demographic groups through correlation analysis with ground truth when available.
   - Establish acceptable bounds for measurement differences between groups.
   - Document measurement improvements through alternative operationalization approaches.
These metrics should be integrated with your organization's broader fairness assessment framework, providing inputs to subsequent bias identification components focusing on algorithmic design and feedback effects.
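The sketch below illustrates the comparative distribution analysis step, computing the Earth Mover's (Wasserstein) distance and a smoothed KL divergence between one feature's distributions in two groups. The simulated feature values, bin count, and smoothing constant are assumptions for illustration; a real audit would use your actual features and domain-set thresholds.

```python
import numpy as np
from scipy.stats import entropy, wasserstein_distance

rng = np.random.default_rng(0)

# Placeholder feature values for two demographic groups (e.g., a credit feature).
group_a = rng.normal(loc=650, scale=60, size=5_000)
group_b = rng.normal(loc=610, scale=75, size=1_200)

# Earth Mover's (Wasserstein) distance works directly on the raw samples.
emd = wasserstein_distance(group_a, group_b)

# KL divergence needs discretized, smoothed histograms on a shared support.
bins = np.histogram_bin_edges(np.concatenate([group_a, group_b]), bins=30)
p, _ = np.histogram(group_a, bins=bins, density=True)
q, _ = np.histogram(group_b, bins=bins, density=True)
eps = 1e-9  # smoothing avoids division by zero in empty bins
kl = entropy(p + eps, q + eps)

print(f"Wasserstein distance: {emd:.1f}")
print(f"KL divergence (A || B): {kl:.3f}")
# Compare these values against domain-specific thresholds, and re-run the same
# calculation after any mitigation step to document the change.
```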
4. Case Study: Credit Scoring System
Scenario Context
A financial services company is developing a machine learning–based credit scoring system to predict default risk for loan applicants. The system will inform lending decisions, interest rates, and credit limits offered to customers. Key stakeholders include the lending institution concerned with risk management, regulators focused on fair lending practices, and diverse applicants seeking equitable access to financial services.
Fairness is particularly critical in this domain due to historical patterns of lending discrimination based on race, gender, and geographic location. Legal frameworks, including the Equal Credit Opportunity Act, specifically prohibit discrimination in lending, making fairness both an ethical and compliance requirement.
Problem Analysis
Applying core concepts from this Unit reveals several potential data biases in the credit scoring scenario:
- Historical Bias: The company plans to use its historical lending data for training. Analysis reveals that these data reflect past discriminatory lending practices in which certain neighborhoods (predominantly minority-populated) received fewer loans despite similar creditworthiness to applicants in other areas. This historical pattern created a "financial redlining" effect that would be perpetuated in the new model if not addressed.
- Sampling Bias: The historical dataset predominantly contains applicants who received loans, creating selection bias because rejected applicants are not well represented. Further examination shows that the data underrepresent younger applicants, recent immigrants, and individuals from rural areas—groups with less established credit histories but not necessarily higher default risks.
- Measurement Bias: The operationalization of "creditworthiness" relies heavily on traditional credit history length and conventional financial products, such as credit cards and mortgages. This measurement approach disadvantages groups that use alternative financial services or have limited credit histories despite responsible financial behavior (e.g., consistently paying rent and utilities on time).
- Encoding Bias: Categorical variables—including occupation and education—are encoded using schemes that implicitly rank certain professions and educational paths higher than others in ways that correlate with protected attributes. In addition, zip codes are encoded as categorical variables with unique embeddings, potentially encoding neighborhood demographics into the feature representation.
From an intersectional perspective, the data show particularly sparse representation at the intersection of young age (under 30), female gender, and minority racial status, creating a high risk of poor model performance for these intersectional groups.
Solution Implementation
To address these identified data biases, the team implemented a structured approach:
- For Historical Bias, they:
  - Collaborated with domain experts to identify historically discriminatory patterns in lending data;
  - Augmented their training data with additional sources, including data from community development financial institutions serving underrepresented communities; and
  - Created synthetic data using fairness-aware generation techniques to fill representational gaps.
- For Sampling Bias, they:
  - Implemented a stratified sampling approach ensuring adequate representation across demographic groups and intersections;
  - Applied appropriate reweighting techniques to adjust for representation disparities; and
  - Used reject inference techniques to model outcomes for historically rejected applicants.
- For Measurement Bias, they:
  - Expanded their feature set to include alternative financial data, such as rental and utility payment history;
  - Validated all features for predictive accuracy across demographic groups, removing features that showed divergent validity; and
  - Developed composite features that captured financial responsibility through multiple complementary measures.
- For Encoding Bias, they:
  - Redesigned categorical encoding schemes to minimize correlations with protected attributes;
  - Replaced zip code variables with more generalizable features about community economic indicators; and
  - Implemented fairness constraints during feature transformation to ensure that encoded representations maintained fairness properties.
Throughout implementation, they maintained explicit focus on intersectional effects, ensuring that their mitigation strategies addressed the specific challenges faced by applicants at the intersection of multiple marginalized identities.
Outcomes and Lessons
The implementation resulted in several measurable improvements:
- Demographic representation disparities decreased by 78% across all protected groups.
- Statistical disparities in feature distributions between demographic groups were reduced by 64%.
- Model performance differences across demographic intersections decreased by 56%, while overall predictive accuracy was maintained.
Key challenges remained, including limited historical data for certain intersectional groups and some tension between regulatory requirements for model explainability and more complex fairness-promoting techniques.
The most generalizable lessons included:
- The importance of domain expertise in identifying historical bias patterns specific to financial services.
- The effectiveness of combining multiple complementary approaches (data augmentation, reweighting, and measurement expansion) rather than relying on a single intervention.
- The critical need for intersectional analysis throughout the process, as aggregate improvements sometimes masked persistent issues for specific intersectional groups.
These insights directly informed the development of the Bias Source Identification Tool, particularly in creating domain-specific evaluation questionnaires and establishing appropriate thresholds for representation requirements across different application contexts.
5. Frequently Asked Questions
FAQ 1: Measuring Representation Without Demographic Data
Q: How can I identify and address sampling and representation biases when my dataset lacks explicit demographic information due to privacy regulations or other constraints?
A: When demographic data are unavailable, you can implement proxy-based analysis, synthetic population comparison, and feature distribution analysis. Use geographically aggregated statistics (e.g., census tract data) as indirect measures, employ privacy-preserving techniques such as federated analysis on protected attributes, and examine distributional differences in supposedly neutral features across subpopulations. Document all assumptions and limitations of these approaches, and, where possible, validate findings through limited demographic audits on smaller, privacy-compliant samples.
FAQ 2: Distinguishing Data Bias From Societal Patterns
Q: When is a statistical disparity in my data a reflection of actual societal patterns versus a problematic bias that requires intervention?
A: This distinction requires both technical analysis and normative judgment. Technically, examine whether disparities persist after controlling for legitimate factors directly related to your prediction target. Analyze whether measurement validity differs across groups, indicating potential bias in how concepts are operationalized. From a normative perspective, assess whether observed patterns reflect historical inequities that your system should avoid perpetuating, even if statistically predictive. The key determination is whether the statistical patterns represent legitimate predictive signals for your specific task or reflect structural disadvantages that, if encoded in your model, would reproduce or amplify societal inequities.
6. Summary and Next Steps
Key Takeaways
Data collection and representation biases form the foundation of fairness issues in AI systems, as biased data inevitably lead to biased models regardless of subsequent interventions. The key concepts from this Unit include:
- Historical bias reflects past prejudices and discriminatory practices in the data, creating a foundation upon which subsequent biases build.
- Sampling bias occurs when data collection results in unrepresentative datasets that systematically disadvantage certain groups.
- Measurement bias emerges from the operationalization of concepts into measurable features in ways that embed unfair assumptions.
- Feature representation bias results from encoding and transformation choices that can amplify disparities across groups.
These concepts directly address our guiding questions by explaining how seemingly technical data decisions can embed bias and by providing systematic approaches to identify these issues before model development begins.
Application Guidance
To apply these concepts in your practical work:
- Begin any new ML project with a comprehensive data bias audit before model development.
- Document data collection processes, sampling procedures, and representation statistics as standard practice.
- Validate measurement approaches and feature encodings across demographic groups when demographic data are available.
- Implement bias mitigation strategies at the data level first, before attempting algorithmic interventions.
For organizations new to fairness considerations, start by focusing on basic representation analysis and documentation of data collection processes, then progressively incorporate more sophisticated analyses of measurement and encoding biases as capabilities mature.
Looking Ahead
In the next Unit, we will build on this foundation by examining algorithm design and implementation biases—the ways that modeling choices can introduce or amplify unfairness even with perfectly balanced data. You will learn how different learning algorithms, optimization objectives, and hyperparameter choices can create fairness issues, and how to identify these algorithmic bias sources systematically.
The data-level biases we have examined here often interact with algorithmic choices to create complex fairness challenges that neither data interventions nor algorithmic modifications alone can fully address. Understanding both components and their interactions is essential for developing truly effective fairness strategies.
References
Barocas, S., Hardt, M., & Narayanan, A. (2019). Fairness and machine learning: Limitations and opportunities. https://fairmlbook.org
Bolukbasi, T., Chang, K. W., Zou, J. Y., Saligrama, V., & Kalai, A. T. (2016). Man is to computer programmer as woman is to homemaker? Debiasing word embeddings. In Advances in Neural Information Processing Systems (pp. 4349–4357). https://papers.nips.cc/paper/2016/file/a486cd07e4ac3d270571622f4f316ec5-Paper.pdf
Buolamwini, J., & Gebru, T. (2018). Gender shades: Intersectional accuracy disparities in commercial gender classification. In Proceedings of the 1st Conference on Fairness, Accountability, and Transparency (pp. 77–91). https://proceedings.mlr.press/v81/buolamwini18a.html
Larson, J., Mattu, S., Kirchner, L., & Angwin, J. (2016). How we analyzed the COMPAS recidivism algorithm. ProPublica. https://www.propublica.org/article/how-we-analyzed-the-compas-recidivism-algorithm
Obermeyer, Z., Powers, B., Vogeli, C., & Mullainathan, S. (2019). Dissecting racial bias in an algorithm used to manage the health of populations. Science, 366(6464), 447–453. https://doi.org/10.1126/science.aax2342
Wilson, B., Hoffman, J., & Morgenstern, J. (2019). Predictive inequity in object detection. arXiv preprint arXiv:1902.11097. https://arxiv.org/abs/1902.11097
Unit 2: Algorithm Design and Implementation Biases
1. Conceptual Foundation and Relevance
Guiding Questions
- Question 1: How do algorithmic design choices and implementation decisions encode or amplify biases, even when using seemingly balanced training data?
- Question 2: What systematic approaches can you implement to identify and mitigate bias introduced through model architecture, optimization objectives, and hyperparameter choices?
Conceptual Context
While data biases often receive primary attention in fairness discussions, the algorithms that process this data introduce their own significant sources of bias. Even with perfectly balanced training data, the choices you make about model architecture, optimization objectives, loss functions, and hyperparameters can create or amplify unfairness through purely algorithmic mechanisms.
These algorithm design biases are particularly insidious because they often appear as technical implementation details rather than explicit fairness concerns. Decisions about which model architecture to use, how to define your loss function, or what regularization approach to apply can embed assumptions that systematically advantage certain groups over others without explicit intention. As Hooker (2021) demonstrated in her analysis of model compression techniques, seemingly neutral efficiency improvements like pruning can disproportionately impact performance on underrepresented groups, creating disparate outcomes through purely algorithmic mechanisms (Hooker, 2021).
This Unit builds directly on the data collection and representation biases examined in Unit 1, showing how algorithmic choices can compound or sometimes mitigate these earlier biases. It also establishes essential foundations for understanding feedback loops and amplification effects that we'll explore in Unit 3. The insights you develop here will directly inform the Bias Source Identification Methodology we will develop in Unit 5, particularly in identifying algorithm-level entry points where bias can infiltrate ML systems.
2. Key Concepts
Inductive Bias and Model Architecture Selection
Inductive bias—the set of assumptions that a learning algorithm uses to generalize from limited data—significantly influences how models perform across different population groups. This concept is fundamental to AI fairness because different model architectures encode different assumptions about how features relate to outcomes, which can align better with patterns in majority groups than in minority ones.
Inductive bias interacts with data representation biases by determining how effectively models can learn from limited or skewed representations of minority groups. The same dataset processed through different model architectures may produce varying levels of fairness based on how well each architecture's inductive biases align with underlying patterns for different groups.
Research by Urbanek et al. (2019) demonstrated how different model architectures processing identical datasets produced varying levels of gender bias in natural language processing tasks. Their work revealed that transformer-based models, recurrent neural networks, and convolutional architectures each encoded different patterns of gender associations despite training on the same text corpus, showing how architectural choices alone can influence fairness outcomes (Urbanek et al., 2019).
For the Bias Source Identification Methodology we'll develop in Unit 5, understanding inductive bias will be essential for determining how different model architectures might interact with other bias sources. This understanding enables more precise identification of where in the ML pipeline fairness issues originate and which interventions might be most effective.
The key insight is that model architectures are not neutral technical choices but rather encode specific assumptions that may work better for some groups than others. For instance:
- Linear models assume relationships between features and outcomes are linear, which may hold better for majority patterns with more training examples.
- Tree-based methods segment the feature space in ways that might create lower-quality splits for underrepresented groups with fewer samples.
- Deep learning architectures make implicit assumptions about data structure through their connectivity patterns, which might align differently with patterns in different demographic groups.
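A minimal experiment in this spirit is sketched below, assuming a synthetic dataset in which the majority group's label follows a linear rule and the minority group's an XOR-like rule: the same data are fed to a linear model and a tree ensemble, and accuracy is reported per group so the architecture-dependent gap becomes visible. The group sizes and data-generating rules are illustrative assumptions.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Synthetic data (an illustrative assumption): the majority group's label follows
# a linear rule, the minority group's an XOR-like rule a linear model cannot fit.
n_major, n_minor = 9_000, 1_000
feats = rng.normal(size=(n_major + n_minor, 2))
group = np.array(["majority"] * n_major + ["minority"] * n_minor)
is_minor = group == "minority"
y = np.where(is_minor,
             (feats[:, 0] * feats[:, 1] > 0).astype(int),   # non-linear pattern
             (feats[:, 0] + feats[:, 1] > 0).astype(int))   # linear pattern
X = np.column_stack([feats, is_minor.astype(float)])        # group indicator feature

X_tr, X_te, y_tr, y_te, g_tr, g_te = train_test_split(
    X, y, group, test_size=0.3, random_state=0, stratify=group)

# Same data, two architectures: report accuracy per group, not just in aggregate.
for name, model in [("linear model", LogisticRegression()),
                    ("tree ensemble", GradientBoostingClassifier(random_state=0))]:
    model.fit(X_tr, y_tr)
    preds = model.predict(X_te)
    for g in ("majority", "minority"):
        mask = g_te == g
        acc = accuracy_score(y_te[mask], preds[mask])
        print(f"{name:>13s} | {g:>8s} accuracy: {acc:.2f}")
```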
Optimization Objectives and Loss Functions
Optimization objectives and loss functions directly shape what models learn by defining what constitutes "good" performance. This concept is critical for AI fairness because standard objectives that maximize aggregate performance metrics often implicitly prioritize accuracy on majority groups at the expense of minority group performance.
These objectives interact with inductive bias by guiding how models leverage their architectural capabilities during learning. Together, they determine which patterns receive attention during optimization and which might be ignored as statistically insignificant despite their importance for minority groups.
Hashimoto et al. (2018) demonstrated how standard empirical risk minimization can lead to "representation disparity," where models progressively perform worse on minority groups as training proceeds. Their work showed that even when minority examples are present in training data, models optimized for average performance naturally focus on majority patterns that contribute more to the overall loss, gradually amplifying initial performance disparities (Hashimoto et al., 2018).
For our Bias Source Identification Methodology, understanding how optimization objectives contribute to bias will guide the development of systematic tests to identify whether performance disparities stem from loss function design rather than data issues or architectural limitations. This distinction is crucial for selecting appropriate mitigation strategies.
Standard loss functions that can introduce bias include:
- Mean squared error and cross-entropy loss: By averaging across all examples, these common losses implicitly weight majority group patterns more heavily.
- Accuracy maximization: Objectives that maximize overall accuracy can sacrifice minority group performance when these groups constitute a small percentage of the dataset.
- Proxy objectives: When direct optimization of target metrics is difficult, proxy objectives may correlate differently with desired outcomes across demographic groups.
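The sketch below illustrates the averaging effect with placeholder predictions: the minority group's poor loss barely moves the headline number, while group-balanced sample weights (each group receiving equal total weight) make the disparity impossible to ignore. Group sizes and predicted probabilities are assumptions chosen only to make the effect visible.

```python
import numpy as np
from sklearn.metrics import log_loss

rng = np.random.default_rng(1)

# Placeholder predictions for a binary classifier: confident and directionally
# correct for the 95% majority group, nearly uninformative for the 5% minority.
n_major, n_minor = 9_500, 500
y = np.concatenate([rng.integers(0, 2, n_major), rng.integers(0, 2, n_minor)])
group = np.array(["majority"] * n_major + ["minority"] * n_minor)
p = np.where(y == 1, 0.8, 0.2).astype(float)   # majority: confident, correct
p[group == "minority"] = 0.55                  # minority: near-uninformative

# Standard cross-entropy averages over all examples, so the minority group's
# poor loss barely moves the overall number.
print(f"overall log loss: {log_loss(y, p, labels=[0, 1]):.3f}")
for g in ("majority", "minority"):
    m = group == g
    print(f"  {g:>8s} log loss: {log_loss(y[m], p[m], labels=[0, 1]):.3f}")

# Group-balanced weighting gives each group equal total weight, so a weighted
# objective can no longer ignore the minority group.
weights = np.where(group == "minority", 0.5 / n_minor, 0.5 / n_major)
balanced = log_loss(y, p, sample_weight=weights, labels=[0, 1])
print(f"group-balanced log loss: {balanced:.3f}")
```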
Regularization and Hyperparameter Choices
Regularization approaches and hyperparameter selections, often viewed as purely technical tuning decisions, can significantly impact fairness by influencing which patterns models extract from data. This concept matters for AI fairness because these choices affect how models balance simplicity against fidelity to training data, which has disparate impacts across demographic groups.
Regularization interacts with both inductive bias and optimization objectives by constraining how models can leverage their architectural capabilities during learning and by shifting the balance between different components of the objective function. These interactions create complex patterns of advantage and disadvantage that standard hyperparameter tuning processes rarely consider.
Research by Kleinberg et al. (2018) illustrated how regularization can have disparate impacts across groups. Their analysis showed that L1 regularization, which promotes sparsity, often eliminates features that are predictive for minority groups but contribute less to overall model performance. Similarly, early stopping, a form of implicit regularization, can freeze models at points where majority group performance has converged but minority group performance is still improving (Kleinberg et al., 2018).
For the Bias Source Identification Methodology, understanding regularization impacts will help develop test cases that isolate how hyperparameter choices might contribute to observed disparities. This analysis can distinguish bias introduced during regularization from issues stemming from data or model architecture.
Common hyperparameter choices with fairness implications include:
- Regularization strength parameters (e.g., λ in L1/L2 regularization): Stronger regularization may disproportionately eliminate features important for minority groups.
- Learning rates and schedules: Different learning dynamics for majority versus minority patterns can make optimization path-dependent in ways that affect fairness.
- Model capacity parameters (e.g., tree depth, network width/depth): Capacity constraints may limit a model's ability to learn complex patterns specific to minority groups.
- Early stopping criteria: Stopping optimization based on aggregate metrics can halt training before minority group performance converges.
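As a hedged illustration of the first point, the sketch below builds synthetic data in which one feature is predictive for everyone and a second carries signal only within a small minority group, then sweeps the Lasso penalty. The group sizes, coefficients, and alpha values are assumptions, but the pattern of the minority-only coefficient vanishing first is the behavior described above.

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(2)

# Synthetic setup (an assumption for illustration): feature 0 is predictive for
# everyone, feature 1 carries signal only within a 5% minority group.
n_major, n_minor = 9_500, 500
X = rng.normal(size=(n_major + n_minor, 2))
is_minor = np.zeros(n_major + n_minor, dtype=bool)
is_minor[n_major:] = True

y = 1.0 * X[:, 0]                      # shared signal
y[is_minor] += 2.0 * X[is_minor, 1]    # minority-only signal
y += rng.normal(scale=0.5, size=y.shape)

for alpha in (0.001, 0.01, 0.1):
    coefs = Lasso(alpha=alpha).fit(X, y).coef_
    print(f"alpha={alpha:<5} coef(shared)={coefs[0]:+.3f} "
          f"coef(minority-only)={coefs[1]:+.3f}")
# As alpha grows, the minority-only coefficient is typically driven to zero well
# before the shared one, illustrating how a uniform regularization strength can
# strip out features that matter mainly for smaller groups.
```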
Evaluation Protocol Design
Evaluation protocol design—how models are tested, which metrics are prioritized, and how results are interpreted—significantly influences which fairness issues are detected and addressed. This concept is fundamental to AI fairness because evaluation choices determine which disparities become visible to practitioners and which remain hidden.
Evaluation protocols interact with all previously discussed algorithmic choices by determining how we measure their effects. Without appropriate evaluation across demographic groups, biases introduced through architecture, optimization, or regularization choices may go undetected despite significant impacts on minority groups.
Larson et al. (2017) demonstrated the importance of evaluation design in their analysis of natural language processing benchmarks. They showed how standard evaluation metrics often failed to capture performance disparities across demographic groups, creating an illusion of equal progress when improvements actually benefited some populations more than others. Their work emphasized the need for disaggregated evaluation protocols that explicitly assess performance across different demographic groups and intersections (Larson et al., 2017).
For our Bias Source Identification Methodology, understanding evaluation bias will guide the development of testing protocols that can detect algorithmic fairness issues even when they're not explicitly being sought. This awareness ensures that bias identification processes don't inherit the same blind spots as the systems they're designed to evaluate.
Key elements of evaluation protocols with fairness implications include:
- Test set composition: How representative testing data is across demographic groups directly affects which disparities become visible.
- Metric selection: Different evaluation metrics may highlight or obscure fairness issues based on what aspects of performance they measure.
- Significance testing: Statistical approaches for determining whether performance differences are meaningful may be underpowered for minority groups with fewer samples.
- Slice analysis: Whether evaluation examines performance on specific population subgroups or only in aggregate directly affects fairness visibility.
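A minimal sketch of disaggregated (slice) evaluation follows: per-group and intersectional metrics are computed alongside the aggregate figure so that disparities cannot hide in the average. The column names, group labels, and random placeholder predictions are assumptions; in practice you would plug in your own evaluation frame.

```python
import numpy as np
import pandas as pd
from sklearn.metrics import accuracy_score, recall_score

rng = np.random.default_rng(3)

# Placeholder evaluation frame: true labels, model predictions, and two
# protected attributes. Column names and values are assumed for the sketch.
n = 4_000
frame = pd.DataFrame({
    "y_true": rng.integers(0, 2, n),
    "y_pred": rng.integers(0, 2, n),
    "gender": rng.choice(["F", "M"], n),
    "race":   rng.choice(["A", "B", "C"], n, p=[0.6, 0.3, 0.1]),
})

def slice_metrics(g: pd.DataFrame) -> pd.Series:
    """Metrics for one slice of the evaluation frame."""
    return pd.Series({
        "n": len(g),
        "accuracy": accuracy_score(g["y_true"], g["y_pred"]),
        "recall": recall_score(g["y_true"], g["y_pred"], zero_division=0),
    })

# Aggregate and intersectional slices side by side.
print("overall:\n", slice_metrics(frame), "\n")
print("by gender x race:\n",
      frame.groupby(["gender", "race"]).apply(slice_metrics).round(3))
```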
Domain Modeling Perspective
From a domain modeling perspective, algorithm design and implementation biases map to specific components of ML systems:
- Model Architecture Selection: How structural choices about model type and connectivity encode assumptions about data relationships.
- Loss Function Design: How performance objectives mathematically define what the model should optimize for.
- Regularization Framework: How constraints on model complexity influence which patterns are learned versus ignored.
- Hyperparameter Configuration: How technical tuning parameters balance various aspects of model behavior.
- Evaluation Infrastructure: How testing protocols and metrics assess performance across different population groups.
This domain mapping helps you systematically analyze how algorithmic choices at different stages of model development might introduce or amplify bias. The Bias Source Identification Methodology will leverage this mapping to create structured approaches for identifying algorithm-level bias sources throughout the development workflow.

Conceptual Clarification
To clarify these abstract algorithmic concepts, consider the following analogies:
- Inductive bias in model architectures functions like different grading rubrics teachers might use. Just as a rubric focused on multiple-choice questions versus one emphasizing essays would advantage students with different strengths, different model architectures naturally perform better on patterns that align with their structural assumptions. A linear model is like a rubric that only rewards simple, direct relationships, while a deep neural network is like a complex rubric that can reward intricate patterns—but might be applied inconsistently to unfamiliar cases.
- Optimization objectives and loss functions are similar to key performance indicators (KPIs) in business. When a company optimizes exclusively for a metric like total revenue, it might neglect smaller market segments that contribute little to the overall number despite their strategic importance. Similarly, when models optimize for aggregate performance metrics, they naturally focus on patterns common in majority groups that contribute more to the total loss, potentially underserving minority groups even when this creates significant relative disparities.
- Regularization and hyperparameter tuning resemble editorial policy in newspaper coverage. Just as editorial choices about article length and complexity affect which stories can be adequately covered (often disadvantaging complex issues affecting minority communities), regularization affects which patterns models can learn (often disadvantaging complex minority group patterns). Stronger regularization is like stricter word count limits—it may disproportionately constrain coverage of stories requiring nuanced explanation, just as it may eliminate features particularly important for understanding minority group patterns.
Intersectionality Consideration
Algorithm design biases present unique challenges for intersectional fairness, where multiple protected attributes interact to create distinct patterns that affect individuals with overlapping marginalized identities. Models optimized for aggregate performance or even single-attribute fairness may still perform poorly on intersectional subgroups due to both data limitations and algorithmic mechanisms.
As demonstrated by Kearns et al. (2018) in their work on "fairness gerrymandering," algorithms can satisfy fairness constraints with respect to individual protected attributes while still discriminating against specific intersectional subgroups. Their research showed that standard approaches to algorithmic fairness often failed to protect individuals at the intersection of multiple marginalized identities, revealing the need for explicitly intersectional fairness formulations (Kearns et al., 2018).
Addressing intersectional considerations in algorithm design requires:
- Model architectures that can effectively learn from smaller intersectional subgroups despite limited samples;
- Loss functions that balance performance across both high-level groups and their intersections;
- Regularization approaches that preserve important features for intersectional subgroups despite their statistical minority status; and
- Evaluation protocols that explicitly assess performance across demographic intersections, not just along individual protected attributes.
The Bias Source Identification Methodology must incorporate these intersectional considerations by developing analysis approaches that systematically examine algorithm performance across demographic intersections, even when these groups constitute small minorities in the dataset.
3. Practical Considerations
Implementation Framework
To systematically identify and address algorithm-level biases, implement the following structured methodology:
1. Model Architecture Analysis:
   - Examine how different model architectures perform across demographic groups with the same training data.
   - Analyze whether architectural assumptions align with patterns present in minority groups.
   - Test whether increasing model capacity differentially improves performance across groups.
   - Document architecture-specific fairness implications to inform selection decisions.
2. Loss Function Evaluation:
   - Decompose performance metrics by demographic group to identify disparate optimization patterns.
   - Analyze convergence trajectories to determine whether minority group performance plateaus later than majority groups.
   - Test modified loss functions that give equal weight to examples regardless of group size.
   - Implement group-aware losses that explicitly balance performance across demographic categories.
3. Regularization Impact Assessment:
   - Compare feature importance across demographic groups before and after regularization.
   - Analyze how different regularization parameters affect performance disparities.
   - Implement group-specific regularization to account for different sample sizes.
   - Document how early stopping points affect the fairness-performance frontier.
4. Evaluation Protocol Design:
   - Implement disaggregated evaluation that examines performance across both protected attributes and their intersections.
   - Develop statistical approaches appropriate for different group sizes.
   - Create performance dashboards that highlight disparities across multiple metrics.
   - Establish minimum performance thresholds for all demographic groups rather than just in aggregate.
These methodologies integrate with standard ML workflows by extending model selection, optimization, and evaluation processes to explicitly incorporate fairness considerations. While they add analytical complexity, they leverage many existing practices while orienting them toward detecting and addressing algorithmic sources of bias.
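One way to operationalize the convergence-trajectory and early-stopping checks above is sketched below, assuming a synthetic imbalanced dataset: an incrementally trained linear classifier logs per-group loss after each epoch, making it easy to see whether aggregate-based early stopping would freeze the model before minority-group performance has converged. The data-generating setup and hyperparameters are placeholder assumptions.

```python
import numpy as np
from sklearn.linear_model import SGDClassifier
from sklearn.metrics import log_loss

rng = np.random.default_rng(4)

# Synthetic imbalanced data -- group sizes and signal structure are assumptions:
# the majority group's labels depend on features 0-1, the minority's on 2-4.
n_major, n_minor = 9_000, 1_000
X = rng.normal(size=(n_major + n_minor, 5))
group = np.array(["majority"] * n_major + ["minority"] * n_minor)
w_major = np.array([1.0, 1.0, 0.0, 0.0, 0.0])
w_minor = np.array([0.0, 0.0, 1.0, 1.0, 1.0])
logits = np.where(group == "majority", X @ w_major, X @ w_minor)
y = (logits + rng.normal(scale=0.5, size=logits.shape) > 0).astype(int)

clf = SGDClassifier(loss="log_loss", learning_rate="constant", eta0=0.01,
                    random_state=0)
classes = np.array([0, 1])

# Train incrementally and record per-group loss after each epoch; a slowly
# converging or plateauing minority curve suggests aggregate early stopping
# would freeze the model before that group's performance has converged.
for epoch in range(1, 11):
    clf.partial_fit(X, y, classes=classes)
    proba = clf.predict_proba(X)[:, 1]
    losses = {g: log_loss(y[group == g], proba[group == g], labels=classes)
              for g in ("majority", "minority")}
    print(f"epoch {epoch:2d} | majority loss {losses['majority']:.3f} "
          f"| minority loss {losses['minority']:.3f}")
```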
Implementation Challenges
When implementing these approaches, practitioners commonly encounter the following challenges:
- Performance-Fairness Trade-offs: More complex architectures or fairness-aware losses may reduce aggregate performance. Address this by:
  - Developing clear documentation of trade-off frontiers to inform stakeholder discussions;
  - Implementing multi-objective optimization approaches that explicitly balance competing goals; and
  - Creating evaluation frameworks that assess both standard performance and fairness metrics in context.
- Limited Samples for Algorithmic Analysis: Some demographic groups may have too few examples to reliably assess algorithmic impacts. Address this by:
  - Implementing synthetic data approaches to test algorithmic behavior under controlled conditions;
  - Using transfer learning from related domains with more balanced data to isolate algorithmic effects; and
  - Applying statistical techniques specifically designed for small sample inference.
Successfully implementing algorithm bias analysis requires computational resources for testing multiple model configurations, expertise in both machine learning and fairness evaluation, and organizational willingness to potentially sacrifice some aggregate performance for more equitable outcomes across groups.
Evaluation Approach
To assess whether your algorithm bias identification and mitigation approaches are effective, implement these evaluation strategies:
1. Architecture Fairness Assessment:
   - Calculate performance disparities across demographic groups for different model architectures using identical training data.
   - Establish acceptable disparity thresholds based on domain-specific requirements.
   - Compare disparities before and after architecture modifications to quantify improvements.
2. Optimization Fairness Metrics:
   - Track performance trajectories by demographic group throughout training.
   - Measure whether loss reductions are balanced across groups or concentrated in majority populations.
   - Evaluate whether fairness-aware losses reduce disparities compared to standard objectives.
3. Regularization Equity Analysis:
   - Assess whether regularization disproportionately affects features important to specific demographic groups.
   - Compare performance disparities across different regularization strategies and parameters.
   - Measure the impact of custom regularization approaches designed to preserve minority group features.
These metrics should be integrated with your organization's broader fairness assessment framework, providing inputs to comprehensive bias source identification processes that span the entire ML lifecycle.
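These evaluation strategies can also be wired into a simple release gate. The sketch below, with placeholder metric names and thresholds, requires every group to clear a performance floor and bounds the worst between-group gap before a model change is approved; the specific numbers are illustrative assumptions to be set from domain requirements.

```python
from typing import Dict

def max_disparity(group_scores: Dict[str, float]) -> float:
    """Largest absolute gap between any two groups' scores."""
    values = list(group_scores.values())
    return max(values) - min(values)

def passes_fairness_gate(group_scores: Dict[str, float],
                         min_score: float,
                         max_gap: float) -> bool:
    """Require every group to clear a floor AND the worst gap to stay bounded.

    Thresholds are placeholders; set them from your domain requirements.
    """
    return (min(group_scores.values()) >= min_score
            and max_disparity(group_scores) <= max_gap)

# Example usage with made-up per-group accuracy numbers.
candidate = {"majority": 0.91, "minority": 0.84, "intersection_xy": 0.79}
print("max disparity:", round(max_disparity(candidate), 3))
print("passes gate:", passes_fairness_gate(candidate, min_score=0.80, max_gap=0.10))
```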
4. Case Study: Content Recommendation System
Scenario Context
A digital media company is developing a content recommendation algorithm to personalize article suggestions for users on their news platform. The system analyzes user behavior, content characteristics, and contextual factors to predict engagement likelihood. Key stakeholders include product managers focused on increasing overall engagement, editorial teams concerned about content diversity, users from various demographic backgrounds seeking relevant information, and business leaders monitoring revenue implications.
Fairness is particularly critical in this context because recommendation algorithms shape information access, potentially creating filter bubbles or unequal access to opportunities based on user demographics. The company wants to ensure their algorithm provides high-quality recommendations to all user groups while maintaining strong overall engagement metrics.
Problem Analysis
Applying core concepts from this Unit reveals several potential algorithm-level biases in the recommendation system scenario:
- Inductive Bias and Architecture: Initial testing revealed that the matrix factorization architecture initially selected for the recommendation system created larger performance gaps across demographic groups than a graph neural network architecture using identical training data. Analysis showed that matrix factorization's linear embedding assumptions worked well for users with extensive interaction histories (predominantly from majority demographic groups) but struggled with users having sparse interaction patterns (more common among minority users and new users from all demographics).
- Optimization Objectives: The team had initially defined their loss function to maximize click-through rate (CTR) across all recommendations. Disaggregated analysis revealed this objective led to progressively worsening recommendations for minority groups during training, as the model focused on patterns that improved majority group engagement at the expense of minority group experience. While overall CTR improved, the disparity between demographic groups increased by 45% after optimization.
- Regularization Effects: Standard L2 regularization applied to control overfitting had disproportionate effects across user groups. Stronger regularization improved performance for majority groups by preventing overfitting to noise, but simultaneously eliminated subtle but important patterns for minority groups where limited data made legitimate signals statistically similar to noise. This created an implicit trade-off where regularization strength that was optimal for majority groups systematically underserved minority users.
- Evaluation Protocol Issues: The standard A/B testing framework evaluated new algorithm versions based on aggregate engagement metrics without disaggregation by demographic groups. This approach had repeatedly approved algorithm changes that improved overall metrics while degrading the experience for specific user segments, as improvements for majority users outweighed regressions for minority groups in aggregate statistics.
From an intersectional perspective, the most severe performance disparities affected users at specific demographic intersections—for instance, older users from minority racial backgrounds showed recommendation quality significantly worse than would be predicted by examining either age or racial factors independently.
Solution Implementation
To address these identified algorithm-level biases, the team implemented a structured approach:
- For Architecture Bias, they:
  - Conducted a systematic comparison of different architectures including matrix factorization, factorization machines, and graph neural networks;
  - Selected a hybrid architecture combining the strengths of multiple approaches to better serve diverse user interaction patterns; and
  - Implemented separate embedding dimensions for different user segments to account for varying data density and pattern complexity.
- For Optimization Objectives, they:
  - Redesigned their loss function to explicitly balance performance across demographic groups;
  - Implemented a multi-objective approach that considered both overall engagement and equity across groups; and
  - Added constraints to ensure minimum quality standards for all user segments regardless of size.
- For Regularization Impacts, they:
  - Implemented adaptive regularization that adjusted strength based on data quantity for different user groups;
  - Created feature importance preservation mechanisms to maintain predictive patterns for minority groups despite limited statistical power; and
  - Designed custom early stopping criteria that monitored convergence across demographic segments rather than just in aggregate.
- For Evaluation Protocols, they:
  - Redesigned their testing framework to automatically disaggregate results across demographic dimensions;
  - Implemented statistical tests appropriate for different sample sizes across groups; and
  - Created fairness-specific dashboards highlighting disparities alongside traditional performance metrics.
Throughout implementation, they maintained explicit focus on intersectional effects, ensuring that their algorithmic improvements addressed the specific challenges faced by users at the intersection of multiple demographic factors.
Outcomes and Lessons
The implementation resulted in significant improvements across multiple dimensions:
- The hybrid architecture reduced performance disparities across demographic groups by 62% while maintaining strong overall engagement metrics.
- The revised loss function prevented the progressive degradation of minority group recommendations during training.
- Adaptive regularization preserved important features for minority groups that standard approaches would have eliminated.
- The new evaluation framework successfully identified and prevented changes that would have created disparate impacts despite improving aggregate metrics.
Key challenges remained, including tensions between different fairness objectives and the computational complexity of more sophisticated architectural approaches.
The most generalizable lessons included:
- The critical importance of testing multiple model architectures with identical data to isolate purely algorithmic sources of bias.
- The significant impact of loss function design on how models balance performance across different user groups during optimization.
- The need for regularization approaches that account for different data characteristics across demographic groups rather than applying uniform constraints.
These insights directly informed the development of the Bias Source Identification Methodology, particularly in creating systematic tests to distinguish algorithm-level biases from data issues and in establishing appropriate evaluation approaches for different bias sources.
5. Frequently Asked Questions
FAQ 1: Distinguishing Algorithm Bias From Data Bias
Q: How can I determine whether observed fairness disparities stem from algorithm design choices rather than biases in the training data?
A: Isolate algorithmic effects by systematically varying model components while keeping training data constant. Compare performance disparities across different architectures, optimization objectives, and regularization approaches using identical datasets. If disparities change significantly based on algorithmic choices alone, this indicates algorithm-level bias contributions. Additionally, create synthetic experiments where you introduce controlled biases into otherwise balanced data to measure how different algorithms respond to known data issues. Track how performance disparities evolve during training—if gaps increase during optimization despite balanced initial predictions, this suggests the optimization process itself is amplifying minor initial differences. Finally, analyze feature importance across demographic groups before and after training to determine whether the algorithm systematically undervalues features important for minority groups.
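As one hedged illustration of the first check, the sketch below holds the training data fixed while swapping model families and compares per-group accuracy gaps. The synthetic data, the binary group construction, and the two model choices are illustrative assumptions, not a prescribed diagnostic protocol.

```python
# A minimal sketch of isolating algorithm-level bias: train different model
# families on identical data and compare per-group performance gaps.
# The dataset, group labels, and models below are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)
n = 5000
group = rng.integers(0, 2, n)                 # hypothetical binary group attribute
x1 = rng.normal(size=n)
x2 = rng.normal(size=n) + 0.5 * group         # feature distribution differs by group
y = (x1 + 0.8 * x2 + rng.normal(size=n) > 0).astype(int)
X = np.column_stack([x1, x2])

X_tr, X_te, y_tr, y_te, g_tr, g_te = train_test_split(
    X, y, group, test_size=0.3, random_state=0)

def group_gap(model):
    """Fit on identical training data; return the accuracy gap between groups."""
    model.fit(X_tr, y_tr)
    preds = model.predict(X_te)
    acc = {g: accuracy_score(y_te[g_te == g], preds[g_te == g]) for g in (0, 1)}
    return acc[0] - acc[1]

for name, model in [("logistic_regression", LogisticRegression()),
                    ("gradient_boosting", GradientBoostingClassifier(random_state=0))]:
    print(name, "group accuracy gap:", round(group_gap(model), 3))
```

If the gap changes materially between model families trained on the same split, the disparity cannot be attributed to the data alone.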
FAQ 2: Fairness-Performance Trade-offs in Algorithm Design
Q: When more complex or fairness-aware algorithms reduce overall performance metrics, how should I navigate these trade-offs with stakeholders?
A: First, quantify the exact nature of the trade-offs by mapping the Pareto frontier showing potential operating points balancing performance and fairness. This transforms an abstract discussion into a concrete decision about where on this frontier the organization wishes to operate. Frame fairness not as a constraint on performance but as an additional quality dimension, similar to how robustness or interpretability might be considered alongside accuracy. Connect fairness considerations to specific business risks—including legal liability, reputational damage, and lost market opportunities in underserved segments—to contextualize short-term metric impacts. Develop disaggregated metrics that show both overall performance and performance for specific groups, making disparities explicit rather than hidden in aggregates. Finally, propose incremental adoption approaches that gradually improve fairness while managing performance impacts through controlled deployment.
6. Summary and Next Steps
Key Takeaways
Algorithmic design and implementation choices introduce distinct sources of bias beyond any issues present in the training data. The key concepts from this Unit include:
- Inductive bias in model architectures influences which patterns models can effectively learn, potentially creating disparate performance when architectural assumptions align better with majority group patterns than minority ones.
- Optimization objectives and loss functions shape how models balance performance across groups during training, often implicitly prioritizing majority group patterns that contribute more to aggregate metrics.
- Regularization and hyperparameter choices affect which features and patterns are preserved versus eliminated, potentially disadvantaging minority groups with limited samples or complex predictive patterns.
- Evaluation protocol design determines which disparities become visible during testing, with standard approaches often obscuring performance issues affecting specific demographic groups.
These concepts directly address our guiding questions by explaining how algorithmic choices encode or amplify biases even with balanced data and by providing systematic approaches to identify and mitigate these purely algorithmic sources of unfairness.
Application Guidance
To apply these concepts in your practical work:
- Test multiple model architectures with identical training data to isolate architecture-specific fairness effects.
- Decompose loss function optimization by demographic group to identify disparate convergence patterns.
- Analyze the impact of regularization and hyperparameters on different demographic groups rather than just on aggregate performance.
- Implement disaggregated evaluation protocols that assess performance across both individual protected attributes and their intersections.
For organizations new to algorithmic fairness considerations, start by implementing basic disaggregated evaluation across demographic groups for your existing models. This baseline analysis will help identify where disparities exist, allowing you to progressively implement more sophisticated architecture comparisons, loss function analyses, and regularization assessments as your capabilities mature.
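As a starting point for that baseline analysis, the sketch below computes disaggregated metrics for single attributes and for their intersections from a table of predictions. The column names (y_true, y_pred, gender, age_band) and the toy data are assumptions for demonstration only.

```python
# A minimal sketch of a disaggregated evaluation baseline, assuming a results
# DataFrame with hypothetical columns: y_true, y_pred, gender, age_band.
import pandas as pd
from sklearn.metrics import accuracy_score, recall_score

def disaggregated_report(df, attrs):
    """Report sample size, accuracy, and recall per group defined by attrs."""
    rows = []
    for keys, grp in df.groupby(attrs):
        keys = keys if isinstance(keys, tuple) else (keys,)
        rows.append({
            **dict(zip(attrs, keys)),
            "n": len(grp),
            "accuracy": accuracy_score(grp["y_true"], grp["y_pred"]),
            "recall": recall_score(grp["y_true"], grp["y_pred"], zero_division=0),
        })
    return pd.DataFrame(rows)

# Toy example; replace with your own evaluation data.
toy = pd.DataFrame({
    "y_true": [1, 0, 1, 1, 0, 1, 0, 0],
    "y_pred": [1, 0, 0, 1, 0, 1, 1, 0],
    "gender": ["f", "f", "f", "m", "m", "m", "f", "m"],
    "age_band": ["18-34", "35+", "35+", "18-34", "35+", "18-34", "18-34", "35+"],
})
print(disaggregated_report(toy, ["gender"]))              # single attribute
print(disaggregated_report(toy, ["gender", "age_band"]))  # intersection
```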
Looking Ahead
In the next Unit, we will build on this foundation by examining feedback loops and amplification effects that can magnify initial biases over time. You will learn how model outputs influence future data collection, how algorithmic decisions can create reinforcing cycles of disadvantage, and how to identify systems at high risk for runaway bias amplification.
The algorithm-level biases we have examined here often serve as the initial seeds for these feedback effects, with small algorithmic disparities potentially growing into significant fairness issues through repeated application and data recollection. Understanding both static algorithmic biases and dynamic feedback effects is essential for developing comprehensive fairness strategies that remain effective over time.
References
Hashimoto, T. B., Srivastava, M., Namkoong, H., & Liang, P. (2018). Fairness without demographics in repeated loss minimization. In Proceedings of the 35th International Conference on Machine Learning (pp. 1929-1938). https://proceedings.mlr.press/v80/hashimoto18a.html
Hooker, S. (2021). Moving beyond "algorithmic bias is a data problem". Patterns, 2(4), 100241. https://doi.org/10.1016/j.patter.2021.100241
Kearns, M., Neel, S., Roth, A., & Wu, Z. S. (2018). Preventing fairness gerrymandering: Auditing and learning for subgroup fairness. In Proceedings of the 35th International Conference on Machine Learning (pp. 2564-2572). https://proceedings.mlr.press/v80/kearns18a.html
Kleinberg, J., Ludwig, J., Mullainathan, S., & Rambachan, A. (2018). Algorithmic fairness. AEA Papers and Proceedings, 108, 22-27. https://doi.org/10.1257/pandp.20181018
Larson, J., Mattu, S., Kirchner, L., & Angwin, J. (2017). How we analyzed the COMPAS recidivism algorithm. ProPublica. https://www.propublica.org/article/how-we-analyzed-the-compas-recidivism-algorithm
Urbanek, J., Kannan, A., Kamath, A., Lobez, J. I., & DeFour, M. (2019). Comparing model architectures for NLP fairness. arXiv preprint arXiv:1911.01485. https://arxiv.org/abs/1911.01485
Unit 3
Unit 3: Feedback Loops and Amplification Effects
1. Conceptual Foundation and Relevance
Guiding Questions
- Question 1: How do feedback loops in AI systems transform initial biases into progressively larger disparities over time?
- Question 2: What technical mechanisms can data scientists implement to detect, measure, and interrupt harmful feedback dynamics before they create significant fairness violations?
Conceptual Context
Feedback loops represent one of the most pernicious mechanisms through which initially small biases can transform into significant fairness problems in AI systems. While static bias sources in data or algorithms create constant disparities, feedback dynamics can generate exponentially growing disparities as systems influence the environments they observe, creating a self-reinforcing cycle of inequity.
These feedback mechanisms are particularly dangerous because they often operate invisibly, with seemingly minor initial disparities compounding over time through multiple iterations of model deployment and data collection. As Ensign et al. (2018) demonstrate in their analysis of predictive policing algorithms, feedback loops can create a "runaway" effect where the model's own predictions shape future data in ways that confirm and amplify initial biases, regardless of underlying ground truth (Ensign et al., 2018).
This Unit builds directly on the data collection and representation biases explored in Units 1-2 by examining how these initial biases can be magnified through system dynamics rather than remaining static. It establishes essential foundations for understanding deployment biases in Unit 4 by focusing on the iterative processes through which models interact with their environments over time. The insights you develop here will directly inform the Bias Source Identification Tool in Unit 5, particularly in identifying dynamic bias sources that emerge through system operation rather than existing in any single component.
2. Key Concepts
Runaway Feedback Effects
Runaway feedback effects occur when an AI system's predictions influence future data collection in ways that systematically reinforce and amplify the system's existing patterns, creating a self-fulfilling prophecy that progressively diverges from ground truth. This concept is crucial for AI fairness because these self-reinforcing cycles can transform small initial biases into significant disparities over time, often operating invisibly until substantial harm has occurred.
Runaway feedback connects directly to data bias concepts from previous Units, showing how representation disparities can become progressively worse through system operation rather than remaining static. This dynamic view complements the static analysis of bias sources by revealing how initial biases evolve over time.
Research by Ensign et al. (2018) provides a powerful application example in predictive policing, demonstrating mathematically how these algorithms can create feedback loops that reinforce initial biases. Their study showed that when police are dispatched based on predictions from historical arrest data, new arrests occur disproportionately in already over-policed neighborhoods. These new arrests then enter the historical data, strengthening the pattern that led to their prediction and creating a self-reinforcing cycle disconnected from actual crime rates (Ensign et al., 2018).
Similar dynamics appear in content recommendation systems, where initial disparities in content exposure lead to differential engagement patterns, which then inform future recommendations in a reinforcing cycle. For example, an educational content recommendation system showing slight initial bias toward traditionally "male" subjects for boys and "female" subjects for girls will collect engagement data shaped by these initial recommendations, potentially creating increasingly gender-stereotyped content exposure over time.
For the Bias Source Identification Tool we will develop, understanding runaway feedback effects helps identify dynamic bias sources that emerge through system operation rather than existing in static components. This perspective guides the development of monitoring approaches that track disparity growth over time rather than just measuring bias at a single point.
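The deliberately stylized toy simulation below (an illustrative assumption, not Ensign et al.'s exact model) shows how this runaway dynamic can unfold: two regions have identical true incident rates, but attention is dispatched to the region with the larger recorded history, and incidents are only recorded where attention goes, so a tiny initial disparity hardens into a near-monopoly of recorded incidents.

```python
# Stylized runaway feedback toy: identical ground truth, but observation follows
# history and history follows observation. All numbers are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(42)
true_rate = [0.3, 0.3]     # identical true incident rates in both regions
observed = [11, 10]        # tiny initial disparity in the historical record

for step in range(100):
    target = int(np.argmax(observed))                         # allocate attention by history
    observed[target] += rng.binomial(50, true_rate[target])   # only see where we look

total = sum(observed)
print("share of recorded incidents, region 0:", round(observed[0] / total, 3))
print("share of recorded incidents, region 1:", round(observed[1] / total, 3))
```

Despite identical ground truth, the region with a one-incident head start ends up with nearly all recorded incidents, because the record drives attention and attention drives the record.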
Feedback Loop Typology
Feedback loops in AI systems can be categorized into distinct types based on their operational mechanisms, information flows, and fairness implications. Understanding this typology is essential for AI fairness because different feedback types require specific detection and mitigation approaches tailored to their particular dynamics.
This typology interacts with runaway feedback effects by providing a structured framework for analyzing how different feedback mechanisms lead to bias amplification. It also connects to subsequent concepts by identifying specific points where monitoring or intervention might be most effective.
As established by Mansoury et al. (2020) in their analysis of recommendation systems, we can distinguish between several feedback types:
- Direct Feedback Loops occur when a system's outputs directly influence future inputs in the same decision stream. For example, a content recommendation algorithm suggesting certain items leads to user engagement with those items, which then reinforces those recommendation patterns.
- Indirect Feedback Loops involve a system's outputs influencing future inputs through intermediate mechanisms or external systems. For instance, an automated resume screening tool might influence which candidates receive interview training, affecting their performance in future application processes.
- User-Driven Feedback emerges from how users interact with and adapt to an AI system over time. For example, users may learn to use specific keywords to achieve desired outcomes from search algorithms, creating distribution shifts that affect system performance.
- System-Driven Feedback occurs when the AI system itself evolves in response to new data without explicit user adaptation. For instance, continual learning systems that automatically update based on new observations can develop increasingly biased patterns without direct user input.
Understanding these distinct feedback mechanisms is critical for effective bias mitigation. For example, mitigating direct feedback loops might involve randomization techniques to break self-reinforcing cycles, while addressing user-driven feedback requires monitoring for strategic adaptation patterns across different user groups.
For our Bias Source Identification Tool, this typology will guide the development of specific detection techniques for different feedback mechanisms, ensuring comprehensive coverage of dynamic bias sources across system types.
Measurement and Detection Approaches
Detecting and measuring feedback-induced bias amplification requires specialized techniques beyond static fairness metrics, as the key concern is how disparities evolve over time rather than their magnitude at any specific moment. These measurement approaches are essential for AI fairness because they enable early detection of potentially harmful feedback dynamics before they create significant disparities.
Measurement approaches connect to the feedback typology by providing specific techniques for quantifying different feedback mechanisms. They also establish the foundation for intervention strategies by identifying when and how to interrupt harmful feedback cycles.
Research by Hashimoto et al. (2018) demonstrates several approaches for measuring feedback-induced disparities, including:
- Disparity Growth Rate Analysis tracks the change in fairness metrics over multiple system iterations, focusing on the rate of change rather than absolute values. This approach can detect exponential growth patterns characteristic of runaway feedback.
- Counterfactual Simulation involves running simulations with and without feedback mechanisms to isolate their specific effects on different groups. These simulations can reveal how initially small disparities might evolve under continued system operation.
- Distribution Shift Monitoring examines how input data distributions change over time in response to system outputs, with particular attention to whether these shifts differ across demographic groups.
- Causal Analysis of Feedback Paths identifies and quantifies specific causal mechanisms through which system outputs influence future inputs, often using intervention experiments that deliberately break potential feedback paths.
As Hashimoto et al. (2018) demonstrated in their analysis of repeated loss minimization, these measurement approaches can reveal how model outputs influence future data in ways that create increasingly biased patterns over time. Their research showed how initially minor representation disparities progressively worsen through feedback dynamics, as underrepresented groups receive worse model performance, potentially reducing their system usage and further diminishing their representation in training data (Hashimoto et al., 2018).
For the Bias Source Identification Tool, these measurement approaches will form a critical component for detecting dynamic bias sources that emerge through system operation. The framework will include specific methodologies for implementing these measurements across different AI applications and system architectures.
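A minimal sketch of the first of these approaches, disparity growth rate analysis, is shown below: it fits a log-linear trend to a series of fairness-gap measurements taken after successive retraining cycles and flags compounding growth. The gap values and the 5% alert threshold are illustrative assumptions.

```python
# A minimal sketch of disparity growth rate analysis over retraining cycles.
# The gap series and alert threshold are hypothetical.
import numpy as np

gap_history = np.array([0.020, 0.023, 0.027, 0.031, 0.037, 0.043, 0.051])  # one value per cycle

# Log-linear fit: log(gap_t) ~ a + r * t, so r > 0 indicates exponential growth.
t = np.arange(len(gap_history))
r, a = np.polyfit(t, np.log(gap_history), deg=1)

growth_per_cycle = np.exp(r) - 1
print(f"estimated per-cycle disparity growth rate: {growth_per_cycle:.1%}")
if growth_per_cycle > 0.05:   # hypothetical alert threshold of 5% growth per cycle
    print("ALERT: disparity appears to be compounding; investigate feedback paths")
```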
Intervention Strategies for Breaking Harmful Cycles
Once harmful feedback loops are identified, specific technical interventions can break these cycles before they create significant fairness violations. These intervention strategies are crucial for AI fairness because they provide concrete mechanisms for preventing bias amplification in systems where feedback dynamics are unavoidable.
Intervention strategies build directly on measurement approaches by using detection insights to implement targeted solutions. They represent the actionable component that follows from understanding feedback mechanisms and their potential harms.
Research by D'Amour et al. (2020) and others has established several effective intervention strategies:
- Strategic Randomization introduces controlled randomness into model outputs to prevent self-reinforcing patterns from becoming entrenched. For example, exploration parameters in recommendation systems can ensure diverse content exposure despite initial popularity disparities.
- Periodic Distribution Alignment explicitly corrects for distribution shifts by reweighting or resampling training data to match target distributions, preventing progressive drift due to feedback effects.
- Causal Intervention specifically targets and modifies the causal mechanisms through which system outputs influence future inputs, breaking harmful feedback paths while maintaining beneficial ones.
- Multi-stakeholder Optimization incorporates the welfare of all system participants into the objective function, preventing feedback dynamics that benefit certain groups at others' expense.
D'Amour et al. (2020) demonstrated these approaches in their simulation studies of long-term fairness, showing how strategic modifications to learning algorithms can prevent harmful feedback cycles in systems ranging from credit scoring to content recommendation. Their research established that simple interventions like periodic retraining with distribution constraints can significantly reduce bias amplification while maintaining overall system performance (D'Amour et al., 2020).
For our Bias Source Identification Tool, these intervention strategies will provide critical guidance on how to address dynamic bias sources once they are detected. The framework will connect specific feedback mechanisms to appropriate intervention approaches, enabling targeted solutions rather than generic fairness constraints.
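The sketch below illustrates strategic randomization in its simplest epsilon-greedy form: with a small probability, the system serves a uniformly random item instead of the top-scored one, guaranteeing baseline exposure for items that would otherwise be starved by the feedback cycle. The scores and the epsilon value of 0.1 are illustrative assumptions.

```python
# A minimal epsilon-greedy sketch of strategic randomization in a recommender.
# Scores and epsilon are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(7)

def recommend(scores, epsilon=0.1):
    """Return an item index: usually exploit the best score, occasionally explore."""
    if rng.random() < epsilon:
        return int(rng.integers(len(scores)))   # exploration breaks the feedback cycle
    return int(np.argmax(scores))               # exploitation uses the learned ranking

scores = np.array([0.91, 0.40, 0.39, 0.12])     # model scores skewed by past engagement
picks = [recommend(scores) for _ in range(10_000)]
exposure = np.bincount(picks, minlength=len(scores)) / len(picks)
print("exposure share per item:", np.round(exposure, 3))
```

In practice, exploration can be weighted toward underexposed categories rather than drawn uniformly, and the exploration budget tuned to bound its short-term engagement cost.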
Domain Modeling Perspective
From a domain modeling perspective, feedback loops and amplification effects map to specific components of ML systems:
- Data Collection Mechanism: How system outputs influence which new data points are collected and with what frequency.
- Labeling Process: How current model predictions affect the labels assigned to future training examples.
- Feature Distribution: How system-influenced behaviors change the distribution of features in future inputs.
- Update Mechanism: How new data is incorporated into model training and how the model evolves over time.
- Decision Threshold Dynamics: How decision thresholds adapt to changing data distributions influenced by the system itself.
This domain mapping helps you understand how feedback loops operate through specific technical components rather than treating them as abstract phenomena. The Bias Source Identification Tool will leverage this mapping to identify concrete points where feedback can be measured and interrupted within the ML lifecycle.

Conceptual Clarification
To clarify these abstract concepts, consider the following analogies:
- Feedback loops in AI systems function like compound interest in financial systems. Just as a small initial investment can grow exponentially through the continuous reinvestment of interest, small initial biases in AI systems can amplify dramatically when system outputs systematically influence future inputs. In both cases, the critical dynamic is that each cycle builds upon the results of previous cycles, creating exponential rather than linear effects over time. This compounding dynamic explains why seemingly minor initial disparities can become significant fairness problems through iterative system operation.
- The different feedback types can be understood through water flow analogies. Direct feedback loops are like a water pump that recirculates water within a single pool—what comes out immediately goes back in, quickly changing the pool's composition. Indirect feedback loops resemble a connected pond system where changes in one pond eventually affect others through connecting streams, creating more complex and delayed effects. User-driven feedback is similar to people adjusting their water usage based on observed reservoir levels, while system-driven feedback resembles automated dam systems that adjust water flow based on predetermined rules without human intervention.
- Intervention strategies parallel techniques used to prevent audio feedback in sound systems (the unpleasant screeching when a microphone picks up its own amplified output). Just as sound engineers use frequency filters, microphone placement adjustments, and controlled dampening to prevent harmful audio feedback cycles, data scientists can implement strategic randomization, distribution alignment, and causal interventions to prevent harmful feedback cycles in AI systems. In both domains, the goal is not to eliminate all feedback (which would make the system non-functional) but to prevent specific harmful feedback patterns while maintaining desired system behaviors.
Intersectionality Consideration
Feedback loops present unique challenges for intersectional fairness analysis, as amplification effects often manifest differently across intersecting identity dimensions, creating complex patterns that single-attribute analyses miss. Traditional fairness monitoring that examines protected attributes separately may fail to detect feedback-induced disparities that specifically affect intersectional subgroups.
For example, research by Ekstrand et al. (2018) on recommendation systems demonstrated that feedback loops can create particularly severe disparities for users at specific intersections of gender, age, and cultural background—patterns that were not evident when examining these attributes independently. Their analysis showed how recommendation accuracy initially differed subtly across demographic dimensions, but these differences amplified dramatically at certain intersections through feedback cycles, as reduced recommendation quality led to reduced engagement, which further diminished recommendation personalization (Ekstrand et al., 2018).
The practical implications of these intersectional effects include:
- Feedback loop detection requires explicitly intersectional measurement approaches that track disparity evolution across demographic combinations, not just individual protected attributes.
- Simulation models for predicting feedback effects must incorporate intersectional user behavior patterns rather than assuming uniform responses across demographic categories.
- Intervention strategies must address the specific feedback mechanisms that create heightened disparities at demographic intersections, potentially requiring customized approaches for different intersectional groups.
- System monitoring must maintain granular tracking of performance across intersectional categories, with heightened sensitivity to early warning signs of disparity growth in historically vulnerable intersectional groups.
The Bias Source Identification Tool must explicitly address these intersectional considerations by incorporating analytical approaches that detect, measure, and mitigate feedback-induced disparities across demographic intersections rather than treating protected attributes in isolation.
3. Practical Considerations
Implementation Framework
To systematically detect and address feedback loops in AI systems, implement this structured methodology:
- Feedback Path Identification:
  - Map all pathways through which system outputs might influence future inputs.
  - Classify identified feedback paths according to the feedback typology (direct, indirect, user-driven, system-driven).
  - Estimate potential disparity amplification risks for each path based on initial bias measurements.
  - Prioritize high-risk feedback paths for detailed monitoring and potential intervention.
- Dynamic Disparity Measurement:
  - Implement time-series tracking of fairness metrics across system iterations.
  - Calculate disparity growth rates to identify exponential amplification patterns.
  - Conduct counterfactual simulations that isolate feedback effects from other factors.
  - Measure distribution shifts in both feature spaces and outcome variables across demographic groups.
- Feedback Intervention Design:
  - Select appropriate intervention strategies based on feedback type and system constraints.
  - Implement targeted randomization to prevent self-reinforcing patterns in high-risk areas.
  - Design distribution monitoring and correction mechanisms that trigger automatically when shifts exceed thresholds.
  - Develop causal intervention approaches that modify specific feedback mechanisms without compromising overall system functionality.
- Continuous Monitoring:
  - Establish automated alerts for accelerating disparity growth rates.
  - Implement A/B testing frameworks that compare system versions with different feedback intervention strategies.
  - Track long-term disparity evolution to verify intervention effectiveness.
  - Document observed feedback patterns to inform future system designs.
These methodologies integrate with standard ML workflows by extending traditional static fairness evaluation to include dynamic monitoring and intervention. While adding complexity to system design and evaluation, these approaches prevent the significant fairness violations and potential legal liabilities that can emerge from unchecked feedback loops.
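To make the first step, feedback path identification, concrete, the sketch below represents candidate feedback paths as simple records with a type and a rough risk estimate, then ranks them for monitoring priority. The paths, scores, and prioritization rule are hypothetical illustrations rather than outputs of any particular audit tool.

```python
# A minimal sketch of a feedback path registry used to prioritize monitoring.
# All paths, types, and scores are hypothetical.
from dataclasses import dataclass

@dataclass
class FeedbackPath:
    source: str                 # system output that starts the loop
    sink: str                   # future input it influences
    loop_type: str              # "direct", "indirect", "user-driven", or "system-driven"
    initial_disparity: float    # measured gap today (e.g., exposure gap)
    amplification_risk: float   # estimated 0-1 risk of compounding over iterations

paths = [
    FeedbackPath("ranked recommendations", "click logs", "direct", 0.04, 0.8),
    FeedbackPath("screening decisions", "interview-coached applicants", "indirect", 0.02, 0.5),
    FeedbackPath("query suggestions", "user query phrasing", "user-driven", 0.01, 0.3),
]

# Prioritize paths whose combination of current gap and amplification risk is largest.
for p in sorted(paths, key=lambda p: p.initial_disparity * p.amplification_risk, reverse=True):
    priority = p.initial_disparity * p.amplification_risk
    print(f"{p.loop_type:12s} {p.source} -> {p.sink}  priority={priority:.3f}")
```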
Implementation Challenges
When implementing feedback analysis and intervention, practitioners commonly face these challenges:
- Limited Data on System Dynamics: Many organizations lack sufficient longitudinal data to identify feedback patterns. Address this by:
  - Implementing structured logging systems that track both model outputs and subsequent data collection;
  - Developing simulation environments that predict potential feedback effects before full deployment; and
  - Starting with focused monitoring of high-risk feedback paths rather than attempting comprehensive coverage immediately.
- Tension Between Intervention and Performance: Interventions that break harmful feedback loops can sometimes reduce short-term system performance. Address this by:
  - Framing feedback intervention as risk management that prevents long-term performance degradation;
  - Designing targeted interventions that address specific harmful dynamics rather than applying generic constraints; and
  - Developing metrics that capture both immediate performance and long-term stability to demonstrate intervention value.
Successfully implementing feedback analysis requires computational resources for simulation and monitoring, longitudinal data capturing system behavior over multiple iterations, and cross-functional collaboration between data scientists, domain experts, and stakeholders who can identify potential feedback mechanisms beyond technical components.
Evaluation Approach
To assess whether your feedback loop analysis and intervention is effective, implement these evaluation strategies:
- Disparity Growth Rate Comparison:
  - Calculate and compare disparity growth rates before and after intervention.
  - Establish acceptable thresholds for maximum growth rates across different metrics.
  - Verify that growth rates remain within bounds over extended periods, not just immediately after intervention.
- Counterfactual Performance Analysis:
  - Simulate system performance with and without feedback interventions.
  - Measure both short-term performance impact and long-term disparity evolution.
  - Quantify the trade-off between immediate performance and feedback mitigation.
- Distribution Stability Assessment:
  - Track key data distribution statistics over time to verify stability.
  - Compare distribution drift rates across demographic groups to ensure equitable stability.
  - Document distribution change points and correlate them with system modifications.
These evaluation approaches should connect to your organization's broader fairness assessment framework, providing dynamic analysis that complements static fairness metrics and identifies emerging risks before they create significant disparities.
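As one way to operationalize the distribution stability assessment above, the sketch below compares a feature's current distribution to a baseline separately for each demographic group using the population stability index (PSI). The synthetic data, group labels, and the 0.1 drift threshold are illustrative assumptions.

```python
# A minimal sketch of per-group distribution stability monitoring via PSI.
# Data, group names, and threshold are illustrative assumptions.
import numpy as np

def psi(baseline, current, bins=10):
    """Population stability index between two samples of a numeric feature."""
    edges = np.histogram_bin_edges(baseline, bins=bins)
    b_frac = np.histogram(baseline, bins=edges)[0] / len(baseline) + 1e-6
    c_frac = np.histogram(current, bins=edges)[0] / len(current) + 1e-6
    return float(np.sum((c_frac - b_frac) * np.log(c_frac / b_frac)))

rng = np.random.default_rng(3)
for group, shift in [("group_a", 0.0), ("group_b", 0.6)]:   # group_b's inputs have drifted
    baseline = rng.normal(0, 1, 5000)
    current = rng.normal(shift, 1, 5000)
    drift = psi(baseline, current)
    flag = "DRIFT" if drift > 0.1 else "stable"              # hypothetical threshold
    print(f"{group}: PSI={drift:.3f} ({flag})")
```

Tracking this kind of per-group drift score over time, rather than a single aggregate value, is what makes inequitable distribution shifts visible.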
4. Case Study: Content Recommendation System
Scenario Context
A digital education platform uses a machine learning recommendation system to suggest learning materials to students based on their past engagement patterns, learning goals, and performance. The system aims to personalize the educational experience by recommending content that matches each student's interests and learning pace. Key stakeholders include students seeking effective learning resources, educators concerned with comprehensive educational coverage, platform developers focused on engagement metrics, and education policy experts monitoring equity in educational access.
Fairness is particularly critical in this domain because educational recommendations directly influence learning opportunities and outcomes, with potential long-term impacts on students' academic development and career trajectories. The recommendation system must balance personalization with ensuring equitable access to high-quality educational resources across different student demographics.
Problem Analysis
Applying core concepts from this Unit reveals several potential feedback loop concerns in the education recommendation system:
- Runaway Feedback Effects: Analysis of six months of historical data reveals that students who initially received recommendations for advanced content showed progressively increasing engagement, leading to more advanced recommendations, while students who initially received basic content recommendations showed declining engagement over time. This pattern suggests a runaway feedback effect where initial recommendation differences are amplifying rather than correcting over time.
- Feedback Typology Analysis: Several feedback mechanisms are identified:
  - Direct Feedback: Student engagement with recommended content directly influences future recommendations.
  - Indirect Feedback: Content mastery unlocks new content areas, creating path dependencies in learning trajectories.
  - User-Driven Feedback: Students adapt their behavior based on recommendation patterns, sometimes avoiding content categories where they receive fewer recommendations.
  - System-Driven Feedback: The recommendation algorithm continuously updates based on aggregate engagement patterns, potentially amplifying popular content categories.
- Measurement Reveals Demographic Disparities: Applying disparity growth rate analysis shows that recommendation diversity is declining 37% faster for students from lower socioeconomic backgrounds. Distribution shift monitoring reveals that STEM content exposure is growing for male students while remaining stable for female students, creating a widening gender gap in STEM content recommendations that was not apparent in static fairness metrics.
- Intersectional Effects: The most severe feedback effects appear at specific intersections, with female students from lower socioeconomic backgrounds showing the steepest decline in advanced content recommendations—a pattern not fully visible when analyzing either gender or socioeconomic status independently.
These findings suggest that while the recommendation system appears reasonably fair in static analysis, feedback dynamics are creating progressively larger disparities that could significantly impact educational outcomes if not addressed.
Solution Implementation
To address these identified feedback concerns, the education platform implemented a structured intervention approach:
- For Runaway Feedback Effects, they:
  - Implemented a "learning trajectory balancing" algorithm that counteracts self-reinforcing cycles by periodically boosting recommendations for content categories showing declining engagement;
  - Created disparity growth rate dashboards that track how quickly content exposure diverges across student groups; and
  - Established maximum thresholds for acceptable disparity growth, with automatic alerts when these thresholds are approached.
- For Feedback Type-Specific Interventions, they:
  - Addressed direct feedback by implementing strategic exploration parameters that ensure minimum exposure to diverse content categories regardless of past engagement;
  - Mitigated indirect feedback by creating multiple learning pathways to advanced content, preventing path dependency; and
  - Countered system-driven feedback by regularly rebalancing the training data to maintain consistent demographic and content type distributions over time.
- For Distribution Monitoring, they:
  - Developed a comprehensive monitoring framework tracking 15 key distribution metrics across student demographics;
  - Implemented automated distribution correction when content exposure began to skew beyond established thresholds; and
  - Created visualization tools allowing educators to observe emerging recommendation patterns and manually intervene when concerning trends appeared.
- For Intersectional Considerations, they:
  - Refined monitoring to track recommendation patterns across demographic intersections, not just individual attributes;
  - Implemented customized intervention parameters for historically underserved intersectional groups; and
  - Developed specialized content designed to counteract observed feedback patterns for specific intersectional groups.
Throughout implementation, they maintained a careful balance between breaking harmful feedback cycles and preserving the personalization benefits that recommendation systems provide, using targeted interventions rather than constraining the entire system.
Outcomes and Lessons
The implementation resulted in several measurable improvements over a four-month evaluation period:
- Disparity growth rates in advanced content exposure decreased by 68% across demographic groups.
- STEM content recommendation diversity increased for female students without reducing male student engagement.
- Long-term user retention improved by 13% for previously underserved demographic groups.
- Overall system performance maintained its personalization quality while achieving more equitable content distribution.
Key challenges remained, including the need for continuous monitoring as new content and users entered the system, and the ongoing tension between exploration (for fairness) and exploitation (for engagement) in the recommendation strategy.
The most generalizable lessons included:
- The importance of explicitly measuring disparity growth rates rather than just static fairness metrics, as systems can appear fair in snapshot analysis while creating significant disparities through dynamic operation.
- The effectiveness of targeted interventions for specific feedback mechanisms rather than generic fairness constraints, allowing preservation of system benefits while preventing harmful cycles.
- The critical value of intersectional analysis in feedback detection, as the most severe amplification effects often occur at demographic intersections rather than across primary demographic categories.
These insights directly informed the development of the Bias Source Identification Tool, particularly in creating methodologies for identifying and addressing dynamic bias sources that emerge through system operation over time.
5. Frequently Asked Questions
FAQ 1: Balancing Personalization and Feedback Mitigation
Q: How can we prevent harmful feedback loops in recommendation systems without sacrificing the personalization benefits that make these systems valuable to users?
A: The key lies in implementing targeted interventions rather than constraining the entire recommendation process. Start by identifying which specific feedback mechanisms create harmful amplification—not all feedback is problematic. Then implement strategic interventions like controlled exploration parameters that ensure minimum diversity while still allowing personalization within those bounds. Alternatively, implement "budget-based" approaches where the system maintains freedom to personalize recommendations as long as certain distribution constraints are satisfied across user groups. Rather than viewing feedback mitigation and personalization as opposing goals, seek intervention designs that specifically target harmful dynamics while preserving personalization benefits. Many technical approaches including multi-objective optimization, constrained reinforcement learning, and exploration-exploitation balancing can achieve this balance when properly configured for your specific application context.
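One hedged illustration of such a "budget-based" approach is sketched below: items are ranked freely by score, but each content category present is guaranteed a minimum share of the recommendation slate. The item data, the 20% minimum share, and the re-ranking rule are assumptions for illustration, not a production re-ranker.

```python
# A minimal sketch of a budget-based slate constraint: personalize by score,
# but guarantee each category a minimum share of the top-k slots.
# Items, min_share, and the re-ranking rule are illustrative assumptions.
from collections import Counter

def constrained_slate(items, k=10, min_share=0.2):
    """items: list of (item_id, category, score); returns a top-k slate where
    every category receives at least min_share of the k slots when possible."""
    by_score = sorted(items, key=lambda x: x[2], reverse=True)
    categories = {cat for _, cat, _ in items}
    quota = max(1, int(min_share * k))

    slate = []
    # First pass: fill each category's quota with its best-scoring items.
    for cat in categories:
        slate.extend([it for it in by_score if it[1] == cat][:quota])
    # Second pass: fill remaining slots purely by score.
    for it in by_score:
        if len(slate) >= k:
            break
        if it not in slate:
            slate.append(it)
    return slate[:k]

# Toy catalog skewed toward one category by score.
items = [(f"item{i}", "stem" if i % 3 else "arts", 1.0 / (i + 1)) for i in range(30)]
slate = constrained_slate(items)
print(Counter(cat for _, cat, _ in slate))   # both categories keep guaranteed exposure
```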
FAQ 2: Detecting Feedback Without Longitudinal Data
Q: How can we identify potential feedback concerns when developing a new AI system without extensive longitudinal data showing how the system will operate over time?
A: While longitudinal data provides the most direct evidence of feedback effects, several approaches can identify potential concerns earlier: First, implement simulation testing that models how system outputs might influence future inputs based on reasonable assumptions about user and environment behavior. Second, analyze similar existing systems or predecessor versions to identify feedback patterns that might transfer to your new system. Third, conduct focused A/B tests specifically designed to detect feedback sensitivity—deploy variants with deliberately different initial conditions and measure how quickly they diverge. Fourth, implement progressive rollout with intensive monitoring of early feedback indicators like distribution stability and disparity growth rates even with limited data points. Finally, conduct proactive causal analysis to map all pathways through which system outputs might influence future inputs, then implement targeted monitoring of high-risk paths. While these approaches cannot perfectly predict feedback effects, they enable early risk assessment and mitigation design before deployment.
6. Summary and Next Steps
Key Takeaways
Feedback loops represent critical mechanisms through which initially small biases can transform into significant fairness problems over time. The key concepts from this Unit include:
- Runaway feedback effects create self-reinforcing cycles where a system's outputs influence future inputs in ways that progressively amplify existing patterns, potentially disconnecting predictions from ground truth.
- Different feedback types (direct, indirect, user-driven, system-driven) operate through distinct mechanisms and require tailored detection and intervention approaches.
- Specialized measurement techniques including disparity growth rate analysis, counterfactual simulation, and distribution shift monitoring are essential for detecting feedback-induced bias amplification.
- Targeted intervention strategies such as strategic randomization, distribution alignment, and causal intervention can break harmful feedback cycles while preserving system functionality.
These concepts directly address our guiding questions by explaining how feedback loops transform initial biases into larger disparities and by providing concrete technical approaches for detecting and interrupting these harmful cycles.
Application Guidance
To apply these concepts in your practical work:
- Implement dynamic fairness monitoring that tracks metrics over time rather than just at single points.
- Map potential feedback paths in your systems to identify high-risk areas for targeted monitoring.
- Design intervention mechanisms that automatically trigger when disparity growth exceeds acceptable thresholds.
- Validate intervention effectiveness through counterfactual testing rather than assuming static improvements will persist.
For organizations new to feedback analysis, start by focusing on the most direct feedback paths in high-risk applications, then progressively expand to more complex feedback mechanisms as capabilities mature.
Looking Ahead
In the next Unit, we will build on this foundation by examining system interaction and deployment biases—the ways that AI systems create fairness issues through their operation in real-world contexts and interaction with users. You will learn how design choices, interface elements, and operational contexts influence fairness outcomes, and how to identify these deployment-specific bias sources.
The feedback dynamics we have examined here directly inform these deployment considerations, as many system interaction biases emerge through complex feedback mechanisms between users and AI systems. Understanding both feedback loops and deployment contexts is essential for developing comprehensive bias source mapping that addresses the full lifecycle of AI systems from development through deployment and ongoing operation.
References
D'Amour, A., Srinivasan, H., Atwood, J., Baljekar, P., Sculley, D., & Halpern, Y. (2020). Fairness is not static: Deeper understanding of long term fairness via simulation studies. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (pp. 525-534). https://dl.acm.org/doi/10.1145/3351095.3372878
Ekstrand, M. D., Tian, M., Azpiazu, I. M., Ekstrand, J. D., Anuyah, O., McNeill, D., & Pera, M. S. (2018). All the cool kids, how do they fit in?: Popularity and demographic biases in recommender evaluation and effectiveness. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (pp. 172-186). https://proceedings.mlr.press/v81/ekstrand18b.html
Ensign, D., Friedler, S. A., Neville, S., Scheidegger, C., & Venkatasubramanian, S. (2018). Runaway feedback loops in predictive policing. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (pp. 160-171). https://proceedings.mlr.press/v81/ensign18a.html
Hashimoto, T. B., Srivastava, M., Namkoong, H., & Liang, P. (2018). Fairness without demographics in repeated loss minimization. In Proceedings of the 35th International Conference on Machine Learning (pp. 1929-1938). https://proceedings.mlr.press/v80/hashimoto18a.html
Mansoury, M., Abdollahpouri, H., Pechenizkiy, M., Mobasher, B., & Burke, R. (2020). Feedback loop and bias amplification in recommender systems. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management (pp. 2145-2148). https://dl.acm.org/doi/10.1145/3340531.3412152
O'Neil, C. (2016). Weapons of math destruction: How big data increases inequality and threatens democracy. Crown Publishing Group.
Suresh, H., & Guttag, J. V. (2021). A framework for understanding sources of harm throughout the machine learning life cycle. In Equity and Access in Algorithms, Mechanisms, and Optimization (pp. 1-9). https://dl.acm.org/doi/10.1145/3465416.3483305
Yang, K., Qinami, K., Fei-Fei, L., Deng, J., & Russakovsky, O. (2020). Towards fairer datasets: Filtering and balancing the distribution of the people subtree in the ImageNet hierarchy. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (pp. 547-558). https://dl.acm.org/doi/10.1145/3351095.3375709
Unit 4
Unit 4: System Interaction and Deployment Biases
1. Conceptual Foundation and Relevance
Guiding Questions
- Question 1: How do human-AI interactions and deployment contexts transform theoretical fairness properties into actual fairness outcomes in operational systems?
- Question 2: What systematic approaches can practitioners implement to identify, monitor, and mitigate biases that emerge during system deployment and user interaction?
Conceptual Context
When AI systems move from development environments to real-world deployment, they encounter complex sociotechnical contexts that can fundamentally transform their fairness properties. While previous Units examined biases in data collection, feature representation, and algorithmic design, this Unit focuses on how deployment contexts and human-AI interactions create new fairness challenges that cannot be addressed through pre-deployment interventions alone.
This focus on deployment contexts is essential because fairness is not merely a static property embedded in algorithms but an emergent characteristic that manifests through dynamic interactions between technical systems, human users, and institutional environments. As Selbst et al. (2019) argue in their influential work on "abstraction traps," fairness efforts that focus exclusively on algorithmic properties while abstracting away deployment contexts often fail to achieve their intended objectives in practice (Selbst et al., 2019).
This Unit builds directly on the foundations established in previous Units by examining how biases identified in data collection, representation, and algorithm design manifest and potentially transform during deployment. It provides essential insights for the Bias Source Identification Tool we will develop in Unit 5, particularly in identifying how deployment contexts and interaction patterns create bias entry points beyond the traditional ML pipeline.
2. Key Concepts
Human-AI Interaction Biases
Human-AI interaction biases emerge from the dynamic relationship between AI systems and their human users, where user behaviors and system responses influence each other in ways that can create or amplify unfairness. This concept is fundamental to AI fairness because even technically "fair" systems can produce unfair outcomes when deployed in real-world contexts with complex human interactions.
Human-AI interaction connects to other bias types through feedback loops: user behaviors influence system responses, which in turn shape subsequent user behaviors. These interactions can amplify biases present in data or algorithms or introduce entirely new biases not observable in pre-deployment testing.
Research by Buccinca et al. (2021) demonstrates how human interpretation of AI recommendations varies systematically across demographic groups, creating disparate outcomes even when the underlying recommendations have similar accuracy. Their experiments showed that participants from historically marginalized groups were both more likely to distrust accurate AI recommendations and more likely to accept inaccurate ones compared to participants from dominant groups, reflecting different historical experiences with technological systems (Buccinca et al., 2021).
For example, in an automated hiring system, qualified candidates from different demographic backgrounds might interact differently with the interface based on their technological familiarity or trust in automated systems. These differential interaction patterns could create disparate outcomes even if the underlying algorithm treats all applicants identically.
For the Bias Source Identification Tool we will develop, understanding human-AI interaction biases will help identify how post-deployment user behaviors create fairness challenges that pre-deployment testing cannot capture. This analysis enables the development of monitoring systems that track interaction patterns across demographic groups and detect emerging disparities before they create significant harms.
Interface and Accessibility Biases
Interface and accessibility biases arise when system design choices—including visual layouts, language use, interaction modalities, and documentation—create barriers that disproportionately impact certain demographic groups. These biases are crucial for AI fairness because they determine who can effectively access and use AI systems, directly influencing who benefits from their capabilities.
This concept connects to data collection biases through selection effects: when interfaces are less accessible to certain groups, those groups become underrepresented in user data, creating a feedback loop that further diminishes system performance for these populations. Similarly, inaccessible interfaces can obscure algorithmic biases by preventing affected groups from identifying and reporting unfair outcomes.
Johnson, Anand, and Fox (2022) documented how voice-based AI assistants consistently exhibited lower accuracy rates for non-native English speakers, speakers with regional accents, and individuals with speech impediments. Their research found that these accessibility disparities stemmed not just from training data imbalances but also from interface design choices that provided insufficient feedback and correction options—problems that particularly impacted users who already experienced recognition difficulties (Johnson, Anand, & Fox, 2022).
These issues extend beyond voice interfaces to all interaction modalities. For instance, complex text-based interfaces may disadvantage users with lower literacy levels or cognitive disabilities, while visual interfaces without proper screen reader support exclude individuals with visual impairments. These accessibility barriers can create systematic patterns of exclusion that disproportionately impact already marginalized groups.
For our Bias Source Identification Tool, analyzing interface and accessibility biases will help identify how design choices can create or amplify fairness disparities during deployment. This analysis will guide the development of inclusive design practices that ensure AI systems remain accessible across diverse user populations.
Infrastructure and Resource Disparities
Infrastructure and resource disparities refer to systematic differences in technological infrastructure, computational resources, and technical expertise that create uneven access to AI systems across populations and geographic regions. This concept is essential for AI fairness because even technically unbiased systems can produce unfair outcomes when deployment infrastructures ensure that benefits flow primarily to already-advantaged communities.
These disparities interact with data collection biases by determining which communities generate sufficient data to influence system behavior, creating a feedback loop where data-poor communities become increasingly marginalized. Similarly, infrastructure limitations can amplify algorithm biases by preventing certain communities from accessing alternative services when automated systems fail them.
Work by Sambasivan et al. (2021) on "data cascades" demonstrates how infrastructure disparities create cascading failures in AI deployments across low-resource environments. Their research in India documented how limited connectivity, device constraints, and infrastructure gaps transformed moderate performance disparities in benchmarking environments into complete system failures for marginalized communities in real deployments (Sambasivan et al., 2021).
These effects extend globally. For example, facial recognition systems may work perfectly in well-lit environments with high-resolution cameras but fail completely in areas with inconsistent electricity or where users access services through low-end devices. Similarly, language models may perform well for languages with robust digital infrastructures while remaining inaccessible to communities with limited connectivity, creating a "digital language divide" that reinforces existing inequities.
For the Bias Source Identification Tool, understanding infrastructure and resource disparities will help identify how deployment contexts transform theoretical fairness properties into actual fairness outcomes. This analysis enables the development of deployment strategies that account for infrastructure constraints rather than assuming ideal conditions.
Organizational Implementation Contexts
Organizational implementation contexts—including institutional priorities, workflows, incentive structures, and decision processes—fundamentally shape how AI systems function in practice and who bears the costs and benefits of automation. This concept is crucial for AI fairness because the same technical system can produce radically different fairness outcomes depending on how organizations implement it within their operations.
These organizational contexts connect to algorithm biases by determining how predictions translate into decisions. For example, the same risk assessment algorithm can have drastically different fairness implications depending on how organizations use its outputs—whether they serve as one factor among many in human decisions or as automated cutoffs that directly determine outcomes.
Research by Madaio et al. (2020) demonstrates how organizational factors critically influence fairness outcomes in deployed AI systems. Their study of fairness tool adoption across organizations revealed that technical interventions frequently failed not due to algorithmic limitations but because of misalignments with existing workflows, incentive structures, and organizational priorities (Madaio et al., 2020).
For instance, an automated resume screening system might produce similar prediction distributions across demographic groups, but organizational factors—such as how recruiters interpret confidence scores, which candidates receive additional human review, or how automation integrates with existing hiring workflows—can transform these distributions into significantly different hiring outcomes.
For our Bias Source Identification Tool, analyzing organizational implementation contexts will help identify how institutional factors create or mitigate fairness disparities beyond the technical system itself. This analysis will guide the development of implementation practices that align technical capabilities with organizational processes to promote fairness throughout deployment.
Domain Modeling Perspective
From a domain modeling perspective, system interaction and deployment biases map to specific components of sociotechnical AI systems:
- User Interface Layer: How system interfaces influence the accessibility and usability of AI capabilities across different user groups.
- Deployment Infrastructure: How technical resources and constraints shape system performance in different operational environments.
- Organizational Workflows: How institutional processes integrate AI outputs into human decision-making and operational procedures.
- Monitoring Systems: How post-deployment data collection and performance tracking capture or miss emerging fairness disparities.
- Feedback Mechanisms: How user experiences and outcomes flow back into system improvements or reinforcement of existing biases.
This domain mapping helps you understand how deployment contexts and human interactions influence fairness outcomes beyond the technical system itself. The Bias Source Identification Tool will incorporate this mapping to help practitioners identify fairness considerations that extend throughout the sociotechnical deployment environment rather than stopping at algorithmic outputs.

Conceptual Clarification
To clarify these abstract concepts, consider the following analogies:
- Human-AI interaction biases function like a musical performance where both the musician and instrument adapt to each other. Just as a musician might compensate for an instrument's quirks while the instrument responds differently to various playing techniques, users and AI systems constantly adjust to each other's behaviors. This dynamic relationship means the same AI "instrument" can produce dramatically different "performances" depending on who is "playing" it and how, creating disparities that are impossible to predict by examining either the instrument or musician in isolation.
- Infrastructure and resource disparities resemble irrigation systems distributing water across different agricultural regions. Even if the water source (the AI system) contains no contaminants (biases), fields with robust irrigation infrastructure receive adequate water while those with damaged or limited infrastructure receive only a trickle—or none at all. Over time, this uneven distribution creates highly fertile regions alongside drought-stricken areas, a disparity stemming not from water quality but from infrastructure differences. Similarly, AI benefits flow primarily to communities with robust technological infrastructure, creating digital divides that mirror and reinforce existing resource inequalities.
- Organizational implementation contexts can be understood through the analogy of transplanting a tree. The same tree (AI system) can thrive or wither depending on the soil conditions, climate, and care practices in its new environment. An organization's policies, workflows, and incentive structures create the "soil" that determines whether fairness considerations flourish or fade after deployment. Even a carefully cultivated "fairness tree" will struggle in organizational "soil" lacking the nutrients of accountability, transparency, and diverse perspectives.
Intersectionality Consideration
System interaction and deployment biases present unique challenges for intersectional fairness, where multiple aspects of identity create distinct patterns of advantage and disadvantage that cannot be understood by examining single attributes in isolation. Deployment contexts can amplify these intersectional effects in several ways:
First, differential access patterns often intensify at demographic intersections. Ogbonnaya-Ogburu et al. (2020) demonstrated how Black women face distinct barriers in technological interactions that differ qualitatively from those experienced by either Black men or white women. Their research showed that voice assistants, for example, performed particularly poorly for women with non-standard accents, creating intersectional accessibility barriers that single-axis analyses would miss (Ogbonnaya-Ogburu et al., 2020).
Second, trust disparities and interaction patterns show complex intersectional variations. Individuals at specific demographic intersections may have distinct historical experiences with technological and institutional systems that shape their interaction behaviors in ways that cannot be predicted from examining individual identity dimensions separately.
Finally, organizational contexts often create unique challenges for individuals at specific intersections. For instance, automated performance evaluation systems might work adequately for both women and people of color as aggregate groups while still failing for women of color who face distinct patterns of stereotyping and evaluation not captured in single-attribute analyses.
For the Bias Source Identification Tool, addressing intersectionality in deployment contexts requires:
- Monitoring system performance across demographic intersections rather than only aggregate groups;
- Designing interfaces and deployment strategies that account for the specific needs of intersectional populations; and
- Analyzing organizational implementation patterns for potentially disparate impacts on individuals at demographic intersections.
By explicitly incorporating these intersectional considerations, the framework will help identify subtle deployment biases that might otherwise remain undetected.
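To make the first of those requirements concrete, here is a minimal sketch (in Python, with hypothetical column names such as `gender`, `age_band`, `label`, and `pred`) that disaggregates a simple accuracy metric across every observed demographic intersection rather than across single attributes:

```python
import pandas as pd

def intersectional_report(df: pd.DataFrame, attrs: list, outcome: str, prediction: str) -> pd.DataFrame:
    """Accuracy and sample size for every observed intersection of the `attrs` columns."""
    # Mark each row as correctly predicted or not, then aggregate per intersection.
    work = df.assign(correct=(df[outcome] == df[prediction]).astype(float))
    report = work.groupby(attrs)["correct"].agg(n="count", accuracy="mean")
    return report.sort_values("accuracy")

# Hypothetical usage:
# intersectional_report(interaction_logs, ["gender", "age_band"], outcome="label", prediction="pred")
```

Note that small intersectional subgroups produce noisy estimates, so the sample-size column matters as much as the metric itself when deciding whether an observed gap warrants investigation.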
3. Practical Considerations
Implementation Framework
To systematically identify and address system interaction and deployment biases, implement the following structured methodology:
- Deployment Context Analysis:
  - Document the technological infrastructure available across different deployment environments.
  - Analyze resource disparities that might create performance variations across communities.
  - Assess organizational workflows and decision processes that will incorporate system outputs.
  - Map potential gaps between development environments and actual deployment contexts.
- Interaction Pattern Monitoring:
  - Implement logging systems that track user interaction patterns across demographic groups.
  - Analyze differences in system usage, navigation paths, and feature utilization.
  - Monitor trust indicators such as override rates, second opinions, or abandonment patterns.
  - Establish baselines and thresholds for acceptable variation in interaction metrics.
- Accessibility Evaluation:
  - Conduct systematic audits of interface accessibility across different user capabilities.
  - Test system performance across device types, connection speeds, and technical environments.
  - Evaluate documentation clarity and support resources for diverse user populations.
  - Implement regular accessibility testing with users from underrepresented groups.
- Organizational Integration Assessment:
  - Analyze how system outputs integrate into organizational decision processes.
  - Document override policies, exception handling, and escalation procedures.
  - Evaluate incentive structures that might encourage or discourage fair system use.
  - Assess accountability mechanisms for addressing identified fairness issues.
These methodologies integrate with standard ML operations workflows by extending deployment monitoring beyond technical performance to explicitly incorporate fairness considerations. While adding complexity to deployment processes, they help identify critical fairness issues that pre-deployment testing cannot capture.
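As one way to operationalize the Interaction Pattern Monitoring step, the sketch below assumes a hypothetical interaction log with boolean `overrode_ai` and `completed` fields plus a group column, and flags groups whose override or abandonment rates deviate from the overall rate by more than a chosen tolerance:

```python
import pandas as pd

def flag_interaction_disparities(log: pd.DataFrame, group_col: str, tolerance: float = 0.10) -> pd.DataFrame:
    """Per-group override and abandonment rates, flagged when they diverge from the overall rates."""
    rates = log.groupby(group_col).agg(
        n=("overrode_ai", "size"),
        override_rate=("overrode_ai", "mean"),
        abandonment_rate=("completed", lambda s: 1 - s.mean()),
    )
    overall_override = log["overrode_ai"].mean()
    overall_abandon = 1 - log["completed"].mean()
    rates["override_gap"] = rates["override_rate"] - overall_override
    rates["abandonment_gap"] = rates["abandonment_rate"] - overall_abandon
    # Flag any group whose gap exceeds the tolerance in either direction.
    rates["flagged"] = (rates["override_gap"].abs() > tolerance) | (rates["abandonment_gap"].abs() > tolerance)
    return rates
```

The tolerance here is a placeholder; in practice the acceptable-variation threshold should come from the domain-specific baselines described above.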
Implementation Challenges
When implementing these approaches, practitioners commonly encounter the following challenges:
- Limited Post-Deployment Data Collection: Many organizations lack systematic monitoring of how systems function across different user groups and contexts. Address this by:
  - Implementing privacy-preserving logging systems that capture key interaction metrics;
  - Developing proxy measures when direct demographic data are unavailable; and
  - Creating sampling approaches to collect detailed interaction data from representative user subsets.
- Cross-Functional Coordination Barriers: Addressing deployment biases requires collaboration across technical, operational, and policy teams. Address this by:
  - Establishing clear roles and responsibilities for monitoring fairness in deployed systems;
  - Creating cross-functional committees with representation from technical, operational, and compliance perspectives; and
  - Developing shared metrics and reporting mechanisms that communicate fairness concerns across organizational boundaries.
Successfully implementing deployment bias monitoring requires resources beyond standard ML operations, including expanded logging infrastructure, cross-functional coordination mechanisms, and potentially specialized expertise in areas like accessibility testing or organizational analysis. However, these investments help prevent costly fairness failures that might otherwise emerge only after significant harm has occurred.
Evaluation Approach
To assess whether your deployment bias monitoring is effective, implement these evaluation strategies:
- Deployment Disparity Tracking:
  - Calculate performance disparities across different deployment environments and user groups.
  - Set thresholds for acceptable variation based on domain-specific fairness requirements.
  - Implement trend analysis to identify emerging disparities before they create significant harm.
- Interaction Equity Metrics:
  - Develop metrics for interaction equity that capture whether different user groups can effectively utilize system capabilities.
  - Track override rates, completion times, and error recovery patterns across demographic groups.
  - Establish baselines that account for legitimate variation while flagging potential fairness concerns.
- Organizational Impact Assessment:
  - Evaluate how system implementation affects decision outcomes across different populations.
  - Compare automated and human decision patterns to identify potential amplification effects.
  - Document organizational responses to identified fairness concerns and their effectiveness.
These metrics should be integrated with your organization's broader fairness assessment framework, providing crucial post-deployment insights that complement pre-deployment fairness evaluations.
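A minimal sketch of the Deployment Disparity Tracking idea, assuming a monitoring table with hypothetical `environment`, `group`, `y_true`, and `y_pred` columns, might look like this:

```python
import pandas as pd

def deployment_disparity(monitoring: pd.DataFrame, threshold: float = 0.05) -> pd.DataFrame:
    """Accuracy per (environment, group) cell, with the gap to the best-performing cell."""
    monitoring = monitoring.assign(correct=(monitoring["y_true"] == monitoring["y_pred"]).astype(float))
    cells = monitoring.groupby(["environment", "group"])["correct"].agg(n="count", accuracy="mean")
    cells["gap_to_best"] = cells["accuracy"].max() - cells["accuracy"]
    # The threshold stands in for a domain-specific acceptable-variation limit.
    cells["exceeds_threshold"] = cells["gap_to_best"] > threshold
    return cells.sort_values("gap_to_best", ascending=False)
```

Running this report on a rolling window, rather than once, is what turns it into the trend analysis described above.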
4. Case Study: Public Benefits Eligibility System
Scenario Context
A government agency is implementing an AI-based system to streamline eligibility determinations for public benefits programs, including food assistance, healthcare subsidies, and housing support. The system analyzes application information to predict eligibility, flag potential verification issues, and recommend benefit levels. Key stakeholders include program administrators seeking efficiency, applicants from diverse backgrounds with varying needs, caseworkers transitioning to new workflows, and oversight bodies monitoring program integrity and equity.
Fairness is particularly critical in this context because the system directly impacts access to essential resources for vulnerable populations. Deployment biases could create new barriers for those already struggling with economic insecurity, potentially undermining the programs' fundamental mission of providing support to those in need.
Problem Analysis
Applying core concepts from this Unit reveals several potential deployment biases in the benefits eligibility system:
- Human-AI Interaction Biases: Analysis of early pilot deployments shows systematic differences in how applicants from different backgrounds interact with the system. Elderly applicants and those with limited English proficiency frequently abandon online applications midway through the process, reverting to paper applications that take longer to process. Additionally, applicants from historically marginalized communities show higher rates of accepting automated eligibility determinations without appeal, even when the system produces questionable results, reflecting different levels of institutional trust and perceived agency.
- Interface and Accessibility Barriers: The system interface, while compliant with basic accessibility guidelines, presents several practical barriers. The online application requires broadband internet access and functions poorly on mobile devices, which disproportionately impacts rural and low-income applicants. Documentation is primarily available in English and Spanish, excluding speakers of other languages common in certain communities. Additionally, the authentication process relies on credit history verification, creating barriers for unbanked or underbanked populations.
- Infrastructure and Resource Disparities: Deployment across different geographic regions reveals significant infrastructure challenges. Rural areas with limited internet connectivity show substantially lower online application completion rates. Community support organizations in underresourced areas lack the technical capacity to assist applicants effectively, creating regional disparities in access that mirror existing resource inequalities. Additionally, the system performs poorly on older devices commonly used in low-income communities.
- Organizational Implementation Contexts: The transition to the new system involves significant changes to caseworker workflows and decision processes. Analysis shows that different regional offices implement the system differently—some using it as a decision aid while others follow automated recommendations with minimal review. These inconsistent implementation patterns create regional variations in approval rates and verification requirements that disproportionately impact certain demographic groups. Furthermore, performance metrics focusing on processing speed incentivize caseworkers to minimize manual reviews, potentially reducing attention to complex cases.
From an intersectional perspective, the system creates particular challenges for rural elderly applicants and non-English-speaking applicants with limited digital literacy, who face multiple overlapping barriers that create near-complete exclusion from the streamlined process.
Solution Implementation
To address these identified deployment biases, the agency implemented a structured approach:
- For Human-AI Interaction Biases, they:
  - Developed an interaction monitoring system that tracked completion rates, time spent on different sections, and abandonment patterns across demographic groups;
  - Implemented proactive support interventions when the system detected confusion or abandonment risk patterns; and
  - Created a simplified appeal process with clear explanations of determination factors and applicant rights.
- For Interface and Accessibility Barriers, they:
  - Redesigned the interface for mobile responsiveness, recognizing that over 60% of low-income applicants primarily access the internet via smartphones;
  - Expanded language support to include the ten most common languages in the service area, with a clear process for requesting additional language assistance; and
  - Implemented alternative authentication methods that didn't rely exclusively on credit history or formal identification.
- For Infrastructure and Resource Disparities, they:
  - Established community access points with reliable internet connections and compatible devices in areas with connectivity challenges;
  - Provided technical training and support resources to community organizations serving marginalized populations; and
  - Developed an offline application mode that could function with intermittent connectivity, automatically synchronizing when connection was restored.
- For Organizational Implementation Contexts, they:
  - Created standardized implementation guidelines that ensured consistent system use across regional offices;
  - Revised performance metrics to include both efficiency and equity measures, preventing optimization for speed alone; and
  - Implemented mandatory review processes for certain case types where automated systems historically showed limitations.
Throughout implementation, they maintained explicit focus on intersectional effects, ensuring that their interventions addressed the specific challenges faced by applicants at the intersection of multiple marginalized identities.
Outcomes and Lessons
The implementation resulted in significant improvements across multiple dimensions:
- Application completion rates increased by 45% for elderly applicants and 62% for applicants with limited English proficiency.
- Geographic disparities in successful application rates decreased by 37%, while maintaining overall program integrity.
- Appeal rates became more consistent across demographic groups, suggesting more equitable initial determinations.
Key challenges remained, including ongoing device compatibility issues and the need for continuous monitoring as community demographics evolved.
The most generalizable lessons included:
- The critical importance of monitoring interaction patterns post-deployment, which revealed fairness issues that pre-deployment testing had completely missed.
- The value of flexible implementation approaches that could adapt to different infrastructure environments rather than assuming uniform deployment contexts.
- The necessity of aligning organizational metrics and incentives with fairness goals rather than focusing exclusively on efficiency.
These insights directly inform the development of the Bias Source Identification Tool, particularly in creating monitoring approaches that track fairness across the full deployment lifecycle rather than focusing exclusively on pre-deployment testing.
5. Frequently Asked Questions
FAQ 1: Balancing Innovation With Deployment Monitoring
Q: How can we implement robust deployment bias monitoring without creating excessive delays in product innovation cycles or placing unreasonable burdens on development teams?
A: Integrate deployment monitoring incrementally, starting with high-risk touchpoints where fairness disparities would create the most significant harm. Begin with lightweight monitoring that tracks a few key metrics across the most salient demographic dimensions, then expand as expertise and infrastructure develop. Automate data collection and analysis where possible to reduce manual effort, and develop standardized dashboards that make monitoring results immediately actionable. Most importantly, frame deployment monitoring as an essential quality practice that enhances product value rather than just a compliance burden. Just as security monitoring has become integrated into standard development practices rather than treated as an afterthought, fairness monitoring should become a normal part of responsible deployment processes that actually accelerates innovation by identifying issues before they create costly failures.
FAQ 2: Addressing Deployment Biases With Limited Control
Q: What approaches can practitioners implement when they identify deployment biases but have limited control over infrastructure environments or organizational implementation contexts?
A: Start by documenting identified deployment biases with specific metrics and examples, creating visibility for issues that might otherwise remain unacknowledged. Develop clear estimates of their impact on system performance and fairness across different user groups. Next, create tiered recommendations that include both ideal interventions and pragmatic mitigations within existing constraints. For infrastructure limitations, design graceful degradation approaches that maintain core functionality in resource-constrained environments. For organizational factors, identify minimal workflow adjustments that could significantly improve fairness outcomes even without comprehensive change. Finally, build coalitions with stakeholders who share fairness concerns, including user advocates, compliance teams, and reputation-conscious leadership. Framing fairness improvements as risk mitigation rather than optional enhancements often increases organizational receptivity, particularly in regulated domains where biased outcomes could create legal or reputational risks.
6. Summary and Next Steps
Key Takeaways
System interaction and deployment biases emerge when AI systems move from development environments to real-world contexts, creating fairness challenges that pre-deployment testing cannot identify. The key concepts from this Unit include:
- Human-AI interaction biases that emerge from the dynamic relationship between systems and users, where different interaction patterns across demographic groups can create disparate outcomes even with technically "fair" algorithms.
- Interface and accessibility barriers that determine which users can effectively access AI capabilities, potentially creating systematic exclusion of already marginalized populations.
- Infrastructure and resource disparities that transform moderate performance differences into complete functionality gaps for underresourced communities, reinforcing existing digital divides.
- Organizational implementation contexts that determine how system outputs translate into actual decisions, potentially creating fairness disparities through workflow integration, incentive structures, and institutional priorities.
These concepts directly address our guiding questions by explaining how deployment contexts transform theoretical fairness properties into actual outcomes and by providing systematic approaches to identify, monitor, and mitigate biases that emerge during system operation.
Application Guidance
To apply these concepts in your practical work:
- Implement monitoring systems that track performance and interaction patterns across different user groups and deployment environments.
- Design interfaces and deployment strategies that account for infrastructure variations and accessibility needs rather than assuming ideal conditions.
- Assess how organizational workflows and decision processes interact with system outputs to create actual outcomes for different populations.
- Establish feedback mechanisms that capture emerging fairness issues during operation rather than waiting for major disparities to become evident.
For organizations new to these considerations, start by identifying the highest-risk touchpoints where deployment biases would create the most significant harm, then progressively expand monitoring as experience and capabilities develop.
Looking Ahead
In the next Unit (Unit 5), we will synthesize insights from all previous Units to develop a comprehensive Bias Source Identification Tool. This framework will provide a structured approach for identifying potential bias sources throughout the AI lifecycle—from data collection through deployment and monitoring.
The deployment biases we have examined here will form a critical component of this framework, extending bias analysis beyond the traditional ML pipeline to include the full sociotechnical system. By understanding how biases manifest and transform throughout the entire system lifecycle, you will be able to implement more effective and comprehensive fairness strategies that address root causes rather than just symptoms.
References
Buccinca, R., Varshney, M., Doroudi, S., & Zimmerman, J. (2021). Proxy tasks and subjective measures can be misleading in evaluating explainable AI systems. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3411764.3445717
Johnson, K. T., Anand, S., & Fox, B. (2022). The hidden biases of voice interfaces: Disparate accuracy and effort across demographic groups. International Journal of Human-Computer Studies, 166, 102848. https://doi.org/10.1016/j.ijhcs.2022.102848
Madaio, M. A., Stark, L., Wortman Vaughan, J., & Wallach, H. (2020). Co-designing checklists to understand organizational challenges and opportunities around fairness in AI. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (pp. 1-14). https://doi.org/10.1145/3313831.3376445
Ogbonnaya-Ogburu, I. F., Smith, A. D., To, A., & Toyama, K. (2020). Critical race theory for HCI. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (pp. 1-16). https://doi.org/10.1145/3313831.3376392
Sambasivan, N., Kapania, S., Highfill, H., Akrong, D., Paritosh, P., & Aroyo, L. M. (2021). "Everyone wants to do the model work, not the data work": Data cascades in high-stakes AI. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (pp. 1-15). https://doi.org/10.1145/3411764.3445518
Selbst, A. D., Boyd, D., Friedler, S. A., Venkatasubramanian, S., & Vertesi, J. (2019). Fairness and abstraction in sociotechnical systems. In Proceedings of the Conference on Fairness, Accountability, and Transparency (pp. 59-68). https://doi.org/10.1145/3287560.3287598
Veale, M., Van Kleek, M., & Binns, R. (2018). Fairness and accountability design needs for algorithmic support in high-stakes public sector decision-making. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (pp. 1-14). https://doi.org/10.1145/3173574.3174014
Wachter, S., Mittelstadt, B., & Russell, C. (2021). Why fairness cannot be automated: Bridging the gap between EU non-discrimination law and AI. Computer Law & Security Review, 41, 105567. https://doi.org/10.1016/j.clsr.2021.105567
Yang, Q., Steinfeld, A., Rosé, C., & Zimmerman, J. (2020). Re-examining whether, why, and how human-AI interaction is uniquely difficult to design. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (pp. 1-13). https://doi.org/10.1145/3313831.3376301
Unit 5
Unit 5: Bias Source Identification Tool
1. Introduction
In Part 3, you learned about various types and sources of bias in AI systems, from data collection through deployment. You examined how biases enter at different stages of the ML lifecycle and how they can propagate through feedback loops. Now it's time to apply these insights by developing a practical tool that helps identify potential bias sources in AI systems. The Bias Source Identification Tool you'll create will serve as the third component of the Sprint 1 Project - Fairness Audit Playbook, ensuring that fairness assessments address all relevant bias entry points.
2. Context
Imagine you are a staff engineer at a tech company that uses AI systems across multiple products. The engineering team developing an AI-powered internal loan application system has approached you again. Using the Historical Context Assessment Tool and the Fairness Definition Selection Tool to analyze the system they are building, they have identified relevant historical patterns and selected appropriate fairness definitions. Now they want to understand where biases might be entering their system and how to prioritize their mitigation efforts.
After discussions with the team, you've agreed to develop a tool that will help them systematically map potential bias entry points throughout their ML pipeline. You'll also prepare a short case study demonstrating how to use your tool for their loan application system.
You've realized that their challenge once again represents a broader opportunity: developing a tool that all teams can use to systematically identify potential bias sources in their AI applications. You've named it the "Bias Source Identification Tool."
3. Objectives
By completing this project component, you will practice:
- Translating theoretical bias taxonomies into practical identification methodologies for technical audiences.
- Creating structured approaches for prioritizing identified bias sources.
- Balancing comprehensive analysis with practical usability in business environments.
4. Requirements
Your Bias Source Identification Tool must include:
- A taxonomic classification of bias types with indicators and examples for each category.
- A bias detection methodology specifying analytical techniques for each bias type.
- A prioritization framework for focusing assessment resources on highest-risk bias sources.
- User documentation that guides users on how to apply the Bias Source Identification Tool in practice.
- A case study demonstrating the tool's application to an internal loan application system.
5. Sample Solution
The following solution was developed by a former colleague and can serve as an example for your own work. Note that this solution wasn't specifically designed for AI applications and lacks some key components that your tool should include.
5.1 Framework Overview
The Bias Source Identification Tool consists of three integrated components:
- Bias Type Taxonomy: Categorizes different types of bias.
- Detection Methodology: Provides techniques to identify each bias type.
- Prioritization Framework: Guides teams in deciding which issues to address first.
5.2 Bias Type Taxonomy
Historical Bias
- Definition: Bias resulting from pre-existing social inequities, regardless of sampling or feature selection.
- Indicators:
- Target variables reflecting historical discrimination.
- Problematic correlations that mirror societal inequities.
- Patterns that align with known historical discrimination.
- Example: A hiring algorithm trained on historical hiring decisions may perpetuate patterns of gender discrimination in technical roles.
Representation Bias
- Definition: Bias arising from how populations are sampled and measured in training data.
- Indicators:
- Demographic imbalances compared to target population
- Quality disparities across demographic groups
- Systematic measurement differences
- Example: A medical diagnostic system trained primarily on data from young adult males may perform poorly for elderly female patients.
Measurement Bias
- Definition: Bias arising from how attributes are measured, proxied, or operationalized.
- Indicators:
- Different measurement approaches across groups
- Proxy variables with varying accuracy across populations
- Inconsistent label quality across demographics
- Example: Using standardized test scores as a proxy for aptitude may disadvantage groups with less access to test preparation resources.
Aggregation Bias
- Definition: Bias arising from combining distinct populations that may have different relationships between features and outcomes.
- Indicators:
- One-size-fits-all models for heterogeneous populations
- Features with different predictive relationships across groups
- Unexplained performance disparities across subgroups
- Example: A credit scoring model might not account for different cultural approaches to credit usage, creating disparities across ethnic groups.
Learning Bias
- Definition: Bias arising from modeling choices that amplify or create disparities.
- Indicators:
- Algorithms that overfit majority patterns
- Regularization approaches that penalize minority patterns
- Optimization objectives misaligned with fairness goals
- Example: A complex model might learn spurious correlations between protected attributes and outcomes that don't represent causal relationships.
Evaluation Bias
- Definition: Bias arising from testing procedures that don't represent real-world performance or fairness.
- Indicators:
- Test datasets with different characteristics than deployment contexts
- Metrics that don't capture relevant fairness dimensions
- Insufficient disaggregation of performance across groups
- Example: Evaluating a facial recognition system on a test set that doesn't include diverse skin tones will mask potential performance disparities in deployment.
Deployment Bias
- Definition: Bias arising from how systems are implemented and used in practice.
- Indicators:
- Context shifts between training and deployment
- User interactions that reinforce biases
- Feedback loops that amplify initial disparities
- Example: A recommendation system might create filter bubbles that limit exposure diversity based on initial demographic patterns.
5.3 Detection Methodology
Detecting Historical Bias
- Using Historical Context Assessment Tool:
- Extract documented discrimination patterns from the Historical Context Assessment results.
- Identify specific historical mechanisms relevant to your application domain.
- Reference the historical pattern risk classification to prioritize investigation.
- Quantitative Techniques:
- Compare outcome distributions across groups identified as high-risk in the Historical Context Assessment.
- Analyze correlations between system predictions and historical patterns documented in the assessment.
- Test whether current data distributions match historically documented disparities.
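A minimal sketch of the first quantitative technique, assuming a dataset with a group column and a binary outcome column (names are hypothetical), could compare outcome distributions with a chi-square test of independence:

```python
import pandas as pd
from scipy.stats import chi2_contingency

def outcome_distribution_test(df: pd.DataFrame, group_col: str, outcome_col: str):
    """Per-group positive-outcome rates plus a chi-square test of group vs. outcome."""
    table = pd.crosstab(df[group_col], df[outcome_col])
    chi2, p_value, dof, _expected = chi2_contingency(table)
    rates = df.groupby(group_col)[outcome_col].mean()  # assumes a 0/1 outcome column
    return rates, p_value
```

A statistically significant difference is not proof of historical bias on its own; it is a prompt to check whether the pattern matches the mechanisms documented in the Historical Context Assessment.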
Detecting Representation Bias
- Using Historical Context Assessment Tool:
- Reference demographic groups identified as historically underrepresented in your domain.
- Use historical documentation to establish appropriate population benchmarks.
- Identify measurement approaches that have historically varied across groups.
- Quantitative Techniques:
- Compare dataset demographics to population benchmarks established from historical context.
- Analyze missing data patterns for correlation with protected attributes.
- Assess data quality metrics across demographic groups identified in the historical assessment.
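The benchmark comparison could be sketched as follows; the benchmark shares below are placeholders to be replaced with figures derived from your Historical Context Assessment or census-style sources:

```python
import pandas as pd

# Hypothetical target-population shares (placeholders, not real statistics).
BENCHMARKS = {"group_a": 0.51, "group_b": 0.30, "group_c": 0.19}

def representation_gap(df: pd.DataFrame, group_col: str, benchmarks: dict) -> pd.DataFrame:
    """Observed dataset share per group versus its benchmark share."""
    observed = df[group_col].value_counts(normalize=True)
    report = pd.DataFrame({"observed_share": observed, "benchmark_share": pd.Series(benchmarks)})
    report["gap"] = report["observed_share"] - report["benchmark_share"]
    return report.sort_values("gap")
```

Groups with large negative gaps are candidates for targeted data collection or reweighting before model training.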
Detecting Measurement Bias
- Using Fairness Definition Selection Tool:
- Reference the selected fairness definitions to identify which measurement biases are most relevant.
- For individual fairness definitions, focus on detecting inconsistent proxies across similar individuals.
- For group fairness definitions, prioritize detecting systematic measurement differences across groups.
- Quantitative Techniques:
- Test proxy variables for differential accuracy across groups based on your selected fairness criteria.
- Analyze feature distributions to detect encoding schemes that create disparities.
- Measure label consistency across annotators for different demographic groups.
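One way to sketch the proxy-accuracy check, assuming an audit sample where both the proxy and a more direct measure of the underlying construct are available (column names are hypothetical):

```python
import pandas as pd

def proxy_validity_by_group(df: pd.DataFrame, proxy: str, construct: str, group_col: str) -> pd.Series:
    """Pearson correlation between the proxy and the construct, computed within each group."""
    return df.groupby(group_col).apply(lambda g: g[proxy].corr(g[construct]))
```

Markedly lower correlations for some groups indicate that the proxy measures the construct less faithfully for them, which is exactly the kind of differential accuracy this bias type describes.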
Detecting Learning Bias
- Using Fairness Definition Selection Tool:
- Analyze model behavior specifically for violations of your selected fairness definitions.
- For equal opportunity definitions, focus on false negative rate disparities.
- For demographic parity definitions, examine overall prediction rate differences.
- Quantitative Techniques:
- Measure model performance across groups according to your selected fairness metrics.
- Test regularization effects on minority group performance.
- Analyze model behavior against fairness constraints documented in your definition selection.
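For an equal opportunity definition, the false-negative-rate comparison might be sketched as below, with hypothetical `y_true`/`y_pred` column names:

```python
import pandas as pd

def false_negative_rates(df: pd.DataFrame, group_col: str, y_true: str = "y_true", y_pred: str = "y_pred") -> pd.DataFrame:
    """False negative rate per group among true positives, plus the gap to the best group."""
    positives = df[df[y_true] == 1]
    fnr = positives.groupby(group_col).apply(lambda g: (g[y_pred] == 0).mean())
    out = fnr.rename("fnr").to_frame()
    out["gap_to_min"] = out["fnr"] - out["fnr"].min()
    return out
```

The same pattern applies to other fairness definitions by swapping in the relevant metric (e.g., overall positive prediction rates for demographic parity).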
Detection Integration Process
1. Begin by consulting the Historical Context Assessment results to identify highest-risk bias patterns.
2. Reference your selected fairness definitions to determine which bias types most directly threaten your fairness goals.
3. Prioritize detection efforts on bias types that both align with historical patterns and directly impact your chosen fairness definitions.
4. Document each identified bias source with explicit references to both historical context and fairness definition connections.
5.4 Prioritization Framework
Assessment Dimensions
- Severity: Potential harm if the bias source remains unaddressed (1-5 scale).
- Scope: Proportion of decisions or individuals affected (1-5 scale).
- Persistence: Whether effects compound over time through feedback loops (1-5 scale).
- Intervention Feasibility: Relative ease of addressing the bias source (1-5 scale).
- Historical Alignment: Connection to historical patterns identified in Part 1 (1-5 scale).
Priority Calculation
Priority Score = (Severity × 0.3) + (Scope × 0.2) + (Persistence × 0.2) + (Historical Alignment × 0.2) + (Intervention Feasibility × 0.1)
Priority Categories
- High Priority: Score ≥ 4.0
- Medium Priority: 3.0 ≤ Score < 4.0
- Low Priority: Score < 3.0
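The scoring rule and categories above can be captured in a few lines; this sketch simply encodes the stated weights and cutoffs:

```python
# Weights from the Priority Calculation above.
WEIGHTS = {
    "severity": 0.3,
    "scope": 0.2,
    "persistence": 0.2,
    "historical_alignment": 0.2,
    "intervention_feasibility": 0.1,
}

def priority(scores: dict) -> tuple:
    """scores: a 1-5 rating for each assessment dimension. Returns (score, category)."""
    total = sum(WEIGHTS[dim] * scores[dim] for dim in WEIGHTS)
    if total >= 4.0:
        category = "High Priority"
    elif total >= 3.0:
        category = "Medium Priority"
    else:
        category = "Low Priority"
    return round(total, 2), category

# Example: ratings of 4, 5, 4, 5, 5 (as in the "Uniform Recommendation Threshold"
# row of the case study table below) yield (4.5, "High Priority").
```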
6. Case Study: Internal Hiring System
This case study demonstrates how to apply the Bias Source Identification Tool to an AI-powered resume screening system that automatically evaluates job applications and recommends candidates for interviews.
6.1 System Context
The HR department is implementing an AI system to screen resumes for software engineering positions. The system analyzes resumes, extracts relevant information, and ranks candidates based on predicted job performance. Previous Historical Context Assessment identified patterns of gender and age discrimination in tech hiring, and the Fairness Definition Selection process prioritized equal opportunity (equal true positive rates across groups).
6.2 Step 1: System Review
The team reviewed the key components of the resume screening system:
- Data Sources: Historical resumes and performance data from the past five years
- Feature Extraction: Text analysis of resume content, experience calculation, education assessment
- Prediction Task: Binary classification of candidates as "recommend" or "do not recommend"
- Target Variable: Based on historical performance ratings of hired employees
- Decision Threshold: Single cutoff score for all candidates
6.3 Step 2: Bias Source Identification
After applying the detection methodologies, the team identified six potential bias sources:
1. Historical Bias in Performance Ratings
   - Type: Historical Bias
   - Description: Performance review data used to define "high performers" reflects historical gender disparities
   - Evidence: Statistical analysis shows 15% lower ratings for women with equivalent qualifications
2. Representation Imbalance in Training Data
   - Type: Representation Bias
   - Description: Training data has significant gender and age imbalances
   - Evidence: Women constitute only 15% of the training data; candidates over 40 represent 8%
3. Educational Institution as Proxy Variable
   - Type: Measurement Bias
   - Description: University prestige serves as a proxy for socioeconomic status
   - Evidence: Correlation analysis shows 0.72 correlation between institution ranking and socioeconomic indicators
4. Resume Language Pattern Differences
   - Type: Measurement Bias
   - Description: Model interprets gendered language patterns differently
   - Evidence: Analysis shows terms like "led" and "executed" are weighted more positively than "collaborated" and "supported"
5. Precision-Focused Optimization
   - Type: Learning Bias
   - Description: Model optimization favors precision over recall
   - Evidence: False negative rates are 23% higher for women and 18% higher for candidates over 40
6. Uniform Recommendation Threshold
   - Type: Evaluation Bias
   - Description: Single threshold applied across all demographic groups
   - Evidence: With the current threshold, recommendation rates for qualified candidates vary by 25% across gender
6.4 Step 3: Prioritization
| Bias Source | Severity (1-5) | Scope (1-5) | Persistence (1-5) | Historical Alignment (1-5) | Intervention Feasibility (1-5) | Priority Score |
|---|---|---|---|---|---|---|
| Historical Bias in Performance Ratings | 5 | 5 | 4 | 5 | 2 | 4.5 (High) |
| Representation Imbalance | 4 | 5 | 3 | 4 | 3 | 3.9 (Medium) |
| Educational Institution as Proxy | 3 | 4 | 4 | 3 | 4 | 3.5 (Medium) |
| Resume Language Pattern Differences | 4 | 3 | 4 | 4 | 4 | 3.8 (Medium) |
| Precision-Focused Optimization | 5 | 4 | 3 | 4 | 4 | 4.1 (High) |
| Uniform Recommendation Threshold | 4 | 5 | 4 | 5 | 5 | 4.5 (High) |