Send feedback

Fairness: Types of bias Stay organized with collections Save and categorize content based on your preferences.

outlined_flag
  • Machine learning models can be susceptible to bias due to human involvement in data selection and curation.

  • Understanding common human biases is crucial for mitigating their impact on model predictions.

  • This webpage explores various types of biases, including reporting bias, historical bias, and automation bias, among others, providing definitions and examples for each.

  • Selection bias, group attribution bias, and implicit bias are also discussed, with subtypes and illustrations.

  • While not exhaustive, the presented biases highlight potential areas of concern when developing and evaluating machine learning models.

Machine learning (ML) models are not inherently objective. ML practitioners train models by feeding them a dataset of training examples, and human involvement in the provision and curation of this data can make a model's predictions susceptible to bias.

When building models, it's important to be aware of common human biases that can manifest in your data, so you can take proactive steps to mitigate their effects.

Note: The following inventory of biases provides just a small selection of biases that are often uncovered in machine learning datasets; this list is not intended to be exhaustive. Wikipedia's catalog of cognitive biases enumerates over 100 different types of human bias that can affect our judgment. When auditing your data, beware of any and all potential sources of bias that might skew your model's predictions.

Reporting bias

Historical bias

Automation bias

Selection bias

Selection bias occurs if a dataset's examples are chosen in a way that is not reflective of their real-world distribution. Selection bias can take many different forms, including coverage bias, non-response bias, and sampling bias.

Coverage bias

Non-Response bias

Sampling bias

Group attribution bias

Group attribution bias is a tendency to generalize what is true of individuals to the entire group to which they belong. Group attribution bias often manifests in the two following forms.

In-group bias

Out-group homogeneity bias

Implicit Bias

Confirmation bias

Experimenter's bias

Exercise: Check your understanding

Which of the following types of bias could have contributed to the skewed predictions in the college admissions model described

in the introduction

?

Historical bias

The admissions model was trained on student records from the past 20 years. If minority students were underrepresented in this data, the model could have reproduced the same historical inequities when making predictions on new student data.

In-group bias

The admissions model was trained by current university students, who could have had an unconscious preference for admitting students that came from backgrounds similar to their own, which could have affected how they curated or feature-engineered the data on which the model was trained.

Confirmation bias

The admissions model was trained by current university students, who likely had preexisting beliefs about what types of qualifications correlate with success in the computer science program. They could have inadvertently curated or feature-engineered the data so that the model affirmed these existing beliefs.

Automation bias

Automation bias might explain why the admissions committee chose to use an ML model to make admissions decisions; they might have believed an automated system would produce better results than decisions made by humans. However, automation bias doesn't provide any insight into why the model's predictions ended up being skewed.

Send feedback

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-12-03 UTC.

Need to tell us more? [[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-12-03 UTC."],[],[]]