Fairness (machine learning)
Fairness in machine learning refers to the various attempts at correcting algorithmic bias in automated decision processes based on machine learning models. Decisions made by computers after a machine-learning process may be considered unfair if they were based on variables considered sensitive; examples of these kinds of variables include gender, ethnicity, sexual orientation, and disability. As is the case with many ethical concepts, definitions of fairness and bias are always controversial. In general, fairness and bias are considered relevant when the decision process impacts people's lives. In machine learning, the problem of algorithmic bias is well known and well studied. Outcomes may be skewed by a range of factors and thus might be considered unfair with respect to certain groups or individuals. An example would be the way social media sites deliver personalized news to consumers.
Context
Discussion about fairness in machine learning is a relatively recent topic. Since 2016 there has been a sharp increase in research into the topic. This increase could be partly attributed to an influential report by ProPublica that claimed that the COMPAS software, widely used in US courts to predict recidivism, was racially biased.
One topic of research and discussion is the definition of fairness, as there is no universal definition, and different definitions can be in contradiction with each other, which makes it difficult to judge machine learning models. Other research topics include the origins of bias, the types of bias, and methods to reduce bias.
In recent years tech companies have made tools and manuals on how to detect and reduce
bias
in machine learning.
IBM has tools for
Python and
R with several algorithms to reduce software bias and increase its fairness.
Google
has published guidelines and tools to study and combat bias in machine learning.
Facebook
has reported its use of a tool, Fairness Flow, to detect bias in its AI. However, critics have argued that the company's efforts are insufficient: the tool reportedly sees little use by employees, cannot be applied to all of the company's programs, and its use is optional even where it can be applied.
Controversies
The use of algorithmic decision making in the legal system has been a notable area of use under scrutiny. In 2014, then
U.S. Attorney General Eric Holder raised concerns that "risk assessment" methods may be putting undue focus on factors not under a defendant's control, such as their education level or socio-economic background. The 2016 report by
ProPublica on
COMPAS
claimed that black defendants were almost twice as likely as white defendants to be incorrectly labelled as higher risk, while the opposite mistake was more often made with white defendants.
The creator of
COMPAS
, Northpointe Inc., disputed the report, claiming their tool is fair and that ProPublica made statistical errors, a claim ProPublica subsequently rebutted.
Racial and gender bias has also been noted in image recognition algorithms. Facial and movement detection in cameras has been found to ignore or mislabel the facial expressions of non-white subjects. In 2015, the automatic tagging feature in both
Flickr
and
Google Photos was found to label black people with tags such as "animal" and "gorilla". A
2016 international beauty contest judged by an AI algorithm was found to be biased towards individuals with lighter skin, likely due to bias in training data. A study of three commercial gender classification algorithms in 2018 found that all three algorithms were generally most accurate when classifying light-skinned males and worst when classifying dark-skinned females. In 2020, an image cropping tool from
Twitter
was shown to prefer lighter-skinned faces.
DALL-E, a text-to-image model released in 2021, has been prone to creating racist and sexist images that reinforce societal stereotypes, something its creators have acknowledged.
Machine learning algorithms have also been shown to be biased in other areas, including job and loan applications.
Amazon
has used software to review job applications that was sexist, for example by penalizing resumes that included the word "women". In 2019,
Apple
's algorithm to determine credit card limits for its new Apple Card gave significantly higher limits to men than women, even for couples who shared their finances. A 2021 report by The Markup showed that mortgage-approval algorithms in use in the U.S. were more likely to reject non-white applicants.
Group fairness criteria
In classification problems, an algorithm learns a function to predict a discrete characteristic $Y$, the target variable, from known characteristics $X$. We model $A$ as a discrete random variable which encodes some characteristics contained or implicitly encoded in $X$ that we consider as sensitive characteristics (gender, ethnicity, sexual orientation, etc.). We finally denote by $R$ the prediction of the classifier.
Now let us define three main criteria to evaluate whether a given classifier is fair, that is, whether its predictions are not influenced by these sensitive variables (Solon Barocas, Moritz Hardt and Arvind Narayanan, ''Fairness and Machine Learning'', retrieved 15 December 2019).
Independence
We say the random variables $(R, A)$ satisfy independence if the sensitive characteristics $A$ are statistically independent of the prediction $R$, and we write $R \perp A$.
We can also express this notion with the following formula:
$$P(R = r \mid A = a) = P(R = r \mid A = b) \quad \forall r, \; \forall a, b$$
This means that the classification rate for each target class is equal for people belonging to different groups with respect to the sensitive characteristics $A$.
Yet another equivalent expression for independence can be given using the concept of mutual information between random variables, defined as
$$I(X, Y) = H(X) + H(Y) - H(X, Y)$$
In this formula, $H$ is the entropy of a random variable. Then $(R, A)$ satisfy independence if $I(R, A) = 0$.
A possible relaxation of the independence definition includes introducing a positive slack $\epsilon > 0$ and requiring
$$P(R = r \mid A = a) \geq P(R = r \mid A = b) - \epsilon$$
Finally, another possible relaxation is to require
$$\frac{P(R = r \mid A = a)}{P(R = r \mid A = b)} \geq 1 - \epsilon$$
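For illustration, a minimal Python sketch (on synthetic, hypothetical data; names are ours) checks independence both through the estimated mutual information $I(R, A)$, using scikit-learn's mutual_info_score, and through the per-group acceptance rates $P(R = + \mid A = a)$:
<syntaxhighlight lang="python">
import numpy as np
from sklearn.metrics import mutual_info_score

# Hypothetical audit data: binary predictions R and a sensitive attribute A.
rng = np.random.default_rng(0)
A = rng.integers(0, 2, size=1000)                   # group membership
R = (rng.random(1000) < 0.5 + 0.1 * A).astype(int)  # acceptance slightly favors A = 1

# Independence holds exactly when I(R; A) = 0; in practice we check
# that the estimated mutual information is close to zero.
mi = mutual_info_score(R, A)
print(f"I(R; A) = {mi:.4f} nats")

# Equivalent check on the per-group acceptance rates P(R = + | A = a).
rates = {a: R[A == a].mean() for a in np.unique(A)}
print("acceptance rates per group:", rates)
</syntaxhighlight>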
Separation
We say the random variables $(R, A, Y)$ satisfy separation if the sensitive characteristics $A$ are statistically independent of the prediction $R$ given the target value $Y$, and we write $R \perp A \mid Y$.
We can also express this notion with the following formula:
$$P(R = r \mid Y = q, A = a) = P(R = r \mid Y = q, A = b) \quad \forall r, q, \; \forall a, b$$
This means that all the dependence of the decision $R$ on the sensitive attribute $A$ must be justified by the actual dependence of the true target variable $Y$.
Another equivalent expression, in the case of a binary target, is that the true positive rate and the false positive rate are equal (and therefore the false negative rate and the true negative rate are equal) for every value of the sensitive characteristics:
$$P(R = + \mid Y = +, A = a) = P(R = + \mid Y = +, A = b)$$
$$P(R = + \mid Y = -, A = a) = P(R = + \mid Y = -, A = b)$$
A possible relaxation of the given definitions is to allow the value for the difference between rates to be a positive number lower than a given slack $\epsilon > 0$, rather than strictly zero.
The coefficient of separation, used in credit scoring, is a measure of the distance (at a given level of the probability score) between the ''predicted'' cumulative percent negative and ''predicted'' cumulative percent positive.
The greater this separation coefficient is at a given score value, the more effective the model is at differentiating between the set of positives and negatives at a particular probability cut-off. According to Mayes: "It is often observed in the credit industry that the selection of validation measures depends on the modeling approach. For example, if modeling procedure is parametric or semi-parametric, the
two-sample K-S test is often used. If the model is derived by heuristic or iterative search methods, the measure of model performance is usually
divergence. A third option is the coefficient of separation...The coefficient of separation, compared to the other two methods, seems to be most reasonable as a measure for model performance because it reflects the separation pattern of a model."
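As a sketch of the K-S test mentioned in the quote, the two-sample statistic can be computed directly from the score distributions of actual positives and negatives using scipy.stats.ks_2samp (the data below is hypothetical); the statistic is the maximum distance between the two empirical cumulative distributions, i.e. the cut-off at which the model separates the classes best:
<syntaxhighlight lang="python">
import numpy as np
from scipy.stats import ks_2samp

# Hypothetical credit-model scores for actual negatives and actual positives.
rng = np.random.default_rng(1)
scores_neg = rng.beta(2, 5, size=500)   # negatives concentrated at low scores
scores_pos = rng.beta(5, 2, size=500)   # positives concentrated at high scores

# The K-S statistic is the largest gap between the two empirical CDFs,
# i.e. the best-case separation over all probability cut-offs.
stat, pvalue = ks_2samp(scores_neg, scores_pos)
print(f"K-S statistic = {stat:.3f} (p = {pvalue:.2e})")
</syntaxhighlight>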
Sufficiency
We say the random variables $(R, A, Y)$ satisfy sufficiency if the sensitive characteristics $A$ are statistically independent of the target value $Y$ given the prediction $R$, and we write $Y \perp A \mid R$.
We can also express this notion with the following formula:
$$P(Y = q \mid R = r, A = a) = P(Y = q \mid R = r, A = b) \quad \forall q, r, \; \forall a, b$$
This means that the probability of actually being in each of the groups is equal for two individuals with different sensitive characteristics, given that they were predicted to belong to the same group.
Relationships between definitions
Finally, we sum up some of the main results that relate the three definitions given above:
* Assuming $Y$ is binary, if $A$ and $Y$ are not statistically independent, and $R$ and $Y$ are not statistically independent either, then independence and separation cannot both hold.
* If $(R, A, Y)$ as a joint distribution has positive probability for all its possible values, and $A$ and $Y$ are not statistically independent, then separation and sufficiency cannot both hold.
Mathematical formulation of group fairness definitions
Preliminary definitions
Most statistical measures of fairness rely on different metrics, so we will start by defining them. When working with a binary classifier, both the predicted and the actual classes can take two values: positive and negative. Now let us start explaining the different possible relations between predicted and actual outcomes (Sahil Verma and Julia Rubin, "Fairness definitions explained", ''2018 IEEE/ACM International Workshop on Software Fairness (FairWare)'', pp. 1–7, IEEE, 2018):
* True positive (TP): The case where both the predicted and the actual outcome are in a positive class.
* True negative (TN): The case where both the predicted outcome and the actual outcome are assigned to the negative class.
* False positive (FP): The case where the predicted outcome is in the positive class but the actual outcome is in the negative one.
* False negative (FN): The case where the predicted outcome is in the negative class but the actual outcome is in the positive one.
These relations can be easily represented with a
confusion matrix
, a table that describes the accuracy of a classification model. In this matrix, columns and rows represent instances of the predicted and the actual cases, respectively.
By using these relations, we can define multiple metrics which can be later used to measure the fairness of an algorithm:
* Positive predicted value (PPV): the fraction of positive cases which were correctly predicted out of all the positive predictions. It is usually referred to as precision, and represents the probability of a correct positive prediction. It is given by the following formula:
$$PPV = \frac{TP}{TP + FP}$$
* False discovery rate (FDR): the fraction of positive predictions which were actually negative out of all the positive predictions. It represents the probability of an erroneous positive prediction, and it is given by the following formula:
$$FDR = \frac{FP}{TP + FP}$$
* Negative predicted value (NPV): the fraction of negative cases which were correctly predicted out of all the negative predictions. It represents the probability of a correct negative prediction, and it is given by the following formula:
$$NPV = \frac{TN}{TN + FN}$$
* False omission rate (FOR): the fraction of negative predictions which were actually positive out of all the negative predictions. It represents the probability of an erroneous negative prediction, and it is given by the following formula:
$$FOR = \frac{FN}{TN + FN}$$
* True positive rate (TPR): the fraction of positive cases which were correctly predicted out of all the positive cases. It is usually referred to as sensitivity or recall, and it represents the probability of the positive subjects being classified correctly as such. It is given by the formula:
$$TPR = \frac{TP}{TP + FN}$$
* False negative rate (FNR): the fraction of positive cases which were incorrectly predicted to be negative out of all the positive cases. It represents the probability of the positive subjects being classified incorrectly as negative ones, and it is given by the formula:
$$FNR = \frac{FN}{TP + FN}$$
* True negative rate (TNR): the fraction of negative cases which were correctly predicted out of all the negative cases. It represents the probability of the negative subjects being classified correctly as such, and it is given by the formula:
$$TNR = \frac{TN}{TN + FP}$$
* False positive rate (FPR): the fraction of negative cases which were incorrectly predicted to be positive out of all the negative cases. It represents the probability of the negative subjects being classified incorrectly as positive ones, and it is given by the formula:
$$FPR = \frac{FP}{TN + FP}$$
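These eight rates follow mechanically from the four confusion-matrix counts; a minimal, self-contained helper (the function name is ours, not from a particular library):
<syntaxhighlight lang="python">
def confusion_rates(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Derive the eight rates defined above from confusion-matrix counts."""
    return {
        "PPV": tp / (tp + fp),   # precision
        "FDR": fp / (tp + fp),
        "NPV": tn / (tn + fn),
        "FOR": fn / (tn + fn),
        "TPR": tp / (tp + fn),   # sensitivity / recall
        "FNR": fn / (tp + fn),
        "TNR": tn / (tn + fp),
        "FPR": fp / (tn + fp),
    }

# Example: 40 true positives, 10 false positives, 35 true negatives, 15 false negatives.
print(confusion_rates(tp=40, fp=10, tn=35, fn=15))
</syntaxhighlight>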

The following criteria can be understood as measures of the three general definitions given at the beginning of this section, namely independence, separation and sufficiency.
To define these measures specifically, we will divide them into three big groups as done in Verma et al.:
definitions based on a predicted outcome, on predicted and actual outcomes, and definitions based on predicted probabilities and the actual outcome.
We will be working with a binary classifier and the following notation: $S$ refers to the score given by the classifier, which is the probability of a certain subject being in the positive or the negative class. $R$ represents the final classification predicted by the algorithm, and its value is usually derived from $S$; for example, $R$ will be positive when $S$ is above a certain threshold. $Y$ represents the actual outcome, that is, the real classification of the individual. Finally, $A$ denotes the sensitive attributes of the subjects.
Definitions based on predicted outcome
The definitions in this section focus on a predicted outcome $R$ for various distributions of subjects. They are the simplest and most intuitive notions of fairness.
* Demographic parity, also referred to as statistical parity, acceptance rate parity and benchmarking. A classifier satisfies this definition if the subjects in the protected and unprotected groups have equal probability of being assigned to the positive predicted class. That is, if the following formula is satisfied:
$$P(R = + \mid A = a) = P(R = + \mid A = b)$$
* Conditional statistical parity. Basically consists of the definition above, but restricted only to a subset of the instances, characterized by a set of legitimate attributes $L$. In mathematical notation this would be:
$$P(R = + \mid L = l, A = a) = P(R = + \mid L = l, A = b) \quad \forall l$$
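As an illustrative sketch (synthetic data; the legitimate attribute $L$ and all names are hypothetical choices of ours), demographic parity and its conditional variant can be audited by comparing positive-prediction rates across groups:
<syntaxhighlight lang="python">
import numpy as np

def demographic_parity_gap(R: np.ndarray, A: np.ndarray) -> float:
    """Absolute gap |P(R=+|A=0) - P(R=+|A=1)| for a binary sensitive attribute."""
    return abs(R[A == 0].mean() - R[A == 1].mean())

rng = np.random.default_rng(2)
A = rng.integers(0, 2, size=2000)
L = rng.integers(0, 3, size=2000)                     # a legitimate attribute
R = (rng.random(2000) < 0.4 + 0.05 * A).astype(int)   # acceptance slightly favors A = 1

print("overall gap:", demographic_parity_gap(R, A))
# Conditional statistical parity: the same gap within each stratum L = l.
for l in np.unique(L):
    print(f"gap given L={l}:", demographic_parity_gap(R[L == l], A[L == l]))
</syntaxhighlight>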
Definitions based on predicted and actual outcomes
These definitions consider not only the predicted outcome $R$ but also compare it to the actual outcome $Y$.
* Predictive parity, also referred to as outcome test. A classifier satisfies this definition if the subjects in the protected and unprotected groups have equal PPV. That is, if the following formula is satisfied:
$$P(Y = + \mid R = +, A = a) = P(Y = + \mid R = +, A = b)$$
: Mathematically, if a classifier has equal PPV for both groups, it will also have equal FDR, satisfying the formula:
$$P(Y = - \mid R = +, A = a) = P(Y = - \mid R = +, A = b)$$
* False positive error rate balance, also referred to as predictive equality. A classifier satisfies this definition if the subjects in the protected and unprotected groups have equal FPR. That is, if the following formula is satisfied:
$$P(R = + \mid Y = -, A = a) = P(R = + \mid Y = -, A = b)$$
: Mathematically, if a classifier has equal FPR for both groups, it will also have equal TNR, satisfying the formula:
$$P(R = - \mid Y = -, A = a) = P(R = - \mid Y = -, A = b)$$
* False negative error rate balance, also referred to as equal opportunity. A classifier satisfies this definition if the subjects in the protected and unprotected groups have equal FNR. That is, if the following formula is satisfied:
$$P(R = - \mid Y = +, A = a) = P(R = - \mid Y = +, A = b)$$
: Mathematically, if a classifier has equal FNR for both groups, it will also have equal TPR, satisfying the formula:
$$P(R = + \mid Y = +, A = a) = P(R = + \mid Y = +, A = b)$$
* Equalized odds, also referred to as conditional procedure accuracy equality and disparate mistreatment. A classifier satisfies this definition if the subjects in the protected and unprotected groups have equal TPR and equal FPR, satisfying the formula:
$$P(R = + \mid Y = y, A = a) = P(R = + \mid Y = y, A = b) \quad \forall y \in \{+, -\}$$
* Conditional use accuracy equality. A classifier satisfies this definition if the subjects in the protected and unprotected groups have equal PPV and equal NPV, satisfying the formula:
$$P(Y = y \mid R = y, A = a) = P(Y = y \mid R = y, A = b) \quad \forall y \in \{+, -\}$$
* Overall accuracy equality. A classifier satisfies this definition if the subjects in the protected and unprotected groups have equal prediction accuracy, that is, the probability of a subject from one class to be assigned to it. That is, if it satisfies the following formula:
$$P(R = Y \mid A = a) = P(R = Y \mid A = b)$$
* Treatment equality. A classifier satisfies this definition if the subjects in the protected and unprotected groups have an equal ratio of FN and FP, satisfying the formula:
$$\frac{FN_{a}}{FP_{a}} = \frac{FN_{b}}{FP_{b}}$$
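Most of the criteria in this list reduce to comparing per-group confusion-matrix rates. The following hypothetical audit (synthetic data and names of our choosing) computes PPV, TPR and FPR per group, from which predictive parity, equal opportunity, predictive equality and equalized odds can all be read off as gaps between the two rows:
<syntaxhighlight lang="python">
import numpy as np

def group_rates(y: np.ndarray, r: np.ndarray) -> dict:
    """PPV, TPR and FPR for one group, from actual labels y and predictions r."""
    tp = np.sum((r == 1) & (y == 1))
    fp = np.sum((r == 1) & (y == 0))
    tn = np.sum((r == 0) & (y == 0))
    fn = np.sum((r == 0) & (y == 1))
    return {"PPV": tp / (tp + fp), "TPR": tp / (tp + fn), "FPR": fp / (fp + tn)}

rng = np.random.default_rng(3)
A = rng.integers(0, 2, size=5000)
Y = rng.integers(0, 2, size=5000)
R = np.where(rng.random(5000) < 0.8, Y, 1 - Y)  # a noisy predictor

for a in (0, 1):
    print(f"A={a}:", group_rates(Y[A == a], R[A == a]))
# Equalized odds asks the TPR and FPR entries to match across groups;
# predictive parity asks the PPV entries to match.
</syntaxhighlight>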
Definitions based on predicted probabilities and actual outcome
These definitions are based on the actual outcome $Y$ and the predicted probability score $S$.
* Test-fairness, also known as calibration or matching conditional frequencies. A classifier satisfies this definition if individuals with the same predicted probability score $S$ have the same probability of being classified in the positive class whether they belong to the protected or the unprotected group:
$$P(Y = + \mid S = s, A = a) = P(Y = + \mid S = s, A = b) \quad \forall s$$
* Well-calibration is an extension of the previous definition. It states that when individuals inside or outside the protected group have the same predicted probability score $S$, they must have the same probability of being classified in the positive class, and this probability must be equal to $s$:
$$P(Y = + \mid S = s, A = a) = P(Y = + \mid S = s, A = b) = s \quad \forall s$$
* Balance for positive class. A classifier satisfies this definition if the subjects constituting the positive class from both protected and unprotected groups have an equal average predicted probability score $S$. This means that the expected value of the probability score for the protected and unprotected groups with positive actual outcome $Y$ is the same, satisfying the formula:
$$E[S \mid Y = +, A = a] = E[S \mid Y = +, A = b]$$
* Balance for negative class. A classifier satisfies this definition if the subjects constituting the negative class from both protected and unprotected groups have an equal average predicted probability score $S$. This means that the expected value of the probability score for the protected and unprotected groups with negative actual outcome $Y$ is the same, satisfying the formula:
$$E[S \mid Y = -, A = a] = E[S \mid Y = -, A = b]$$
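Test-fairness and well-calibration can be probed empirically by binning the score $S$ and comparing the observed positive rate per bin across groups. A rough sketch on synthetic data (the bin edges and all names are arbitrary choices of ours):
<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(4)
n = 10000
A = rng.integers(0, 2, size=n)
S = rng.random(n)                        # predicted probability scores
Y = (rng.random(n) < S).astype(int)      # outcomes drawn so the scores are calibrated

bins = np.linspace(0.0, 1.0, 6)          # five score bins
for lo, hi in zip(bins[:-1], bins[1:]):
    in_bin = (S >= lo) & (S < hi)
    rates = [Y[in_bin & (A == a)].mean() for a in (0, 1)]
    print(f"S in [{lo:.1f}, {hi:.1f}):  "
          f"P(Y=+|A=0)={rates[0]:.2f}  P(Y=+|A=1)={rates[1]:.2f}")
# Test-fairness: the two columns match per bin;
# well-calibration: both also equal the bin's score value.
</syntaxhighlight>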
Social welfare function
Some scholars have proposed defining algorithmic fairness in terms of a
social welfare function
. They argue that using a social welfare function enables an algorithm designer to consider fairness and predictive accuracy in terms of their benefits to the people affected by the algorithm. It also allows the designer to
trade off efficiency and equity in a principled way.
Sendhil Mullainathan has stated that algorithm designers should use social welfare functions in order to recognize absolute gains for disadvantaged groups. For example, a study found that using a decision-making algorithm in
pretrial detention rather than pure human judgment reduced the detention rates for Blacks, Hispanics, and racial minorities overall, even while keeping the crime rate constant.
Individual fairness criteria
An important distinction among fairness definitions is the one between group and individual notions.
(Ninareh Mehrabi, Fred Morstatter, Nripsuta Saxena, Kristina Lerman and Aram Galstyan, "A survey on bias and fairness in machine learning", ''ACM Computing Surveys (CSUR)'' 54, no. 6 (2021): 1–35.) Roughly speaking, while group fairness criteria compare quantities at a group level, typically identified by sensitive attributes (e.g. gender, ethnicity, age, etc.), individual criteria compare individuals. In other words, individual fairness follows the principle that "similar individuals should receive similar treatments".
There is a very intuitive approach to fairness, which usually goes under the name of Fairness Through Unawareness (FTU), or ''Blindness'', that prescribes not explicitly employing sensitive features when making (automated) decisions. This is effectively a notion of individual fairness, since two individuals differing only in the values of their sensitive attributes would receive the same outcome.
However, in general, FTU is subject to several drawbacks, the main one being that it does not take into account possible correlations between sensitive attributes and the non-sensitive attributes employed in the decision-making process. For example, an agent with the (malicious) intention to discriminate on the basis of gender could introduce into the model a proxy variable for gender (i.e. a variable highly correlated with gender), effectively using gender information while remaining compliant with the FTU prescription.
The problem of ''what variables correlated to sensitive ones are fairly employable by a model'' in the decision-making process is a crucial one, and is relevant for group concepts as well: independence metrics require a complete removal of sensitive information, while separation-based metrics allow for correlation, but only insofar as the labeled target variable "justifies" it.
The most general concept of individual fairness was introduced in the pioneering work by Dwork and collaborators in 2012, and can be thought of as a mathematical translation of the principle that the decision map taking features as input should "map similar individuals similarly", which is expressed as a Lipschitz condition on the model map. They call this approach Fairness Through Awareness (FTA), precisely as a counterpoint to FTU, since they underline the importance of choosing an appropriate target-related distance metric to assess which individuals are ''similar'' in specific situations. Again, this problem is closely related to the point raised above about which variables can be seen as "legitimate" in particular contexts.
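As a toy illustration of the Lipschitz condition, one can estimate how often a model violates $|f(x_1) - f(x_2)| \leq L \cdot d(x_1, x_2)$ over sampled pairs; in the sketch below (entirely hypothetical), a plain Euclidean metric stands in for the context-dependent similarity metric that FTA leaves to be chosen:
<syntaxhighlight lang="python">
import numpy as np

def lipschitz_violations(f, X: np.ndarray, L: float = 1.0, n_pairs: int = 1000) -> float:
    """Fraction of sampled pairs violating |f(x1) - f(x2)| <= L * ||x1 - x2||."""
    rng = np.random.default_rng(5)
    i = rng.integers(0, len(X), size=n_pairs)
    j = rng.integers(0, len(X), size=n_pairs)
    out_gap = np.abs(f(X[i]) - f(X[j]))           # distance between outputs
    in_gap = np.linalg.norm(X[i] - X[j], axis=1)  # distance between individuals
    return np.mean(out_gap > L * in_gap + 1e-12)

X = np.random.default_rng(6).normal(size=(500, 3))
f = lambda x: 1 / (1 + np.exp(-x @ np.array([0.5, -0.2, 0.1])))  # a smooth scoring model
print("violation rate:", lipschitz_violations(f, X, L=1.0))
</syntaxhighlight>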
Causality-based metrics
Causal fairness measures the frequency with which two nearly identical users or applications, differing only in a set of characteristics with respect to which resource allocation must be fair, receive identical treatment.
An entire branch of the academic research on fairness metrics is devoted to leverage causal models to assess bias in
machine learning
models. This approach is usually justified by the fact that the same observational distribution of data may hide different causal relationships among the variables at play, possibly with different interpretations of whether the outcome is affected by some form of bias or not.
Kusner et al. (M. J. Kusner, J. Loftus, C. Russell and R. Silva (2017), "Counterfactual fairness", ''Advances in Neural Information Processing Systems'' 30) propose to employ counterfactuals, and define a decision-making process counterfactually fair if, for any individual, the outcome does not change in the counterfactual scenario where the sensitive attributes are changed. The mathematical formulation reads:
$$P(\hat{Y}_{A \leftarrow a}(U) = y \mid X = x, A = a) = P(\hat{Y}_{A \leftarrow a'}(U) = y \mid X = x, A = a) \quad \forall y, \; \forall a'$$
that is: given a random individual with sensitive attribute $A = a$ and other features $X = x$, and the same individual if she had $A = a'$, they should have the same chance of being accepted.
The symbol $\hat{Y}_{A \leftarrow a'}(U)$ represents the counterfactual random variable $\hat{Y}$ in the scenario where the sensitive attribute $A$ is fixed to $a'$. The conditioning on $X = x, A = a$ means that this requirement is at the individual level, in that we are conditioning on all the variables identifying a single observation.
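Evaluating this criterion exactly requires a causal model of how $A$ influences the other features. A common but strictly weaker diagnostic, sketched below on hypothetical data, is an attribute-flip test that holds $X$ fixed while toggling $A$; note that it ignores the downstream effects of $A$ on $X$, which is exactly the gap counterfactual fairness is designed to close:
<syntaxhighlight lang="python">
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(7)
n = 2000
A = rng.integers(0, 2, size=n)
X = rng.normal(size=(n, 2)) + 0.5 * A[:, None]  # features influenced by A
Y = (rng.random(n) < 1 / (1 + np.exp(-(X[:, 0] + A)))).astype(int)

model = LogisticRegression().fit(np.column_stack([X, A]), Y)

# Flip test: change A while holding X fixed, and compare predicted probabilities.
p_orig = model.predict_proba(np.column_stack([X, A]))[:, 1]
p_flip = model.predict_proba(np.column_stack([X, 1 - A]))[:, 1]
print("mean |p(A=a) - p(A=a')| =", np.abs(p_orig - p_flip).mean())
</syntaxhighlight>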
Machine learning models are often trained upon data where the outcome depended on the decision made at that time. For example, if a machine learning model has to determine whether an inmate will recidivate and will determine whether the inmate should be released early, the outcome could be dependent on whether the inmate was released early or not. Mishler et al.
propose a formula for counterfactual equalized odds:
$$P(R = + \mid Y^{(d)} = y, A = a) = P(R = + \mid Y^{(d)} = y, A = b) \quad \forall y, \; \forall a, b$$
where $Y^{(d)}$ is a random variable denoting the outcome given that the decision $d$ was taken, and $A$ is a sensitive feature.
Bias mitigation strategies
Fairness can be applied to machine learning algorithms in three different ways:
data preprocessing,
optimization during software training, or post-processing results of the algorithm.
Preprocessing
Usually, the classifier is not the only problem; the dataset is also biased. The discrimination of a dataset $D$ with respect to the group $A = a$ can be defined as follows:
$$disc_{A=a}(D) = \frac{|\{X \in D \mid X(A) \neq a, X(Y) = +\}|}{|\{X \in D \mid X(A) \neq a\}|} - \frac{|\{X \in D \mid X(A) = a, X(Y) = +\}|}{|\{X \in D \mid X(A) = a\}|}$$
That is, an approximation of the difference between the probabilities of belonging to the positive class given that the subject has a protected characteristic different from $a$ and equal to $a$.
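This definition translates directly into code; a minimal sketch on a hypothetical dataset (names are ours):
<syntaxhighlight lang="python">
import numpy as np

def disc(Y: np.ndarray, A: np.ndarray, a) -> float:
    """P(Y=+ | A != a) - P(Y=+ | A = a), estimated from the dataset."""
    return Y[A != a].mean() - Y[A == a].mean()

rng = np.random.default_rng(8)
A = rng.integers(0, 2, size=1000)
Y = (rng.random(1000) < 0.3 + 0.2 * (A != 0)).astype(int)  # group 0 is disadvantaged
print("discrimination against A=0:", disc(Y, A, a=0))
</syntaxhighlight>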
Algorithms correcting bias at preprocessing remove information about dataset variables which might result in unfair decisions, while trying to alter the data as little as possible. This is not as simple as just removing the sensitive variable, because other attributes can be correlated to the protected one.
A way to do this is to map each individual in the initial dataset to an intermediate representation in which it is impossible to identify whether it belongs to a particular protected group, while maintaining as much information as possible. Then, the new representation of the data is adjusted to get the maximum accuracy in the algorithm.
This way, individuals are mapped into a new multivariable representation where the probability of any member of a protected group being mapped to a certain value in the new representation is the same as that of an individual who doesn't belong to the protected group. Then, this representation is used to obtain the prediction for the individual, instead of the initial data. As the intermediate representation is constructed giving the same probability to individuals inside or outside the protected group, this attribute is hidden from the classifier.
An example is explained in Zemel et al. (Richard Zemel, Yu (Ledell) Wu, Kevin Swersky, Toniann Pitassi and Cynthia Dwork, ''Learning Fair Representations'', retrieved 1 December 2019), where a multinomial random variable is used as an intermediate representation. In the process, the system is encouraged to preserve all information except that which can lead to biased decisions, and to obtain a prediction as accurate as possible.
On the one hand, this procedure has the advantage that the preprocessed data can be used for any machine learning task. Furthermore, the classifier does not need to be modified, as the correction is applied to the dataset before processing. On the other hand, other methods obtain better results in accuracy and fairness (Ziyuan Zhong, ''Tutorial on Fairness in Machine Learning'', retrieved 1 December 2019).
Reweighing
Reweighing is an example of a preprocessing algorithm. The idea is to assign a weight to each dataset point such that the weighted discrimination is 0 with respect to the designated group (Faisal Kamiran and Toon Calders, ''Data preprocessing techniques for classification without discrimination'', retrieved 17 December 2019).
If the dataset $D$ was unbiased, the sensitive variable $A$ and the target variable $Y$ would be statistically independent, and the probability of the joint distribution would be the product of the probabilities as follows:
$$P_{exp}(A = a \wedge Y = +) = P(A = a) \times P(Y = +) = \frac{|\{X \in D \mid X(A) = a\}|}{|D|} \times \frac{|\{X \in D \mid X(Y) = +\}|}{|D|}$$
In reality, however, the dataset is not unbiased and the variables are not statistically independent, so the observed probability is:
$$P_{obs}(A = a \wedge Y = +) = \frac{|\{X \in D \mid X(A) = a \wedge X(Y) = +\}|}{|D|}$$
To compensate for the bias, the software adds a weight, lower for favored objects and higher for unfavored objects. For each $X \in D$ we get:
$$W(X) = \frac{P_{exp}(A = X(A) \wedge Y = X(Y))}{P_{obs}(A = X(A) \wedge Y = X(Y))}$$
When for each $X$ we have an associated weight $W(X)$, we compute the weighted discrimination with respect to group $A = a$ as follows:
$$disc_{W, A=a}(D) = \frac{\sum_{X(A) \neq a,\, X(Y) = +} W(X)}{\sum_{X(A) \neq a} W(X)} - \frac{\sum_{X(A) = a,\, X(Y) = +} W(X)}{\sum_{X(A) = a} W(X)}$$
It can be shown that after reweighing this weighted discrimination is 0.
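A minimal implementation of these weights on synthetic data (variable names are illustrative); the final check confirms that the weighted discrimination vanishes:
<syntaxhighlight lang="python">
import numpy as np

def reweighing_weights(Y: np.ndarray, A: np.ndarray) -> np.ndarray:
    """W(X) = P_exp(A = X(A), Y = X(Y)) / P_obs(A = X(A), Y = X(Y))."""
    W = np.empty(len(Y))
    for a in np.unique(A):
        for y in np.unique(Y):
            p_exp = np.mean(A == a) * np.mean(Y == y)   # product of marginals
            p_obs = np.mean((A == a) & (Y == y))        # observed joint frequency
            W[(A == a) & (Y == y)] = p_exp / p_obs
    return W

rng = np.random.default_rng(9)
A = rng.integers(0, 2, size=1000)
Y = (rng.random(1000) < 0.3 + 0.3 * A).astype(int)  # a biased dataset
W = reweighing_weights(Y, A)

def weighted_disc(Y, A, W, a=0):
    out, ins = (A != a), (A == a)
    return (W[out & (Y == 1)].sum() / W[out].sum()
            - W[ins & (Y == 1)].sum() / W[ins].sum())

print("weighted discrimination:", weighted_disc(Y, A, W))  # ~0 after reweighing
</syntaxhighlight>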
Inprocessing
Another approach is to correct the bias at training time. This can be done by adding constraints to the optimization objective of the algorithm (Muhammad Bilal Zafar, Isabel Valera, Manuel Gómez Rodríguez and Krishna P. Gummadi, ''Fairness Beyond Disparate Treatment & Disparate Impact: Learning Classification without Disparate Mistreatment'', retrieved 1 December 2019). These constraints force the algorithm to improve fairness by keeping the same rates of certain measures for the protected group and the rest of the individuals. For example, we can add to the objective of the algorithm the condition that the false positive rate is the same for individuals in the protected group and the ones outside the protected group.
The main measures used in this approach are false positive rate, false negative rate, and overall misclassification rate. It is possible to add just one or several of these constraints to the objective of the algorithm. Note that the equality of false negative rates implies the equality of true positive rates, and so implies equality of opportunity. After adding the restrictions, the problem may become intractable, so a relaxation of them may be needed.
This technique obtains good results in improving fairness while keeping high accuracy, and it lets the programmer choose the fairness measures to improve. However, each machine learning task may need a different method to be applied and the code in the classifier needs to be modified, which is not always possible.
Adversarial debiasing
We train two classifiers at the same time through some gradient-based method (e.g. gradient descent). The first one, the ''predictor'', tries to accomplish the task of predicting $Y$, the target variable, given $X$, the input, by modifying its weights $W$ to minimize some loss function $L_P(\hat{y}, y)$. The second one, the ''adversary'', tries to accomplish the task of predicting $A$, the sensitive variable, given $\hat{y}$, by modifying its weights $U$ to minimize some loss function $L_A(\hat{a}, a)$.
(Brian Hu Zhang, Blake Lemoine and Margaret Mitchell, ''Mitigating Unwanted Biases with Adversarial Learning'', retrieved 17 December 2019; Joyce Xu, ''Algorithmic Solutions to Algorithmic Bias: A Technical Guide'', retrieved 17 December 2019.)
An important point here is that, in order to propagate correctly, $\hat{y}$ above must refer to the raw output of the classifier, not the discrete prediction; for example, with an artificial neural network and a classification problem, $\hat{y}$ could refer to the output of the softmax layer.
Then we update $U$ to minimize $L_A$ at each training step according to the gradient $\nabla_U L_A$, and we modify $W$ according to the expression:
$$\nabla_W L_P - \text{proj}_{\nabla_W L_A} \nabla_W L_P - \alpha \nabla_W L_A$$
where $\alpha$ is a tuneable hyperparameter that can vary at each time step.

The intuitive idea is that we want the ''predictor'' to try to minimize $L_P$ (hence the term $\nabla_W L_P$) while, at the same time, maximizing $L_A$ (hence the term $- \alpha \nabla_W L_A$), so that the ''adversary'' fails at predicting the sensitive variable from $\hat{y}$.
The term $- \text{proj}_{\nabla_W L_A} \nabla_W L_P$ prevents the ''predictor'' from moving in a direction that helps the ''adversary'' decrease its loss function.
It can be shown that training a ''predictor'' classification model with this algorithm improves
demographic parity with respect to training it without the ''adversary''.
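A bare-bones NumPy sketch of this update rule, with a logistic-regression predictor and a logistic adversary that sees only the predictor's score; this is a toy illustration of the scheme under our own synthetic setup, not the implementation of Zhang et al.:
<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(10)
n = 2000
A = rng.integers(0, 2, size=n).astype(float)
X = np.column_stack([rng.normal(size=n) + A, rng.normal(size=n), np.ones(n)])
Y = (rng.random(n) < 1 / (1 + np.exp(-X[:, 0]))).astype(float)

sigmoid = lambda z: 1 / (1 + np.exp(-z))
W = np.zeros(3)          # predictor weights
U = np.zeros(2)          # adversary weights (slope, intercept)
lr, alpha = 0.1, 1.0     # learning rate and the tuneable hyperparameter

for step in range(500):
    y_hat = sigmoid(X @ W)                   # predictor's raw score
    a_hat = sigmoid(U[0] * y_hat + U[1])     # adversary's guess of A from the score

    # Adversary update: plain gradient descent on its own loss L_A.
    gU = np.array([np.mean((a_hat - A) * y_hat), np.mean(a_hat - A)])
    U -= lr * gU

    # Gradients of both losses with respect to the predictor weights W.
    gW_P = X.T @ (y_hat - Y) / n                          # d L_P / d W
    dLA_dyhat = (a_hat - A) * U[0] * y_hat * (1 - y_hat)  # chain rule through y_hat
    gW_A = X.T @ dLA_dyhat / n                            # d L_A / d W

    # Predictor update: grad(L_P) - proj_{grad(L_A)} grad(L_P) - alpha * grad(L_A).
    proj = (gW_P @ gW_A) / (gW_A @ gW_A + 1e-12) * gW_A
    W -= lr * (gW_P - proj - alpha * gW_A)

y_hat = sigmoid(X @ W)
print("acceptance rates:", {a: (y_hat[A == a] > 0.5).mean() for a in (0.0, 1.0)})
</syntaxhighlight>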
Postprocessing
The final method tries to correct the results of a classifier to achieve fairness. In this method, we have a classifier that returns a score for each individual and we need to make a binary prediction for them. High scores are likely to get a positive outcome, while low scores are likely to get a negative one, but we can adjust the
threshold
to determine when to answer yes as desired. Note that variations in the threshold value affect the trade-off between the rates for true positives and true negatives.
If the score function is fair in the sense that it is independent of the protected attribute, then any choice of the threshold will also be fair, but classifiers of this type tend to be biased, so a different threshold may be required for each protected group to achieve fairness.
A way to do this is to plot the true positive rate against the false positive rate at various threshold settings (this is called the ROC curve) and find a threshold where the rates for the protected group and other individuals are equal
(Moritz Hardt, Eric Price and Nathan Srebro, ''Equality of Opportunity in Supervised Learning'', retrieved 1 December 2019).
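A sketch of the per-group threshold search on hypothetical scores, in the spirit of this postprocessing approach: for each group, pick the threshold whose true positive rate comes closest to a common target (the target value and all names are our own choices):
<syntaxhighlight lang="python">
import numpy as np

def tpr_at(scores, y, t):
    """True positive rate when predicting positive for scores >= t."""
    pred = scores >= t
    return np.sum(pred & (y == 1)) / np.sum(y == 1)

def group_thresholds(scores, y, A, target_tpr=0.8):
    """Per-group thresholds whose TPR is closest to a common target."""
    grid = np.linspace(0, 1, 101)
    return {a: min(grid, key=lambda t: abs(tpr_at(scores[A == a], y[A == a], t)
                                           - target_tpr))
            for a in np.unique(A)}

rng = np.random.default_rng(11)
A = rng.integers(0, 2, size=4000)
Y = rng.integers(0, 2, size=4000)
S = np.clip(0.5 * Y + 0.1 * A + rng.normal(0, 0.2, 4000), 0, 1)  # biased scores

print(group_thresholds(S, Y, A))  # the disadvantaged group gets a lower threshold
</syntaxhighlight>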
The advantages of postprocessing include that the technique can be applied after any classifier, without modifying it, and that it performs well on fairness measures. The cons are the need to access the protected attribute at test time and the lack of choice in the balance between accuracy and fairness.
Reject option based classification
Given a classifier, let $P(+ \mid X)$ be the probability computed by the classifier as the probability that the instance $X$ belongs to the positive class +. When $P(+ \mid X)$ is close to 1 or to 0, the instance $X$ is specified with a high degree of certainty to belong to class + or − respectively. However, when $P(+ \mid X)$ is closer to 0.5 the classification is more unclear (Faisal Kamiran, Asim Karim and Xiangliang Zhang, ''Decision Theory for Discrimination-aware Classification'', retrieved 17 December 2019).
We say $X$ is a "rejected instance" if $\max(P(+ \mid X),\, 1 - P(+ \mid X)) \leq \theta$ for a certain $\theta$ such that $0.5 < \theta < 1$.
The algorithm of "ROC" consists on classifying the non-rejected instances following the rule above and the rejected instances as follows: if the instance is an example of a deprived group (
) then label it as positive, otherwise, label it as negative.
We can optimize different measures of discrimination as functions of $\theta$ to find the optimal $\theta$ for each problem and avoid becoming discriminatory against the privileged group.
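A compact sketch of the ROC relabeling rule ($\theta$ and the deprived-group encoding are free, hypothetical choices):
<syntaxhighlight lang="python">
import numpy as np

def reject_option_classify(p_pos: np.ndarray, A: np.ndarray,
                           theta: float = 0.6, deprived: int = 0) -> np.ndarray:
    """Label by the usual 0.5 rule; in the rejection band, favor the deprived group."""
    certainty = np.maximum(p_pos, 1 - p_pos)
    labels = (p_pos >= 0.5).astype(int)   # non-rejected instances: usual rule
    rejected = certainty <= theta         # uncertain band around 0.5
    labels[rejected] = (A[rejected] == deprived).astype(int)
    return labels

rng = np.random.default_rng(12)
A = rng.integers(0, 2, size=10)
p = rng.random(10)
print(np.column_stack([p.round(2), A, reject_option_classify(p, A)]))
</syntaxhighlight>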
See also
*
Algorithmic bias
*
Machine learning