Genetic privacy involves the concept of personal
privacy
Privacy (, ) is the ability of an individual or group to seclude themselves or information about themselves, and thereby express themselves selectively.
The domain of privacy partially overlaps with security, which can include the concepts of a ...
concerning the storing, repurposing, provision to third parties, and displaying of information pertaining to one's
genetic information
A nucleic acid sequence is a succession of Nucleobase, bases within the nucleotides forming alleles within a DNA (using GACT) or RNA (GACU) molecule. This succession is denoted by a series of a set of five different letters that indicate the orde ...
. This concept also encompasses privacy regarding the ability to identify specific individuals by their
genetic sequence
Genetic may refer to:
*Genetics, in biology, the science of genes, heredity, and the variation of organisms
**Genetic, used as an adjective, refers to genes
*** Genetic disorder, any disorder caused by a genetic mutation, whether inherited or de no ...
, and the potential to gain information on specific characteristics about that person via portions of their genetic information, such as their propensity for specific diseases or their immediate or distant ancestry.
With the public release of genome sequence information of participants in large-scale research studies, questions regarding participant privacy have been raised. In some cases, it has been shown that it is possible to identify previously anonymous participants from large-scale genetic studies that released gene sequence information.
Genetic privacy concerns also arise in the context of criminal law because the government can sometimes overcome criminal suspects' genetic privacy interests and obtain their DNA sample. Due to the shared nature of genetic information between family members, this raises privacy concerns of relatives as well.
[Ram, Natalie (July 2018). "Incidental Informants: Police Can Use Genealogy Databases to Help Identify Criminal Relatives--But Should They?". ''Maryland Bar Journal''.]
As concerns and issues of genetic privacy are raised, regulations and policies have been developed in the United States both at a federal and state level.
Significance of genetic information
In the majority of cases, an individual's
genetic sequence
Genetic may refer to:
*Genetics, in biology, the science of genes, heredity, and the variation of organisms
**Genetic, used as an adjective, refers to genes
*** Genetic disorder, any disorder caused by a genetic mutation, whether inherited or de no ...
is considered unique to that individual. One notable exception to this rule in humans is the case of
identical twins
Twins are two offspring produced by the same pregnancy.MedicineNet > Definition of Twin Last Editorial Review: 19 June 2000 Twins can be either ''monozygotic'' ('identical'), meaning that they develop from one zygote, which splits and forms two e ...
, who have nearly identical genome sequences at birth. In the remainder of cases, one's genetic fingerprint is considered specific to a particular person and is regularly used in the identification of individuals in the case of establishing innocence or guilt in legal proceedings via
DNA profiling
DNA profiling (also called DNA fingerprinting and genetic fingerprinting) is the process of determining an individual's deoxyribonucleic acid (DNA) characteristics. DNA analysis intended to identify a species, rather than an individual, is cal ...
. Specific gene variants one's genetic code, known as
allele
An allele is a variant of the sequence of nucleotides at a particular location, or Locus (genetics), locus, on a DNA molecule.
Alleles can differ at a single position through Single-nucleotide polymorphism, single nucleotide polymorphisms (SNP), ...
s, have been shown to have strong predictive effects in the occurrences of diseases, such as the
BRCA1
Breast cancer type 1 susceptibility protein is a protein that in humans is encoded by the ''BRCA1'' () gene. Orthologs are common in other vertebrate species, whereas invertebrate genomes may encode a more distantly related gene. ''BRCA1'' is a ...
and
BRCA2
''BRCA2'' and BRCA2 () are human genes and their protein products, respectively. The official symbol (BRCA2, italic for the gene, nonitalic for the protein) and the official name (originally breast cancer 2; currently BRCA2, DNA repair associate ...
mutant genes in
Breast Cancer
Breast cancer is a cancer that develops from breast tissue. Signs of breast cancer may include a Breast lump, lump in the breast, a change in breast shape, dimpling of the skin, Milk-rejection sign, milk rejection, fluid coming from the nipp ...
and
Ovarian Cancer
Ovarian cancer is a cancerous tumor of an ovary. It may originate from the ovary itself or more commonly from communicating nearby structures such as fallopian tubes or the inner lining of the abdomen. The ovary is made up of three different ...
, or PSEN1, PSEN2, and APP genes in
early-onset Alzheimer's disease
Early-onset Alzheimer's disease (EOAD), also called younger-onset Alzheimer's disease (YOAD), is Alzheimer's disease diagnosed before the age of 65. It is an uncommon form of Alzheimer's, accounting for only 5–10% of all Alzheimer's cases. Ab ...
. Additionally, gene sequences are passed down with a
regular pattern of inheritance between generations, and can therefore reveal one's ancestry via
genealogical DNA test
A genealogical DNA test is a DNA-based Genetic testing, genetic test used in genetic genealogy that looks at specific locations of a person's genome in order to find or verify ancestral genealogical relationships, or (with lower reliability) to ...
ing. Additionally with knowledge of the sequence of one's biological relatives, traits can be compared that allow relationships between individuals, or the lack thereof, to be determined, as is often done in
DNA paternity testing
DNA paternity testing uses DNA profiling, DNA profiles to determine whether an individual is the biology, biological parent of another individual. Paternity testing can be essential when the rights and duties of the father are in issue, and a ch ...
. As such, one's genetic code can be used to infer many characteristics about an individual, including many potentially sensitive subjects such as:
* Parentage / Non-paternity
*
Consanguinity
Consanguinity (from Latin '':wikt: consanguinitas, consanguinitas'' 'blood relationship') is the characteristic of having a kinship with a relative who is descended from a common ancestor.
Many jurisdictions have laws prohibiting people who are ...
* Adoptive Status
* Ancestry
* Propensity for Disease
* Predicted Physical Characteristics
Sources of genetic information

Common specimen types for direct-to-consumer genetic testing are cheek swabs and saliva samples. One of the most popular reasons for at-home genetic testing is to obtain information on an individual's ancestry via genealogical DNA testing and is offered by many companies such as
23andMe
23andMe Holding Co. is an American personal genomics and biotechnology company based in South San Francisco, California. It is best known for providing a direct-to-consumer genetic testing service in which customers provide a saliva testing, sali ...
,
AncestryDNA
Ancestry.com LLC is an American genealogy company based in Lehi, Utah. The largest for-profit genealogy company in the world, it operates a network of genealogical, historical records, and related genetic genealogy websites. It is owned by The ...
,
Family Tree DNA
FamilyTreeDNA is a division of Gene by Gene, a commercial genetic testing company based in Houston, Texas. FamilyTreeDNA offers analysis of autosomal DNA, Y-DNA, and mitochondrial DNA to individuals for genealogical purpose. With a database ...
, or
MyHeritage
MyHeritage is an online genealogy platform with web, mobile, and Genealogy software, software products and services, introduced by the Israeli company MyHeritage in 2003. Users of the platform can obtain their family trees, upload and browse thro ...
. Other tests are also available which provide consumers with information on genes which influence the risk of specific diseases, such as the risk of developing
late-onset Alzheimer's disease or
celiac disease
Coeliac disease (British English) or celiac disease (American English) is a long-term autoimmune disorder, primarily affecting the small intestine. Patients develop intolerance to gluten, which is present in foods such as wheat, rye, spel ...
.
Risks
Privacy Breaches
Studies have shown that genomic data is not immune to adversary attacks.
A study conducted in 2013 revealed vulnerabilities in the security of public databases that contain
genetic data.
As a result, research subjects could sometimes be identified by their DNA alone.
Although reports of premeditated breaches outside of experimental research are disputed, researchers suggest the liability is still important to study.
While accessible genomic data has been pivotal in advancing biomedical research, it also escalates the possibility of exposing sensitive information.
A common practice in genomic medicine to protect patient anonymity involves removing patient identifiers.
However, de-identified data is not subject to the same privileges as the research subjects.
Furthermore, there is an increasing ability to re-identify patients and their genetic relatives from their genetic data.
One study demonstrated re-identification by piecing together genomic data from
short tandem repeats (e.g.
CODIS),
SNPallele frequencies (e.g.
ancestry
An ancestor, also known as a forefather, fore-elder, or a forebear, is a parent or ( recursively) the parent of an antecedent (i.e., a grandparent, great-grandparent, great-great-grandparent and so forth). ''Ancestor'' is "any person from ...
testing), and
whole-genome sequencing
Whole genome sequencing (WGS), also known as full genome sequencing or just genome sequencing, is the process of determining the entirety of the DNA sequence of an organism's genome at a single time. This entails sequencing all of an organism's ...
.
They also hypothesize using a patient's genetic information, ancestry testing, and social media to identify relatives.
Other studies have echoed the risks associated with linking genomic information with public data like social media, including voter registries, web searches, and personal demographics,
or with controlled data, like personal medical records.
There is also controversy regarding the responsibility a
DNA testing company has to ensure that
leaks and breaches do not happen. Determining who legally owns the genomic data, the company or the individual, is of legal concern. There have been published examples of personal genome information being exploited, as well as indirect identification of family members.
Additional privacy concerns, related to, e.g.,
genetic discrimination
Genetic discrimination occurs when people treat others (or are treated) differently because they have or are perceived to have a gene mutation(s) that causes or increases the risk of an inherited disorder. It may also refer to any and all discr ...
, loss of anonymity, and psychological impacts, have been increasingly pointed out by the academic community
as well as government agencies.
Law Enforcement
Additionally, for criminal justice and privacy advocates, the use of genetic information in identifying suspects for criminal investigations proves worrisome under
the United States Fourth Amendment—especially when an indirect genetic link connects an individual to crime scene evidence. Since 2018, law enforcement officials have been harnessing the power of genetic data to revisit
cold case
''Cold Case'' is an American police procedural crime drama television series. It ran on CBS from September 28, 2003, to May 2, 2010. The series revolved around a fictionalized Philadelphia Police Department division that specializes in invest ...
s with DNA evidence.
[Murphy, Heather (2019-04-25). "Sooner or Later Your Cousin's DNA Is Going to Solve a Murder". ''The New York Times''. ]ISSN
An International Standard Serial Number (ISSN) is an eight-digit to uniquely identify a periodical publication (periodical), such as a magazine. The ISSN is especially helpful in distinguishing between serials with the same title. ISSNs a ...
0362-4331. Retrieved 2020-05-19. Suspects discovered through this process are not directly identified by the input of their DNA into established criminal databases, like CODIS. Instead, suspects are identified as the result of familial genetic sleuthing by law enforcement, submitting crime scene DNA evidence to genetic database services that link users whose DNA similarity indicates a family connection.
Officers can then track the newly identified suspect in person, waiting to collect discarded trash that might carry DNA in order to confirm the match.
Despite the privacy concerns of suspects and their relatives, this procedure is likely to survive Fourth Amendment scrutiny.
Much like donors of biological samples in cases of genetic research, criminal suspects do not retain property rights in abandoned waste; they can no longer assert an expectation of privacy in the discarded DNA used to confirm law enforcement suspicions, thereby eliminating their Fourth Amendment protection in that DNA.
Additionally, the genetic privacy of relatives is likely irrelevant under current caselaw since Fourth Amendment protection is “personal” to criminal defendants.
Psychological Impact
In a systematic review of perspectives toward genetic privacy, researchers highlight some of the concerns individuals hold regarding their genetic information, such as the potential dangers and effects on themselves and family members.
Academics note that participating in biomedical research or genetic testing has implications beyond the participant; it can also reveal information about genetic relatives.
The study also found that people expressed concerns as to which body controls their information and if their genetic information could be used against them.
Additionally, the
American Society of Human Genetics
The American Society of Human Genetics (ASHG), founded in 1948, is a professional membership organization for specialists in human genetics. As of 2009, the organization had approximately 8,000 members. The society's members include researchers, ...
has expressed issues about genetic tests in children. They infer that testing could lead to negative consequences for the child. For example, if a child's likelihood for adoption was influenced by genetic testing, the child might suffer from self esteem issues. A child's well-being might also suffer due to paternity testing or custody battles that require this type of information.
Regulations
When the access of genetic information is regulated, it can prevent
insurance companies
Insurance is a means of protection from financial loss in which, in exchange for a fee, a party agrees to compensate another party in the event of a certain loss, damage, or injury. It is a form of risk management, primarily used to protect ...
and employers from reaching such data. This could avoid issues of discrimination, which oftentimes leaves an individual whose information has been breached without a job or without insurance.
In the United States
Federal Regulation
In the United States, biomedical research containing human subjects is governed by a baseline standard of ethics known as
The Common Rule, which aims to protect a subject's privacy by requiring "identifiers" such as name or address to be removed from collected data.
A 2012 report by the
Presidential Commission for the Study of Bioethical Issues
The Presidential Commission for the Study of Bioethical Issues (the Bioethics Commission) was created by on November 24, 2009.s:Executive Order 13521, Executive Order 13521 - ''Establishing the Presidential Commission for the Study of Bioethical ...
stated, however, that "what constitutes 'identifiable' and 'de-identified' data is fluid and that evolving technologies and the increasing accessibility of data could allow de-identified data to become re-identified".
In fact, research has already shown that it is "possible to discover a study participant's identity by cross-referencing research data about him and his DNA sequence …
ith
The Ith () is a ridge in Germany's Central Uplands which is up to 439 m high. It lies about 40 km southwest of Hanover and, at 22 kilometers, is the longest line of crags in North Germany.
Geography
Location
The Ith is i ...
genetic genealogy and public-records databases". This has led to calls for policy-makers to establish consistent guidelines and best practices for the accessibility and usage of individual genomic data collected by researchers.
Privacy protections for genetic research participants were strengthened by provisions of the
21st Century Cures Act
The 21st Century Cures Act is a United States law enacted by the 114th United States Congress in December 2016 and then signed into law on December 13, 2016. It authorized $6.3 billion in funding, mostly for the National Institutes of Health. The ...
(H.R.34) passed on 7 December 2016 for which the American Society of Human Genetics (ASHG) commended Congress,
Senator Warren and
Senator Enzi.
The
Genetic Information Nondiscrimination Act
The Genetic Information Nondiscrimination Act of 2008 (, GINA ), is an Act of Congress in the United States designed to prohibit some types of genetic discrimination. The act bars the use of genetic information in health insurance and employm ...
of 2008 (GINA) protects the genetic privacy of the public, including research participants. The passage of GINA makes it illegal for health insurers or employers to request or require genetic information of an individual or of family members (and further prohibits the discriminatory use of such information).
This protection does not extend to other forms of insurance such as life insurance.
The
Health Insurance Portability and Accountability Act of 1996 (HIPAA) also provides some genetic privacy protections. HIPAA defines health information to include genetic information, which places restrictions on who health providers can share the information with.
State Regulation
Three kinds of laws are frequently associated with genetic privacy: those relating to informed consent and property rights, those preventing insurance discrimination, and those prohibiting employment discrimination.
According to the National Human Genome Research Institute, forty-one states have enacted genetic privacy laws as of January 2020.
However, those privacy laws vary in the scope of protection offered; while some laws "apply broadly to any person" others apply "narrowly to certain entities such as insurers, employers, or researchers."
Arizona, for example, falls in the former category and offers broad protection. Currently, Arizona's genetic privacy statutes focus on the need for informed consent to create, store, or release genetic testing results, but a pending bill would amend the state genetic privacy law framework to grant exclusive property rights in genetic information derived from genetic testing to all persons tested. In expanding privacy rights by including property rights, the bill would grant persons who undergo genetic testing greater control over their genetic information. Arizona also prohibits insurance and employment discrimination on the basis of genetic testing results.
New York State also has strong legislative measures protecting individuals from genetic discrimination. Section 79-I of the New York Civil Rights Law places strict restrictions on the usage of genetic data. The statute also outlines the proper conditions for consenting to genetic data collection or usage.
California similarly offers a broad range of protection for genetic privacy, but it stops short of granting individuals property rights in their genetic information. While currently enacted legislation focuses on prohibiting genetic discrimination in employment and insurance, a piece of pending legislation would extend genetic privacy rights to provide individuals with greater control over genetic information obtained through direct-to-consumer testing services like
23andMe
23andMe Holding Co. is an American personal genomics and biotechnology company based in South San Francisco, California. It is best known for providing a direct-to-consumer genetic testing service in which customers provide a saliva testing, sali ...
.
Florida passed House Bill 1189, a DNA privacy law that prohibits insurers from using genetic data, in July 2020.
On the other hand, Mississippi offers few genetic privacy protections beyond those required by the federal government. In the Mississippi Employment Fairness Act, the legislature recognized the applicability of the
Genetic Information Nondiscrimination Act
The Genetic Information Nondiscrimination Act of 2008 (, GINA ), is an Act of Congress in the United States designed to prohibit some types of genetic discrimination. The act bars the use of genetic information in health insurance and employm ...
, which "prohibit
discrimination on the basis of genetic information with respect to health insurance and employment."
Other
To balance data sharing with the need to protect the privacy of research subjects geneticists are considering to move more data behind controlled-access barriers, authorizing trusted users to access the data from many studies, rather than "having to obtain it piecemeal from different studies".
In October 2005,
IBM
International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
became the world's first major corporation to establish a genetics privacy policy. Its policy prohibits using employees' genetic information in employment decisions.
Breaching techniques
According to a 2014 study by Yaniv Erlich and
Arvind Narayanan
Arvind Narayanan is a computer scientist and a professor at Princeton University. Narayanan is recognized for his research in the de-anonymization of data. He is currently the director of Princeton University, Princeton's Center for Information ...
, genetic privacy breaching techniques fall into three categories:
Identity Tracing
:Here the aim is to link between an unknown genome and the concealed identity of the data originator by accumulating quasi-identifiers − residual pieces of information that are embedded in the dataset − and to gradually narrow down the possible individuals that match the combination of these quasi-identifiers.
Attribute Disclosure Attacks via DNA (ADAD)
:Here the adversary already has access to the identified
DNA sample of the target and to a database that links DNA-derived data to sensitive attributes without explicit identifiers, for example a public database of the genetic study of drug abuse. The ADAD techniques match the DNA data and associate the identity of the target with the sensitive attribute
Completion Techniques
:Here the adversary also knows the identity of a genomic dataset but has access only to a sanitized version without sensitive
loci. The aim here is to expose the sensitive loci that are not part of the original data.
However, more recent studies have indicated new avenues for breaching genetic privacy:
Phenotype Inferences
:Here, the goal is to use readily available phenotype information about an individual, such as physical features (or some combination thereof), to make genetic inferences. As genetic databases grow at unprecedented rates, providing larger and more comprehensive aggregates, the ability to make inferences with more probabilistic certainty greatly increases. Furthermore, the scope of potential inferences grows with expanding datasets.
Safeguards
According to a 2022 study by Zhiyu Wan et al., safeguards for genetic privacy fall into two categories:
Legal Safeguards
:Legal safeguards include the
Genetic Information Nondiscrimination Act of 2008, the
Health Insurance Portability and Accountability Act of 1996, the
Common Rule
The Common Rule is a 1991 rule of ethics in the United States regarding biomedical and behavioral research involving human subjects. A significant revision became effective July 2018. It governed Institutional Review Boards for oversight of human ...
, the US
National Institutes of Health
The National Institutes of Health (NIH) is the primary agency of the United States government responsible for biomedical and public health research. It was founded in 1887 and is part of the United States Department of Health and Human Service ...
(NIH) data sharing policy,
European Union
The European Union (EU) is a supranational union, supranational political union, political and economic union of Member state of the European Union, member states that are Geography of the European Union, located primarily in Europe. The u ...
’s
General Data Protection Regulation
The General Data Protection Regulation (Regulation (EU) 2016/679), abbreviated GDPR, is a European Union regulation on information privacy in the European Union (EU) and the European Economic Area (EEA). The GDPR is an important component of ...
(GDPR), US state privacy laws (e.g.,
California Consumer Privacy Act
The California Consumer Privacy Act (CCPA) is a state statute intended to enhance privacy rights and consumer protection for residents of the state of California in the United States. The bill was passed by the California State Legislature and si ...
,
California Privacy Rights Act, or
Virginia Consumer Data Protection Act), self-regulations (e.g., data use agreements,
privacy policies, or
terms of service), and
informed consent
Informed consent is an applied ethics principle that a person must have sufficient information and understanding before making decisions about accepting risk. Pertinent information may include risks and benefits of treatments, alternative treatme ...
s.
Technical Safeguards
:Technical safeguards include cryptographic tools, access control, and data perturbation approaches. Specifically, cryptographic approaches include
homomorphic encryption
Homomorphic encryption is a form of encryption that allows computations to be performed on encrypted data without first having to decrypt it. The resulting computations are left in an encrypted form which, when decrypted, result in an output th ...
,
secure multiparty computation
Secure multi-party computation (also known as secure computation, multi-party computation (MPC) or privacy-preserving computation) is a subfield of cryptography with the goal of creating methods for parties to jointly compute a function over their ...
,
trusted execution environment
A trusted execution environment (TEE) is a secure area of a Central processing unit, main processor. It helps the code and data loaded inside it be protected with respect to Information security#Confidentiality, confidentiality and integrity. Data ...
, and
Blockchain
The blockchain is a distributed ledger with growing lists of Record (computer science), records (''blocks'') that are securely linked together via Cryptographic hash function, cryptographic hashes. Each block contains a cryptographic hash of th ...
, whereas data perturbation approaches include
''k''-anonymity,
Beacon services,
differential privacy
Differential privacy (DP) is a mathematically rigorous framework for releasing statistical information about datasets while protecting the privacy of individual data subjects. It enables a data holder to share aggregate patterns of the group while ...
, and
synthetic data generation.
See also
*
*
Genetic discrimination
Genetic discrimination occurs when people treat others (or are treated) differently because they have or are perceived to have a gene mutation(s) that causes or increases the risk of an inherited disorder. It may also refer to any and all discr ...
*
Genetic exceptionalism
*
Gene theft
*
Maryland v. King
*
List of data breaches
This is a list of reports about data breaches, using data compiled from various sources, including press reports, government news releases, and mainstream news articles. The list includes those involving the theft or compromise of 30,000 or more ...
*
Comparative genomics
Comparative genomics is a branch of biological research that examines genome sequences across a spectrum of species, spanning from humans and mice to a diverse array of organisms from bacteria to chimpanzees. This large-scale holistic approach c ...
*
DNA encryption
References
Further reading
* The Right to Know and the Right Not to Know: Genetic Privacy and Responsibility,
External links
Genetic Information Privacy EFF
Medical and Genetic Privacy ACLU
The American Civil Liberties Union (ACLU) is an American nonprofit civil rights organization founded in 1920. ACLU affiliates are active in all 50 states, Washington, D.C., and Puerto Rico. The budget of the ACLU in 2024 was $383 million.
...
Genetic Privacy Laws NCSL
Information privacy
Human genetics
Privacy law
{{Improve categories, date=March 2023