Margaret Dayhoff
   HOME

TheInfoList



OR:

Margaret Belle (Oakley) Dayhoff (March 11, 1925 – February 5, 1983) was an American
Biophysicist Biophysics is an interdisciplinary science that applies approaches and methods traditionally used in physics to study biological phenomena. Biophysics covers all scales of biological organization, from molecular to organismic and populations ...
and a pioneer in the field of
bioinformatics Bioinformatics () is an interdisciplinary field of science that develops methods and Bioinformatics software, software tools for understanding biological data, especially when the data sets are large and complex. Bioinformatics uses biology, ...
. Dayhoff was a professor at
Georgetown University Medical Center Georgetown University Medical Center is a Washington, D.C.–based biomedical research and educational organization affiliated with Georgetown University that is responsible for over 80% of the university's sponsored research funding and is led ...
and a noted research biochemist at the National Biomedical Research Foundation, where she pioneered the application of mathematics and computational methods to the field of biochemistry. She dedicated her career to applying the evolving computational technologies to support advances in biology and medicine, most notably the creation of protein and nucleic acid databases and tools to interrogate the databases. She originated one of the first substitution matrices,
point accepted mutation A point accepted mutation — also known as a PAM — is the replacement of a single amino acid in the primary structure of a protein with another single amino acid, which is accepted by the processes of natural selection. This definition does no ...
s (''PAM''). The one-letter code used for amino acids was developed by her, reflecting an attempt to reduce the size of the data files used to describe amino acid sequences in an era of punch-card computing. Her PhD degree was from
Columbia University Columbia University in the City of New York, commonly referred to as Columbia University, is a Private university, private Ivy League research university in New York City. Established in 1754 as King's College on the grounds of Trinity Churc ...
in the department of chemistry, where she devised computational methods to calculate molecular resonance energies of several organic compounds. She did postdoctoral studies at the Rockefeller Institute (now
Rockefeller University The Rockefeller University is a Private university, private Medical research, biomedical Research university, research and graduate-only university in New York City, New York. It focuses primarily on the biological and medical sciences and pro ...
) and the
University of Maryland The University of Maryland, College Park (University of Maryland, UMD, or simply Maryland) is a public land-grant research university in College Park, Maryland, United States. Founded in 1856, UMD is the flagship institution of the Univ ...
, and joined the newly established National Biomedical Research Foundation in 1959. She was the first woman to hold office in the
Biophysical Society The Biophysical Society is an international scientific society whose purpose is to lead the development and dissemination of knowledge in biophysics. Founded in 1958, the Society currently consists of over 7,000 members in academia, government, an ...
and the first person to serve as both secretary and eventually president.


Early life

Dayhoff was born an only child in
Philadelphia Philadelphia ( ), colloquially referred to as Philly, is the List of municipalities in Pennsylvania, most populous city in the U.S. state of Pennsylvania and the List of United States cities by population, sixth-most populous city in the Unit ...
, but moved to
New York City New York, often called New York City (NYC), is the most populous city in the United States, located at the southern tip of New York State on one of the world's largest natural harbors. The city comprises five boroughs, each coextensive w ...
when she was ten. Her academic promise was evident from the outset – she was
valedictorian Valedictorian is an academic title for the class rank, highest-performing student of a graduation, graduating class of an academic institution in the United States. The valedictorian is generally determined by an academic institution's grade poin ...
(class of 1942) at Bayside High School,
Bayside, New York Bayside is a neighborhood located in the New York City Borough (New York City), borough of Queens. It is bounded by Whitestone, Queens, Whitestone to the northwest, the Long Island Sound and Little Neck Bay to the northeast, Douglaston, Queens, ...
, and from there received a scholarship to
Washington Square College The New York University College of Arts and Science (CAS) is the primary liberal arts college of New York University (NYU). The school is located near Gould Plaza next to the Courant Institute of Mathematical Sciences and the Stern School of Bus ...
of
New York University New York University (NYU) is a private university, private research university in New York City, New York, United States. Chartered in 1831 by the New York State Legislature, NYU was founded in 1832 by Albert Gallatin as a Nondenominational ...
, graduating magna cum laude in mathematics in 1945 and getting elected to
Phi Beta Kappa The Phi Beta Kappa Society () is the oldest academic honor society in the United States. It was founded in 1776 at the College of William & Mary in Virginia. Phi Beta Kappa aims to promote and advocate excellence in the liberal arts and sciences, ...
.


Research

Dayhoff began a PhD in quantum chemistry under George Kimball in the
Columbia University Columbia University in the City of New York, commonly referred to as Columbia University, is a Private university, private Ivy League research university in New York City. Established in 1754 as King's College on the grounds of Trinity Churc ...
Department of Chemistry. In her graduate thesis, Dayhoff pioneered the use of computer capabilities – i.e. mass-data processing – to theoretical chemistry; specifically, she devised a method of applying punched-card business machines to calculate the resonance energies of several polycyclic organic molecules. Her management of her research data was so impressive that she was awarded a Watson Computing Laboratory Fellowship. As part of this award, she received access to "cutting-edge IBM electronic data processing equipment" at the lab. After completing her PhD, Dayhoff studied
electrochemistry Electrochemistry is the branch of physical chemistry concerned with the relationship between Electric potential, electrical potential difference and identifiable chemical change. These reactions involve Electron, electrons moving via an electronic ...
under Duncan A. MacInnes at the Rockefeller Institute from 1948 to 1951. In 1952, she moved to Maryland with her family and later received research fellowships from the
University of Maryland The University of Maryland, College Park (University of Maryland, UMD, or simply Maryland) is a public land-grant research university in College Park, Maryland, United States. Founded in 1856, UMD is the flagship institution of the Univ ...
(1957–1959), working on a model of
chemical bonding A chemical bond is the association of atoms or ions to form molecules, crystals, and other structures. The bond may result from the electrostatic force between oppositely charged ions as in ionic bonds or through the sharing of electrons as in ...
with Ellis Lippincott. At Maryland, she gained her first exposure to a new high-speed computer, the IBM model 7094. After this ended, she joined the National Biomedical Research Foundation in 1960 as associate director (a position she held for 21 years). At the NBRF, she began to work with Robert Ledley, a dentist who had obtained a degree in physics and become interested in the possibilities of applying computational resources to biomedical problems. He had authored one of the earliest studies of biomedical computation, "Report on the Use of Computer in Biology and Medicine." With their combined expertise, they published a paper in 1962 entitled "COMPROTEIN: A computer program to aid primary protein structure determination" that described a "completed computer program for the
IBM 7090 The IBM 7090 is a second-generation Transistor computer, transistorized version of the earlier IBM 709 vacuum tube mainframe computer that was designed for "large-scale scientific and technological applications". The 7090 is the fourth member o ...
" that aimed to convert peptide digests to protein chain data. They actually began this work in 1958, but were not able to start programming until late 1960. In the early 1960s, Dayhoff also collaborated with Ellis Lippincott and
Carl Sagan Carl Edward Sagan (; ; November 9, 1934December 20, 1996) was an American astronomer, planetary scientist and science communicator. His best known scientific contribution is his research on the possibility of extraterrestrial life, including e ...
to develop thermodynamic models of cosmo-chemical systems, including prebiological planetary atmospheres. She developed a computer program that could calculate equilibrium concentrations of the gases in a planetary atmosphere, enabling the study of the atmospheres of Venus, Jupiter, and Mars, in addition to the present day atmosphere and the primordial terrestrial atmosphere. Using this program, she considered whether the primordial atmosphere had the conditions necessary to generate life. Although she found that numerous small biologically important compounds can appear with no special nonequilibrium mechanism to explain their presence, there were compounds necessary to life that were scarce in the equilibrium model (such as ribose, adenine, and cytosine). Dayhoff also taught physiology and biophysics at
Georgetown University Medical Center Georgetown University Medical Center is a Washington, D.C.–based biomedical research and educational organization affiliated with Georgetown University that is responsible for over 80% of the university's sponsored research funding and is led ...
for 13 years, served as a Fellow of the
American Association for the Advancement of Science The American Association for the Advancement of Science (AAAS) is a United States–based international nonprofit with the stated mission of promoting cooperation among scientists, defending scientific freedom, encouraging scientific responsib ...
and was elected councillor of the International Society for the Study of the Origins of Life in 1980 after 8 years of membership. Dayhoff also served on the editorial boards of three journals: ''
DNA Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
'', ''
Journal of Molecular Evolution The ''Journal of Molecular Evolution'' is a monthly peer-reviewed scientific journal that covers molecular evolution. It is published by Springer Science+Business Media and was established in 1971. The founding editor was Emile Zuckerkandl, who re ...
'' and '' Computers in Biology and Medicine''. In 1966, Dayhoff pioneered the use of computers in comparing protein sequences and reconstructing their evolutionary histories from
sequence alignment In bioinformatics, a sequence alignment is a way of arranging the sequences of DNA, RNA, or protein to identify regions of similarity that may be a consequence of functional, structural biology, structural, or evolutionary relationships between ...
s. To perform this work, she created the single-letter
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
code to minimize the data file size for each sequence. This work, co-authored with Richard Eck, was the first application of computers to infer phylogenies from molecular sequences. It was the first reconstruction of a
phylogeny A phylogenetic tree or phylogeny is a graphical representation which shows the evolutionary history between a set of species or Taxon, taxa during a specific time.Felsenstein J. (2004). ''Inferring Phylogenies'' Sinauer Associates: Sunderland, M ...
(
evolutionary tree A phylogenetic tree or phylogeny is a graphical representation which shows the evolutionary history between a set of species or taxa during a specific time.Felsenstein J. (2004). ''Inferring Phylogenies'' Sinauer Associates: Sunderland, MA. In o ...
) by computers from molecular sequences using a
maximum parsimony In phylogenetics and computational phylogenetics, maximum parsimony is an optimality criterion under which the phylogenetic tree that minimizes the total number of character-state changes (or minimizes the cost of differentially weighted charact ...
method. In later years, she applied these methods to study a number of molecular relationships, such as the catalytic chain and bovine cyclic AMP-dependent protein kinase and the src gene product of Rous avian and Moloney murine sarcoma viruses; antithrombin-III, alpha-antitrypsin, and ovalbumin; epidermal growth factor and the light chain of coagulation factor X; and apolipoproteins A-I, A-II, C-I and C-III. Based on this work, Dayhoff and her coworkers developed a set of substitution matrices called the PAM (Accepted Point Mutation), MDM (Mutation Data Matrix), or Dayhoff Matrix. They are derived from global alignments of closely related protein sequences. The identification number included with the matrix (ex. PAM40, PAM100) refers to the evolutionary distance; greater numbers correspond to greater distances. Matrices using greater evolutionary distances are extrapolated from those used for lesser ones. To produce a Dayhoff matrix, pairs of aligned amino acids in verified alignments are used to build a count matrix, which is then used to estimate at mutation matrix at 1 PAM (considered an evolutionary unit). From this mutation matrix, a Dayhoff scoring matrix may be constructed. Along with a model of indel events, alignments generated by these methods can be used in an iterative process to construct new count matrices until convergence. One of Dayhoff's most important contributions to bioinformatics was her ''Atlas of Protein Sequence and Structure'', a book reporting all known protein sequences (totaling 65) that she published in 1965. This book published a degenerate encoding of amino acids. It was subsequently republished in several editions. This led to the
Protein Information Resource The Protein Information Resource (PIR), located at Georgetown University Medical Center, is an integrated public bioinformatics resource to support genomic and proteomic research, and scientific studies. It contains protein sequences databases H ...
database of protein sequences, the first online database system that could be accessed by telephone line and available for interrogation by remote computers. The book has since been cited nearly 4,500 times. It and the parallel effort by
Walter Goad Walter Goad (1925–2000) was a nuclear physicist at the Los Alamos National Laboratory. During the 1960s, Goad turned his attention from physics to biology and he is best known for his contributions to the founding of GenBank, the most widely u ...
which led to the
GenBank The GenBank sequence database is an open access, annotated collection of all publicly available nucleotide sequences and their protein translations. It is produced and maintained by the National Center for Biotechnology Information (NCBI; a par ...
database of nucleic acid sequences are the twin origins of the modern databases of molecular sequences. The ''Atlas'' was organized by
gene families A gene family is a set of several similar genes, formed by duplication of a single original gene, and generally with similar biochemical functions. One such family are the genes for human hemoglobin subunits; the ten genes are in two clusters on ...
, and she is regarded as a pioneer in their recognition.
Frederick Sanger Frederick Sanger (; 13 August 1918 – 19 November 2013) was a British biochemist who received the Nobel Prize in Chemistry twice. He won the 1958 Chemistry Prize for determining the amino acid sequence of insulin and numerous other prote ...
's determination of the first complete amino acid sequence of a protein (insulin) in 1955, led a number of researchers to sequence various proteins from different species. In the early 1960s, a theory was developed that small differences between homologous protein sequences (sequences with a high likelihood of common ancestry) could indicate the process and rate of evolutionary change on the molecular level. The notion that such molecular analysis could help scientists decode evolutionary patterns in organisms was formalized in the published papers of
Emile Zuckerkandl Émile Zuckerkandl (July 4, 1922 – November 9, 2013) was an Austrian-born French biologist considered one of the founders of the field of molecular evolution. He introduced, with Linus Pauling, the concept of the " molecular clock", which ena ...
and
Linus Pauling Linus Carl Pauling ( ; February 28, 1901August 19, 1994) was an American chemist and peace activist. He published more than 1,200 papers and books, of which about 850 dealt with scientific topics. ''New Scientist'' called him one of the 20 gre ...
in 1962 and 1965.


Table of Dayhoff's encoding of amino acids

: * Dayhoff and Eck originally proposed O for tyrosine. The one letter code was adopted by
IUPAC The International Union of Pure and Applied Chemistry (IUPAC ) is an international federation of National Adhering Organizations working for the advancement of the chemical sciences, especially by developing nomenclature and terminology. It is ...
and remains in general use. Dayhoff's ambiguous one-letter code has been superseded.


Marriage and family

Dayhoff's husband was Edward S. Dayhoff, an experimental physicist who worked with magnetic resonance and with lasers. They had two daughters who are also academics, Ruth and Judith. Judith Dayhoff has a Mathematical Biophysics PhD from
University of Pennsylvania The University of Pennsylvania (Penn or UPenn) is a Private university, private Ivy League research university in Philadelphia, Pennsylvania, United States. One of nine colonial colleges, it was chartered in 1755 through the efforts of f ...
and is the author of ''Neural network architectures: An introduction'' and coauthor of ''Neural Networks and Pattern Recognition''.
Ruth Dayhoff Ruth Dayhoff (born 1952) is an American physician and medical bioinformatician. Early life Dayhoff is the daughter of Margaret Oakley Dayhoff, an early bioinformatician, and Edward S Dayhoff, a distinguished Physicist in the area of Electro-opti ...
graduated summa cum laude in Mathematics from the
University of Maryland The University of Maryland, College Park (University of Maryland, UMD, or simply Maryland) is a public land-grant research university in College Park, Maryland, United States. Founded in 1856, UMD is the flagship institution of the Univ ...
and focused on
Medical Informatics Health informatics combines communications, information technology (IT), and health care to enhance patient care and is at the forefront of the medical technological revolution. It can be viewed as a branch of engineering and applied science. ...
while doing her MD at
Georgetown University School of Medicine Georgetown University School of Medicine, a medical school opened in 1851, is one of Georgetown University's five graduate schools, and is the most applied-to medical school in the nation with a matriculation rate of 1.40%. It is located on Res ...
. During medical school, she co-authored a paper and a chapter in ''The Atlas of Protein Sequence and Structure'' with her mother, describing a new way to measure how closely proteins are related. Her husband Vincent Brannigan is Professor Emeritus of Law and Technology at the University of Maryland School of Engineering. Ruth was a founding Fellow of the
American College of Medical Informatics The American College of Medical Informatics (ACMI) is a college of elected fellows from the United States and abroad who have made significant and sustained contributions to the field of medical informatics. Initially incorporated in 1984, the o ...
. She pioneered the integration of
Medical Imaging Medical imaging is the technique and process of imaging the interior of a body for clinical analysis and medical intervention, as well as visual representation of the function of some organs or tissues (physiology). Medical imaging seeks to revea ...
and invented the Vista Imaging System. She was chosen for the
National Library of Medicine The United States National Library of Medicine (NLM), operated by the United States federal government, is the world's largest medical library. Located in Bethesda, Maryland, the NLM is an institute within the National Institutes of Health. I ...
's project on the 200 women Physicians who "changed the face of medicine." She serves as director of Digital Imaging in Medicine for the
United States Department of Veteran's Affairs The United States Department of Veterans Affairs (VA) is a Cabinet-level executive branch department of the federal government charged with providing lifelong healthcare services to eligible military veterans at the 170 VA medical centers an ...
.


Later life

Dayhoff's ''Atlas'' became a template for many indispensable tools in large portions of DNA or protein-related biomedical research. In spite of this significant contribution, Dayhoff was marginalized by the community of sequencers. The contract to manage
GenBank The GenBank sequence database is an open access, annotated collection of all publicly available nucleotide sequences and their protein translations. It is produced and maintained by the National Center for Biotechnology Information (NCBI; a par ...
(a technology directly related to her research), awarded in 1983 by the NIH, went to
Walter Goad Walter Goad (1925–2000) was a nuclear physicist at the Los Alamos National Laboratory. During the 1960s, Goad turned his attention from physics to biology and he is best known for his contributions to the founding of GenBank, the most widely u ...
at the
Los Alamos National Laboratory Los Alamos National Laboratory (often shortened as Los Alamos and LANL) is one of the sixteen research and development Laboratory, laboratories of the United States Department of Energy National Laboratories, United States Department of Energy ...
. The reason for this attitude was unknown, with theories ranging from sexism to a clash of values with the experimental science community. Despite the success of Dayhoff's ''Atlas'', experimental scientists and researchers considered their sequence information very valuable and were often reluctant to submit it to such a publicly available database. During the last few years of her life, she focused on obtaining stable, adequate, long-term funding to support the maintenance and further development of her
Protein Information Resource The Protein Information Resource (PIR), located at Georgetown University Medical Center, is an integrated public bioinformatics resource to support genomic and proteomic research, and scientific studies. It contains protein sequences databases H ...
. She envisioned an online system of computer programs and databases, accessible by scientists all over the world, for identifying protein from sequence or amino acid composition data, for making predictions based on sequences, and for browsing the known information. Less than a week before she died, she submitted a proposal to the Division of Research Resources at
NIH The National Institutes of Health (NIH) is the primary agency of the United States government responsible for biomedical and public health research. It was founded in 1887 and is part of the United States Department of Health and Human Service ...
for a Protein Identification Resource. After her death, her colleagues worked to make her vision a reality, and the protein database was fully operational by the middle of 1984. Dayhoff died of a heart attack at the age of 57 on February 5, 1983. A fund was established after her death in 1984 to endow the Margaret O. Dayhoff Award, one of the top national honors in biophysics. The award is presented to a woman who "holds very high promise or has achieved prominence while developing the early stages of a career in biophysical research within the purview and interest of the Biophysical Society." It is presented at the annual meeting of the Biophysical Society and includes an honorarium of $2,000. She was survived by her husband, Edward S. Dayhoff of Silver Spring; two daughters, Ruth E. Dayhoff Brannigan of College Park, and Judith E. Dayhoff of Silver Spring, and her father, Kenneth W. Oakley of Silver Spring.


Legacy

David Lipman David J. Lipman is an American biologist who from 1989 to 2017 was the director of the National Center for Biotechnology Information (NCBI) at the National Institutes of Health. NCBI is the home of GenBank, the U.S. node of the INSDC, Internationa ...
, director of the
National Center for Biotechnology Information The National Center for Biotechnology Information (NCBI) is part of the National Library of Medicine (NLM), a branch of the National Institutes of Health (NIH). It is approved and funded by the government of the United States. The NCBI is lo ...
, has called Dayhoff the "mother and father of bioinformatics". Her seminal contributions as the mother of the science of bioinformatics, now routinely used as part of the process for naming bacteria, were acknowledged with a bacterium being named after her in 2020, ''Enemella dayhoffiae''.


References


External links


Picture of Margaret Oakley Dayhoff, c. 1980.
Owned by her daughter Ruth E. Dayhoff, M.D. Made available by the National Library of Medicine.
Profile and photographs of Margaret O. Dayhoff in Grandma got STEM project.
Information submitted to the project by Margaret Dayhoff's son-in-law Vincent, husband of Ruth E. Dayhoff. Also contains biographical information about descendants. *Baby Joseph and Vrundha M. Nair .2012 Woman Innovator in Bioinformatics: Dr. Margaret Oakley Dayhoff. Adv Bio Tech:12 (01) 32–34 {{DEFAULTSORT:Dayhoff, Margaret Oakley 1925 births 1983 deaths Columbia Graduate School of Arts and Sciences alumni American bioinformaticians American women biochemists American physical chemists American women computer scientists American women evolutionary biologists American evolutionary biologists American computational chemists 20th-century American women scientists New York University alumni Georgetown University faculty Presidents of the Biophysical Society 20th-century American chemists