Paul Christiano is an American researcher in the field of

artificial intelligence Artificial intelligence (AI) is the capability of computer, computational systems to perform tasks typically associated with human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making. It is a field of re ...

(AI), with a specific focus on

AI alignment In the field of artificial intelligence (AI), alignment aims to steer AI systems toward a person's or group's intended goals, preferences, or ethical principles. An AI system is considered ''aligned'' if it advances the intended objectives. A '' ...

, which is the subfield of

AI safety AI safety is an interdisciplinary field focused on preventing accidents, misuse, or other harmful consequences arising from artificial intelligence (AI) systems. It encompasses machine ethics and AI alignment, which aim to ensure AI systems are mor ...

research that aims to steer AI systems toward human interests. He serves as the Head of Safety for the U.S. AI Safety Institute inside

NIST The National Institute of Standards and Technology (NIST) is an agency of the United States Department of Commerce whose mission is to promote American innovation and industrial competitiveness. NIST's activities are organized into physical s ...

. He formerly led the language model alignment team at

OpenAI OpenAI, Inc. is an American artificial intelligence (AI) organization founded in December 2015 and headquartered in San Francisco, California. It aims to develop "safe and beneficial" artificial general intelligence (AGI), which it defines ...

and became founder and head of the non-profit Alignment Research Center (ARC), which works on theoretical AI alignment and evaluations of

machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...

models. In 2023, Christiano was named as one of the ''TIME'' 100 Most Influential People in AI (''TIME''100 AI). In September 2023, Christiano was appointed to the UK government's Frontier AI Taskforce advisory board. Before working at the U.S. AI Safety Institute, he was an initial trustee on

Anthropic Anthropic PBC is an American artificial intelligence (AI) startup company founded in 2021. Anthropic has developed a family of large language models (LLMs) named Claude as a competitor to OpenAI's ChatGPT and Google's Gemini. According to the ...

's Long-Term Benefit Trust.

Education

Christiano attended the

Harker School The Harker School is a private, co-educational school located in San Jose, California. Founded in 1893 as Manzanita Hall, Harker has three campuses: Bucknall, Union, and Saratoga, named after the streets on which they lie. Overview The Buc ...

in San Jose, California. He competed on the U.S. team and won a silver medal at the 49th

International Mathematics Olympiad The International Mathematical Olympiad (IMO) is a mathematical olympiad for pre-university students, and is the oldest of the International Science Olympiads. It is widely regarded as the most prestigious mathematical competition in the world ...

(IMO) in 2008. In 2012, Christiano graduated from the

Massachusetts Institute of Technology The Massachusetts Institute of Technology (MIT) is a Private university, private research university in Cambridge, Massachusetts, United States. Established in 1861, MIT has played a significant role in the development of many areas of moder ...

(MIT) with a degree in mathematics. At MIT, he researched data structures, quantum cryptography, and combinatorial optimization. He then went on to complete a

PhD A Doctor of Philosophy (PhD, DPhil; or ) is a terminal degree that usually denotes the highest level of academic achievement in a given discipline and is awarded following a course of graduate study and original research. The name of the deg ...

at the

University of California, Berkeley The University of California, Berkeley (UC Berkeley, Berkeley, Cal, or California), is a Public university, public Land-grant university, land-grant research university in Berkeley, California, United States. Founded in 1868 and named after t ...

. While at Berkeley, Christiano collaborated with researcher Katja Grace on AI Impacts, co-developing a preliminary methodology for comparing supercomputers to brains, using traversed edges per second (TEPS). He also experimented with putting

Carl Shulman The Future of Humanity Institute (FHI) was an interdisciplinary research centre at the University of Oxford investigating big-picture questions about humanity and its prospects. It was founded in 2005 as part of the Faculty of Philosophy and th ...

's donor lottery theory into practice, raising nearly $50,000 in a pool to be donated to a single charity.

Career

At OpenAI, Christiano co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing

reinforcement learning from human feedback In machine learning, reinforcement learning from human feedback (RLHF) is a technique to AI alignment, align an intelligent agent with human preferences. It involves training a reward model to represent preferences, which can then be used to trai ...

(RLHF). He is considered one of the principal architects of RLHF, which in 2017 was "considered a notable step forward in AI safety research", according to ''

The New York Times ''The New York Times'' (''NYT'') is an American daily newspaper based in New York City. ''The New York Times'' covers domestic, national, and international news, and publishes opinion pieces, investigative reports, and reviews. As one of ...

''. Other works such as "AI safety via debate" (2018) focus on the problem of ''scalable oversight'' – supervising AIs in domains where humans would have difficulty judging output quality. Christiano left OpenAI in 2021 to work on more conceptual and theoretical issues in AI alignment and subsequently founded the Alignment Research Center to focus on this area. One subject of study is the problem of ''eliciting latent knowledge'' from advanced machine learning models''.'' ARC also develops techniques to identify and test whether an AI model is potentially dangerous. In April 2023, Christiano told ''

The Economist ''The Economist'' is a British newspaper published weekly in printed magazine format and daily on Electronic publishing, digital platforms. It publishes stories on topics that include economics, business, geopolitics, technology and culture. M ...

'' that ARC was considering developing an industry standard for AI safety. As of April 2024, Christiano was listed as the head of AI safety for the US AI Safety Institute at

. One month earlier in March 2024, staff members and scientists at the institute threatened to resign upon being informed of Christiano's pending appointment to the role, stating that his ties to the

effective altruism Effective altruism (EA) is a 21st-century philosophical and social movement that advocates impartially calculating benefits and prioritizing causes to provide the greatest good. It is motivated by "using evidence and reason to figure out how to b ...

movement may jeopardize the AI Safety Institute's objectivity and integrity.

Views on AI risks

He is known for his views on the potential risks of advanced AI. In 2017, ''Wired'' magazine stated that Christiano and his colleagues at OpenAI weren't worried about the destruction of the human race by "evil robots", explaining that " ey’re more concerned that, as AI progresses beyond human comprehension, the technology’s behavior may diverge from our intended goals." However, in a widely quoted interview with ''

Business Insider ''Business Insider'' (stylized in all caps: BUSINESS INSIDER; known from 2021 to 2023 as INSIDER) is a New York City–based multinational financial and business news website founded in 2007. Since 2015, a majority stake in ''Business Inside ...

'' in 2023, Christiano said that there is a “10–20% chance of AI takeover,

ith The Ith () is a ridge in Germany's Central Uplands which is up to 439 m high. It lies about 40 km southwest of Hanover and, at 22 kilometers, is the longest line of crags in North Germany. Geography Location The Ith is i ...

many rmost humans dead.” He also conjectured a “50/50 chance of doom shortly after you have AI systems that are human level.”

Personal life

Christiano is married to Ajeya Cotra of

Open Philanthropy Open Philanthropy is an American philanthropic advising and funding organization focused on cost-effective, high-impact giving. Its current CEO is Alexander Berger. As of June 2025, Open Philanthropy has directed more than $4 billion in gran ...

References

External links

Personal website

Paul Christiano's writings
on

LessWrong ''LessWrong'' (also written ''Less Wrong'') is a community blog and Internet forum, forum focused on discussion of cognitive biases, philosophy, psychology, economics, rationality, and artificial intelligence, among other topics. It is associa ...

{{DEFAULTSORT:Christiano, Paul American theoretical computer scientists Year of birth missing (living people) Living people Place of birth missing (living people) Nationality missing Massachusetts Institute of Technology alumni University of California, Berkeley alumni OpenAI people Machine learning researchers AI safety scientists