Eleutherai
   HOME

TheInfoList



OR:

EleutherAI () is a grass-roots non-profit
artificial intelligence Artificial intelligence (AI) is the capability of computer, computational systems to perform tasks typically associated with human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making. It is a field of re ...
(AI) research group. The group, considered an open-source version of
OpenAI OpenAI, Inc. is an American artificial intelligence (AI) organization founded in December 2015 and headquartered in San Francisco, California. It aims to develop "safe and beneficial" artificial general intelligence (AGI), which it defines ...
, was formed in a
Discord Discord is an instant messaging and Voice over IP, VoIP social platform which allows communication through Voice over IP, voice calls, Videotelephony, video calls, text messaging, and digital media, media. Communication can be private or take ...
server in July 2020 by
Connor Leahy Connor Leahy is a German-American artificial intelligence researcher and entrepreneur known for cofounding EleutherAI and being CEO of AI safety research company Conjecture. He has warned of the existential risk from artificial general intelligenc ...
, Sid Black, and Leo Gao to organize a replication of
GPT-3 Generative Pre-trained Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network, which supersedes recurrence and convolution-based ...
. In early 2023, it formally incorporated as the EleutherAI Institute, a non-profit research institute.


History

EleutherAI began as a
Discord Discord is an instant messaging and Voice over IP, VoIP social platform which allows communication through Voice over IP, voice calls, Videotelephony, video calls, text messaging, and digital media, media. Communication can be private or take ...
server on July 7, 2020, under the tentative name "LibreAI" before rebranding to "EleutherAI" later that month, in reference to
eleutheria The Greek word "ἐλευθερία" (capitalized Ἐλευθερία; Attic Greek pronunciation: ), transliterated as eleutheria, is a Greek term for, and personification of, liberty. Eleutheria personified had a brief career on coins of Alexan ...
, the Greek word for
liberty Liberty is the state of being free within society from oppressive restrictions imposed by authority on one's way of life, behavior, or political views. The concept of liberty can vary depending on perspective and context. In the Constitutional ...
. Its founding members are Connor Leahy, Len Gao, and Sid Black. They co-wrote the code for Eleuther to serve as a collection of open source AI research, creating a machine learning model similar to
GPT-3 Generative Pre-trained Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network, which supersedes recurrence and convolution-based ...
. On December 30, 2020, EleutherAI released The Pile, a curated dataset of diverse text for training
large language model A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing tasks, especially language generation. The largest and most capable LLMs are g ...
s. While the paper referenced the existence of the GPT-Neo models, the models themselves were not released until March 21, 2021. According to a retrospective written several months later, the authors did not anticipate that "people would care so much about our 'small models. On June 9, 2021, EleutherAI followed this up with GPT-J-6B, a six billion parameter language model that was again the largest open-source GPT-3-like model in the world. These language models were released under the Apache 2.0 free software license and are considered to have "fueled an entirely new wave of startups". While EleutherAI initially turned down funding offers, preferring to use Google's TPU Research Cloud Program to source their compute, by early 2021 they had accepted funding from
CoreWeave CoreWeave, Inc. is an American AI cloud-computing startup based in Livingston, New Jersey. It specializes in providing cloud-based graphics processing unit (GPU) infrastructure to artificial intelligence developers and enterprises, and also de ...
(a small cloud computing company) and SpellML (a cloud infrastructure company) in the form of access to powerful GPU clusters that are necessary for large scale machine learning research. On Feb 10, 2022, they released GPT-NeoX-20B, a model similar to their prior work but scaled up thanks to the resources CoreWeave provided. In 2022, many EleutherAI members participated in the BigScience Research Workshop, working on projects including multitask finetuning, training
BLOOM Bloom or blooming may refer to: Science and technology Biology * Bloom, one or more flowers on a flowering plant * Algal bloom, a rapid increase or accumulation in the population of algae in an aquatic system * Jellyfish bloom, a collective n ...
, and designing evaluation libraries. Engineers at EleutherAI,
Stability AI Stability AI Ltd is a UK-based artificial intelligence company, best known for its text-to-image model Stable Diffusion. History and founding Stability AI was founded in 2019 by Emad Mostaque and by Cyrus Hodes. In August 2022 Stability AI r ...
, and
NVIDIA Nvidia Corporation ( ) is an American multinational corporation and technology company headquartered in Santa Clara, California, and incorporated in Delaware. Founded in 1993 by Jensen Huang (president and CEO), Chris Malachowsky, and Curti ...
joined forces with biologists led by
Columbia University Columbia University in the City of New York, commonly referred to as Columbia University, is a Private university, private Ivy League research university in New York City. Established in 1754 as King's College on the grounds of Trinity Churc ...
and
Harvard University Harvard University is a Private university, private Ivy League research university in Cambridge, Massachusetts, United States. Founded in 1636 and named for its first benefactor, the History of the Puritans in North America, Puritan clergyma ...
to train OpenFold, an open-source replication of DeepMind's
AlphaFold2 AlphaFold is an artificial intelligence (AI) program developed by DeepMind, a subsidiary of Alphabet Inc., Alphabet, which performs Protein structure prediction, predictions of protein structure. It is designed using deep learning techniques. Alp ...
. In early 2023, EleutherAI incorporated as a non-profit research institute run by Stella Biderman, Curtis Huebner, and Shivanshu Purohit. This announcement came with the statement that EleutherAI's shift of focus away from training larger language models was part of a deliberate push towards doing work in interpretability, alignment, and scientific research. While EleutherAI is still committed to promoting access to AI technologies, they feel that "there is substantially more interest in training and releasing LLMs than there once was," enabling them to focus on other projects. In July 2024, an investigation by Proof news found that EleutherAI's The Pile dataset includes subtitles from over 170,000
YouTube YouTube is an American social media and online video sharing platform owned by Google. YouTube was founded on February 14, 2005, by Steve Chen, Chad Hurley, and Jawed Karim who were three former employees of PayPal. Headquartered in ...
videos across more than 48,000 channels. The findings drew criticism and accusations of theft from YouTubers and others who had their work published on the platform. In 2025, Stella Biderman served as executive director. Aviya Skowron served as head of policy and ethics. Nora Belrose served as head of interpretability, and Quentin Anthony was head of HPC.


Research

According to their website, EleutherAI is a "decentralized grassroots collective of volunteer researchers, engineers, and developers focused on
AI alignment In the field of artificial intelligence (AI), alignment aims to steer AI systems toward a person's or group's intended goals, preferences, or ethical principles. An AI system is considered ''aligned'' if it advances the intended objectives. A '' ...
, scaling, and
open-source Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use and view the source code, design documents, or content of the product. The open source model is a decentrali ...
AI research". While they do not sell any of their technologies as products, they publish the results of their research in academic venues, write blog posts detailing their ideas and methodologies, and provide trained models for anyone to use for free.


The Pile

The Pile is an 886 GB dataset designed for training large language models. It was originally developed to train EleutherAI's GPT-Neo models but has become widely used to train other models, including
Microsoft Microsoft Corporation is an American multinational corporation and technology company, technology conglomerate headquartered in Redmond, Washington. Founded in 1975, the company became influential in the History of personal computers#The ear ...
's Megatron-Turing Natural Language Generation,
Meta AI Meta AI is a research division of Meta (formerly Facebook) that develops artificial intelligence and augmented reality technologies. History The foundation of laboratory was announced in 2013, under the name Facebook Artificial Intelligence ...
's Open Pre-trained Transformers,
LLaMA The llama (; or ) (''Lama glama'') is a domesticated South American camelid, widely used as a List of meat animals, meat and pack animal by Inca empire, Andean cultures since the pre-Columbian era. Llamas are social animals and live with ...
, and Galactica,
Stanford University Leland Stanford Junior University, commonly referred to as Stanford University, is a Private university, private research university in Stanford, California, United States. It was founded in 1885 by railroad magnate Leland Stanford (the eighth ...
's BioMedLM 2.7B, the
Beijing Academy of Artificial Intelligence Beijing Academy of Artificial Intelligence (BAAI) (), also known as Zhiyuan Institute, is a Chinese non-profit artificial intelligence (AI) research laboratory. BAAI conducts AI research and is dedicated to promoting collaboration among academia ...
's Chinese-Transformer-XL, and Yandex's YaLM 100B. Compared to other datasets, the Pile's main distinguishing features are that it is a curated selection of data chosen by researchers at EleutherAI to contain information they thought language models should learn and that it is the only such dataset that is thoroughly documented by the researchers who developed it.


GPT models

EleutherAI's most prominent research relates to its work to train open-source
large language model A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing tasks, especially language generation. The largest and most capable LLMs are g ...
s inspired by OpenAI's
GPT-3 Generative Pre-trained Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network, which supersedes recurrence and convolution-based ...
. EleutherAI's "GPT-Neo" model series has released 125 million, 1.3 billion, 2.7 billion, 6 billion, and 20 billion parameter models. * GPT-Neo (125M, 1.3B, 2.7B): released in March 2021, it was the largest open-source GPT-3-style language model in the world at the time of release. * GPT-J (6B): released in March 2021, it was the largest open-source GPT-3-style language model in the world at the time of release. * GPT-NeoX (20B): released in February 2022, it was the largest open-source language model in the world at the time of release. * Pythia (13B): While prior models focused on scaling larger to close the gap with closed-sourced models like GPT-3, the Pythia model suite goes in another direction. The Pythia suite was designed to facilitate scientific research on the capabilities of and learning processes in large language models. Featuring 154 partially trained model checkpoints, fully public training data, and the ability to reproduce the exact training order, Pythia enables research on verifiable training, social biases, memorization, and more.


VQGAN-CLIP

Following the release of DALL-E by OpenAI in January 2021, EleutherAI started working on Artificial intelligence art, text-to-image synthesis models. When OpenAI did not release DALL-E publicly, EleutherAI's Katherine Crowson and digital artist Ryan Murdock developed a technique for using CLIP (another model developed by OpenAI) to convert regular image generation models into text-to-image synthesis ones. Building on ideas dating back to Google's DeepDream, they found their first major success combining CLIP with another publicly available model called VQGAN and the resulting model is called VQGAN-CLIP. Crowson released the technology by tweeting Project Jupyter, notebooks demonstrating the technique that people could run for free without any special equipment. This work was credited by
Stability AI Stability AI Ltd is a UK-based artificial intelligence company, best known for its text-to-image model Stable Diffusion. History and founding Stability AI was founded in 2019 by Emad Mostaque and by Cyrus Hodes. In August 2022 Stability AI r ...
CEO Emad Mostaque as motivating the founding of Stability AI.


Public reception


Praise

EleutherAI's work to democratize GPT-3 won the UNESCO Netexplo Global Innovation Award in 2021, InfoWorld's Best of Open Source Software Award in 2021 and 2022, was nominated for VentureBeat's AI Innovation Award in 2021. Gary Marcus, a cognitive scientist and noted critic of deep learning companies such as OpenAI and DeepMind, has repeatedly praised EleutherAI's dedication to open-source and transparent research. Maximilian Gahntz, a senior policy researcher at the Mozilla Foundation, applauded EleutherAI's efforts to give more researchers the ability to audit and assess AI technology. "If models are open and if data sets are open, that'll enable much more of the critical research that's pointed out many of the flaws and harms associated with generative AI and that's often far too difficult to conduct."


Criticism

Technology journalist Kyle Wiggers has raised concerns about whether EleutherAI is as independent as it claims, or "whether the involvement of commercially motivated ventures like
Stability AI Stability AI Ltd is a UK-based artificial intelligence company, best known for its text-to-image model Stable Diffusion. History and founding Stability AI was founded in 2019 by Emad Mostaque and by Cyrus Hodes. In August 2022 Stability AI r ...
and Hugging Face—both of which are backed by substantial venture capital—might influence EleutherAI's research."


See also

*List of artificial intelligence companies


References

{{Existential risk from artificial intelligence Language modeling Artificial intelligence laboratories Deep learning Applied machine learning Open-source artificial intelligence