Generative artificial intelligence (generative AI, GenAI, or GAI) is a subset of artificial intelligence that uses generative models to produce text, images, videos, or other forms of data. These models learn the underlying patterns and structures of their training data and use them to produce new data based on the input, which often comes in the form of natural language prompts.
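As a rough illustration of the idea of learning patterns from training data and then producing new data from a prompt, the following toy sketch builds a word-level bigram model in Python. It is not how modern generative AI systems are implemented; the function names (`train_bigram_model`, `generate`), the training sentence, and the fixed random seed are all assumptions chosen for this example.

```python
import random
from collections import defaultdict


def train_bigram_model(text):
    """Record, for each word, the list of words that follow it in the training text."""
    words = text.split()
    followers = defaultdict(list)
    for current, following in zip(words, words[1:]):
        followers[current].append(following)
    return dict(followers)


def generate(model, prompt, max_new_words=8, seed=0):
    """Extend the prompt by repeatedly sampling a word seen after the last word."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same text
    output = prompt.split()
    if not output:
        return ""
    for _ in range(max_new_words):
        candidates = model.get(output[-1])
        if not candidates:
            break  # the last word never had a follower in the training text
        output.append(rng.choice(candidates))
    return " ".join(output)


# Hypothetical training data, used only for this illustration.
training_text = "the cat sat on the mat and the dog sat on the rug"
model = train_bigram_model(training_text)
print(generate(model, "the dog"))
```

The "patterns" here are only which word follows which, so the output is a recombination of word pairs seen in training. Large models replace this lookup table with a neural network trained on far more data, but the overall loop of conditioning on a prompt and sampling a continuation is loosely similar.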
Improvements in transformer-based deep neural networks, particularly large language models (LLMs), enabled an AI boom of generative AI systems in the early 2020s. These include
chatbots such as ChatGPT, Copilot, Gemini, and LLaMA; text-to-image generation systems such as Stable Diffusion, Midjourney, and DALL-E; and text-to-video AI generators such as Sora.
Companies such as OpenAI,