Gemini is a family of
multimodal large language model
s (LLMs) developed by
Google DeepMind
, and the successor to
LaMDA
and
PaLM 2. Comprising Gemini Ultra, Gemini Pro, Gemini Flash, and Gemini Nano, it was announced on December 6, 2023, positioned as a competitor to
OpenAI
's
GPT-4
. It powers the
chatbot
of the same name. In March 2025, Gemini 2.5 Pro Experimental was rated as highly competitive.
History
Development
Google
announced Gemini, a
large language model
(LLM) developed by subsidiary
Google DeepMind
, during the
Google I/O
keynote on May 10, 2023. It was positioned as a more powerful successor to
PaLM 2, which was also unveiled at the event, with Google CEO
Sundar Pichai
stating that Gemini was still in its early developmental stages.
Gemini was said to differ from other LLMs in that it was not trained on a
text corpus
alone and was designed to be
multimodal, meaning it could process multiple types of data simultaneously, including text, images, audio, video, and
computer code
.
It had been developed as a collaboration between DeepMind and
Google Brain
, two branches of Google that had been merged as Google DeepMind the previous month.
In an interview with ''
Wired
'', DeepMind CEO
Demis Hassabis
touted Gemini's advanced capabilities, which he believed would allow the algorithm to trump
OpenAI
's
ChatGPT
, which runs on
GPT-4
and whose growing popularity had been aggressively challenged by Google with
LaMDA
and
Bard
. Hassabis highlighted the strengths of DeepMind's
AlphaGo
program, which gained worldwide attention in 2016 when it defeated
Go champion
Lee Sedol
, saying that Gemini would combine the power of AlphaGo and other Google–DeepMind LLMs.
In August 2023, ''
The Information'' published a report outlining Google's roadmap for Gemini, revealing that the company was targeting a launch date of late 2023. According to the report, Google hoped to surpass OpenAI and other competitors by combining conversational text capabilities present in most LLMs with
artificial intelligence
–powered image generation, allowing it to create contextual images and be adapted for a wider range of
use cases
.
As with Bard,
Google co-founder
Sergey Brin
was summoned out of retirement to assist in the development of Gemini, along with hundreds of other engineers from Google Brain and DeepMind;
he was later credited as a "core contributor" to Gemini.
Because Gemini was being trained on transcripts of
YouTube
videos, lawyers were brought in to filter out any potentially copyrighted materials.
With news of Gemini's impending launch, OpenAI hastened its work on integrating GPT-4 with multimodal features similar to those of Gemini.
''The Information'' reported in September that several companies had been granted early access to "an early version" of the LLM, which Google intended to make available to clients through
Google Cloud
's Vertex AI service. The publication also stated that Google was arming Gemini to compete with both GPT-4 and
Microsoft
's
GitHub Copilot
.
Launch
On December 6, 2023, Pichai and Hassabis announced "Gemini 1.0" at a virtual press conference.
It comprised three models: Gemini Ultra, designed for "highly complex tasks"; Gemini Pro, designed for "a wide range of tasks"; and Gemini Nano, designed for "on-device tasks". At launch, Gemini Pro and Nano were integrated into Bard and the
Pixel 8 Pro smartphone, respectively, while Gemini Ultra was set to power "Bard Advanced" and become available to software developers in early 2024. Other products that Google intended to incorporate Gemini into included
Search
,
Ads,
Chrome, Duet AI on
Google Workspace
, and
AlphaCode 2.
It was made available only in English.
Gemini was touted as Google's "largest and most capable AI model" and designed to emulate human behavior;
the company stated that it would not be made widely available until the following year due to the need for "extensive safety testing".
Gemini was trained on and powered by Google's
Tensor Processing Units (TPUs),
and the name is in reference to the DeepMind–Google Brain merger as well as
NASA
's
Project Gemini
.
Gemini Ultra was said to have outperformed GPT-4,
Anthropic
's
Claude 2,
Inflection AI's Inflection-2,
Meta's
LLaMA 2
, and
xAI's
Grok 1 on a variety of industry benchmarks,
while Gemini Pro was said to have outperformed
GPT-3.5.
Gemini Ultra was also the first language model to outperform human experts on the 57-subject
Massive Multitask Language Understanding
(MMLU) test, obtaining a score of 90%.
Gemini Pro was made available to Google Cloud customers on AI Studio and Vertex AI on December 13, while Gemini Nano was to be made available to
Android developers as well.
Hassabis further revealed that DeepMind was exploring how Gemini could be "combined with robotics to physically interact with the world".
In accordance with
an executive order signed by U.S. President
Joe Biden
in October, Google stated that it would share testing results of Gemini Ultra with the
federal government of the United States
. Similarly, the company was engaged in discussions with the
government of the United Kingdom
to comply with the principles laid out at the
AI Safety Summit at
Bletchley Park
in November.
Updates
Google partnered with
Samsung
to integrate Gemini Nano and Gemini Pro into its
Galaxy S24 smartphone lineup in January 2024.
The following month, Bard and Duet AI were unified under the Gemini brand,
with "Gemini Advanced with Ultra 1.0" debuting via a new "AI Premium" tier of the
Google One
subscription service.
Gemini Pro also received a global launch.
In February 2024, Google launched Gemini 1.5 in a limited capacity, positioned as a more powerful and capable model than 1.0 Ultra.
This "step change" was achieved through various technical advancements, including a new architecture, a
mixture-of-experts approach, and a larger one-million-token
context window, which equates to roughly an hour of silent video, 11 hours of audio, 30,000 lines of code, or 700,000 words.
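The equivalences quoted for the one-million-token window imply rough per-unit token costs. The sketch below simply divides the token budget by each quoted quantity; the resulting rates are back-of-the-envelope figures derived from the numbers above, not published tokenizer specifications, and all names in the code are illustrative.

```python
# Back-of-the-envelope token rates implied by the quoted equivalences.
# Inputs are the figures quoted in the text; the rates are derived, not official.
CONTEXT_TOKENS = 1_000_000

quoted = {
    "video_seconds": 1 * 3600,    # roughly one hour of video
    "audio_seconds": 11 * 3600,   # 11 hours of audio
    "code_lines": 30_000,
    "words": 700_000,
}


def implied_tokens_per_unit(context_tokens, quantities):
    """Return tokens per unit, assuming each quantity alone fills the window."""
    return {name: context_tokens / amount for name, amount in quantities.items()}


rates = implied_tokens_per_unit(CONTEXT_TOKENS, quoted)
for name, rate in rates.items():
    print(f"{name}: about {rate:.2f} tokens per unit")
```

Under these assumptions a word costs a little under one and a half tokens, while a second of video costs roughly eleven times as many tokens as a second of audio.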
The same month, Google debuted Gemma, a family of
free and open-source
LLMs that serve as a lightweight version of Gemini. They come in two sizes, with two billion and seven billion parameters, respectively. Multiple publications viewed this as a response to Meta and others open-sourcing their AI models, and a stark reversal from Google's longstanding practice of keeping its AI proprietary.
Google announced an additional model, Gemini 1.5 Flash, on May 14 at the 2024 I/O keynote.
Gemma 2 was released on June 27, 2024.
Two updated Gemini models, Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002, were released on September 24, 2024.
On December 11, 2024, Google announced Gemini 2.0 Flash Experimental, a significant update to its Gemini AI model. This iteration boasts improved speed and performance over its predecessor, Gemini 1.5 Flash. Key features include a Multimodal Live API for real-time audio and video interactions, enhanced spatial understanding, native image and controllable text-to-speech generation (with watermarking), and integrated tool use, including Google Search. It also introduces improved agentic capabilities, a new Google Gen AI SDK, and "Jules," an experimental AI coding agent for GitHub. Additionally, Google Colab is integrating Gemini 2.0 to generate data science notebooks from natural language. Gemini 2.0 was available through the Gemini chat interface for all users as "Gemini 2.0 Flash experimental".
On January 30, 2025, Google released Gemini 2.0 Flash as the new default model, with Gemini 1.5 Flash still available for usage. This was followed by the release of Gemini 2.0 Pro on February 5, 2025. Additionally, Google released Gemini 2.0 Flash Thinking Experimental, which details the language model's thinking process when responding to prompts.
Gemma 3 was released on March 12, 2025. The next day, Google announced that Gemini in
Android Studio
would be able to understand simple UI
mockups and transform them into working
Jetpack Compose code.
Gemini 2.5 Pro Experimental was released on March 25, 2025, described by Google as its most intelligent AI model yet, featuring enhanced reasoning and coding capabilities,
and a "thinking model" capable of reasoning through steps before responding, using techniques like
chain-of-thought prompting,
whilst maintaining native
multimodality
and launching with a 1 million token context window.
At Google I/O 2025, Google announced significant updates to its Gemini core models.
Gemini 2.5 Flash became the default model, delivering faster responses.
Gemini 2.5 Pro was introduced as the most advanced Gemini model, featuring reasoning, coding capabilities, and the new Deep Think mode for complex tasks. Both 2.5 Pro and Flash support native audio output and improved security.
General availability for Gemini 2.5 Pro and Flash is scheduled for June 2025.
Model versions
The following table lists the main model versions of Gemini, describing the significant changes included with each version:
Technical specifications
The first generation of Gemini ("Gemini 1") has three models, with the same architecture. They are decoder-only
transformers
, with modifications to allow efficient training and inference on TPUs. They have a context length of 32,768 tokens, with
multi-query attention. Two versions of Gemini Nano, Nano-1 (1.8 billion parameters) and Nano-2 (3.25 billion parameters), are
distilled
from larger Gemini models, designed for use by edge devices such as smartphones. As Gemini is multimodal, each context window can contain multiple forms of input. The different modes can be interleaved and do not have to be presented in a fixed order, allowing for a multimodal conversation. For example, the user might open the conversation with a mix of text, picture, video, and audio, presented in any order, and Gemini might reply with the same free ordering. Input images may be of different
resolutions, while video is input as a sequence of images. Audio is sampled at 16
kHz
and then converted into a sequence of tokens by the Universal Speech Model. Gemini's dataset is multimodal and multilingual, consisting of "web documents, books, and code, and including image, audio, and video data".
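As an illustration of the multi-query attention mentioned above, the following is a minimal sketch in plain Python. It is not Gemini's implementation: the function names, tensor layout, and toy dimensions are assumptions chosen for readability. It shows only the defining idea, which is that every head has its own queries while all heads share a single set of keys and values, combined with the causal masking of a decoder-only model.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def multi_query_attention(queries, keys, values):
    """Causal multi-query attention (illustrative layout).

    queries: [num_heads][seq_len][d]  one query sequence per head
    keys:    [seq_len][d]             shared by all heads
    values:  [seq_len][d]             shared by all heads
    Returns  [num_heads][seq_len][d].
    """
    d = len(keys[0])
    scale = 1.0 / math.sqrt(d)
    outputs = []
    for head_queries in queries:
        head_out = []
        for t, q in enumerate(head_queries):
            # Decoder-only masking: position t attends to positions 0..t.
            scores = [
                scale * sum(a * b for a, b in zip(q, keys[s]))
                for s in range(t + 1)
            ]
            weights = softmax(scores)
            mixed = [
                sum(w * values[s][j] for s, w in enumerate(weights))
                for j in range(d)
            ]
            head_out.append(mixed)
        outputs.append(head_out)
    return outputs


# Toy example: two heads, two positions, dimension two.
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[1.0, 2.0], [3.0, 4.0]]
queries = [
    [[1.0, 0.0], [0.0, 1.0]],  # head 0
    [[0.0, 0.0], [0.0, 0.0]],  # head 1: zero queries give uniform weights
]
out = multi_query_attention(queries, keys, values)
```

Because keys and values are stored once rather than once per head, the key-value cache kept during generation shrinks by a factor equal to the number of heads, which is the usual motivation for this design.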
The second generation of Gemini ("Gemini 1.5") has two models. Gemini 1.5 Pro is a multimodal
sparse mixture-of-experts, with a context length in the millions, while Gemini 1.5 Flash is distilled from Gemini 1.5 Pro, with a context length above 2 million.
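To make the term "sparse mixture-of-experts" concrete, here is a minimal sketch of top-k expert routing in plain Python. It is a generic illustration of the technique, not a description of Gemini 1.5's actual router: the gate weights, the toy experts, and the choice of `top_k=2` are all assumptions.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def sparse_moe(x, gate_weights, experts, top_k=2):
    """Route x to the top_k experts picked by a linear gate.

    gate_weights: [num_experts][d] rows of a hypothetical gating matrix
    experts:      list of callables mapping a vector to a vector
    Returns the weighted output and the indices of the chosen experts.
    """
    logits = [sum(w * v for w, v in zip(row, x)) for row in gate_weights]
    chosen = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:top_k]
    # Renormalize the gate over the selected experts only.
    probs = softmax([logits[i] for i in chosen])
    output = [0.0] * len(x)
    for p, i in zip(probs, chosen):
        expert_out = experts[i](x)  # unselected experts are never evaluated
        output = [o + p * e for o, e in zip(output, expert_out)]
    return output, chosen


# Four toy experts that just scale their input.
experts = [
    lambda v: [2.0 * a for a in v],
    lambda v: [-1.0 * a for a in v],
    lambda v: [4.0 * a for a in v],
    lambda v: [0.0 for _ in v],
]
gate_weights = [[3.0, 0.0], [0.0, 5.0], [2.0, 0.0], [-1.0, 0.0]]
output, chosen = sparse_moe([1.0, 0.0], gate_weights, experts, top_k=2)
print(chosen, output)
```

Only the selected experts run for a given input, so the compute per token grows with `top_k` rather than with the total number of experts.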
Gemma 2 27B is trained on web documents, code, and science articles. Gemma 2 9B was distilled from 27B. Gemma 2 2B was distilled from a 7B model that remained unreleased.
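Distillation, in this sense, means training a smaller "student" model to match the output distribution of a larger "teacher". The sketch below shows the common soft-target formulation, a Kullback-Leibler divergence between temperature-softened distributions. It is a generic example: the text does not state which exact objective was used for Gemma or Gemini, and the function names and the temperature value are assumptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax of logits divided by a temperature (higher means softer)."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) over softened next-token distributions."""
    p = softmax(teacher_logits, temperature)  # teacher targets
    q = softmax(student_logits, temperature)  # student predictions
    return sum(pi * (math.log(pi) - math.log(qi)) for pi, qi in zip(p, q))


teacher = [4.0, 1.0, 0.5]
matching_student = [4.0, 1.0, 0.5]
poor_student = [0.5, 1.0, 4.0]

loss_match = distillation_loss(matching_student, teacher)
loss_poor = distillation_loss(poor_student, teacher)
print(loss_match, loss_poor)
```

The loss is zero when the student reproduces the teacher's distribution and grows as the two diverge, so minimizing it pushes the student toward the teacher's behavior over the whole vocabulary rather than only the single correct token.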
The models released include:
* Gemma 1 (2B, 7B)
* CodeGemma (2B and 7B) - Gemma 1 finetuned for code generation.
* Gemma 2 (2B, 9B, 27B) - 27B trained from scratch. 2B and 9B distilled from larger models.
* Gemma 3 (1B, 4B, 12B, 27B) - Upgrade to Gemma 2, capable of multilinguality (supports 140 languages), longer context length (128k tokens), multimodality, and function calling.
* RecurrentGemma (2B, 9B) - Griffin-based, instead of Transformer-based.
* PaliGemma (3B) - A vision-language model that takes text and image inputs, and outputs text. It is made by connecting a
SigLIP image encoder with a Gemma language model.
* PaliGemma 2 (3B, 10B, 28B) - Upgrade to PaliGemma, capable of more vision-language tasks.
Reception
Gemini's launch was preceded by months of intense speculation and anticipation, which ''
MIT Technology Review
'' described as "peak AI hype".
In August 2023, Dylan Patel and Daniel Nishball of research firm SemiAnalysis penned a
blog post declaring that the release of Gemini would "eat the world" and outclass GPT-4, prompting OpenAI CEO
Sam Altman
to ridicule the duo on
X (formerly Twitter).
Business magnate
Elon Musk
, who co-founded OpenAI, weighed in, asking, "Are the numbers wrong?"
Hugh Langley of ''
Business Insider
'' remarked that Gemini would be a make-or-break moment for Google, writing: "If Gemini dazzles, it will help Google change the narrative that it was blindsided by Microsoft and OpenAI. If it disappoints, it will embolden critics who say Google has fallen behind."
Reacting to its unveiling in December 2023,
University of Washington
professor emeritus
Oren Etzioni predicted a "tit-for-tat
arms race
" between Google and
OpenAI
. Professor
Alexei Efros of the
University of California, Berkeley
praised the potential of Gemini's multimodal approach,
while scientist
Melanie Mitchell of the
Santa Fe Institute
called Gemini "very sophisticated". Professor Chirag Shah of the University of Washington was less impressed, likening Gemini's launch to the routineness of
Apple
's
annual introduction of a new
iPhone
. Similarly,
Stanford University
's Percy Liang, the University of Washington's
Emily Bender, and the
University of Galway
's Michael Madden cautioned that it was difficult to interpret benchmark scores without insight into the training data used.
Writing for ''
Fast Company
'', Mark Sullivan opined that Google had the opportunity to challenge the iPhone's dominant market share, believing that Apple was unlikely to have the capacity to develop functionality similar to Gemini with its
Siri
virtual assistant
.
Google shares spiked by 5.3 percent the day after Gemini's launch.
Google faced criticism for a demonstration video of Gemini that was not recorded in real time.
Gemini 2.5 Pro Experimental debuted at the top position on the LMArena leaderboard, a benchmark measuring human preference, indicating strong performance and output quality.
The model achieved state-of-the-art or highly competitive results across various benchmarks evaluating reasoning, knowledge, science, math, coding, and long-context performance, such as
Humanity's Last Exam
, GPQA, AIME 2025, SWE-bench and MRCR.
Initial reviews highlighted its improved reasoning capabilities and performance gains compared to previous versions.
Published benchmarks also showed areas where contemporary models from competitors like
Anthropic
,
xAI, or
OpenAI
held advantages.
See also
* Gato, a multimodal neural network developed by DeepMind
* Gemini Robotics