Moonshot AI

Moonshot AI (Moonshot) is an artificial intelligence (AI) company based in Beijing, China. It focuses on developing large language models and, as of 2024, has been dubbed one of China's "AI Tiger" companies by investors. The company has attracted significant investment and gained attention for its chatbot, Kimi, and its rapid technological advancements.


Background

Moonshot was founded in March 2023 by Yang Zhilin, Zhou Xinyu and Wu Yuxin. It was launched on the 50th anniversary of Pink Floyd's The Dark Side of the Moon, which was Yang's favorite album and the inspiration for the company's name. Yang has stated that his goal in founding Moonshot AI is to build foundational models to achieve artificial general intelligence (AGI). Yang's three milestones are long context length, a multimodal world model, and a scalable general architecture capable of continuous self-improvement without human input.

In October 2023, the company released its chatbot, Kimi, which at launch could process up to 200,000 Chinese characters per conversation. In June 2024, it was reported that Moonshot was planning to enter the US market. An insider said Moonshot was developing products for the US market, including an AI role-playing chat application called Ohai and a music video generator called Noisee. In response, Moonshot stated it had no plans to develop and release overseas products.


Funding and investments

When Moonshot received its initial funding of $60 million, it was valued at $300 million and had 40 employees. In February 2024, Alibaba Group led a $1 billion funding round for Moonshot, which gave it a valuation of $2.5 billion. Yang and related individuals reportedly cashed out $40 million worth of shares, an amount considered unusually large for a company in its first year. In August 2024, Tencent and Gaorong Capital joined as investors in a $300 million funding round that valued Moonshot at $3.3 billion. While several firms continued to support the company, some investors, including GSR Ventures, reduced their involvement amid concerns related to shareholder disputes and allegations of premature profit-taking. In November 2024, a group of investors filed for arbitration against the company's co-founder and Chief Technology Officer, alleging that funding rounds were conducted without obtaining required consent from some AI-focused investors.


Products and research


Kimi

In October 2023, Moonshot launched its first AI chatbot, Kimi, which takes its name from Yang's English name. It emerged as the closest rival to Baidu's Ernie Bot. In March 2024, Moonshot claimed Kimi could handle 2 million Chinese characters in a single prompt, a tenfold increase over the previous version's limit of 200,000. Due to the increased number of users, Kimi suffered a two-day outage beginning on 21 March, and Moonshot issued an apology. On 20 January 2025, Kimi k1.5 was released. Moonshot claimed it matched the performance of OpenAI o1 in mathematics, coding, and multimodal reasoning. Kimi has six tiers of plans, ranging from 5.2 yuan for four days to 399 yuan for a year of priority use.


Mooncake serving platform

Mooncake is the platform that serves Moonshot’s Kimi chatbot and processes 100 billion tokens daily. Moonshot was awarded the Erik Riedel Best Paper Award at the USENIX FAST conference for the paper detailing the architecture of Mooncake.


Scaling Muon optimizer

In the joint Moonshot and UCLA paper "Muon is Scalable for LLM Training", the researchers claim to have successfully scaled the Muon optimizer, previously known for strong results in training small language models, to train a 3B/16B-parameter mixture-of-experts large language model. The researchers report that Muon is about twice as computationally efficient as the standard optimizer, AdamW, when training large models. They have open-sourced their Muon optimizer implementation along with the pretrained and instruction-tuned checkpoints.
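
The summary above does not describe how Muon works. As commonly described, Muon keeps a momentum buffer for each 2D weight matrix and approximately orthogonalizes that buffer with a Newton-Schulz iteration before applying it as the update. The following is a minimal pure-Python sketch under those assumptions, not the authors' implementation: it uses the classic cubic Newton-Schulz iteration rather than the tuned polynomial found in published implementations, it omits any weight decay or update scaling, and the function names and the `lr` and `beta` values are illustrative only.

```python
import math


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]


def orthogonalize(g, steps=15):
    """Approximate the orthogonal polar factor of g.

    Uses the cubic Newton-Schulz iteration X <- 1.5 X - 0.5 X X^T X,
    which drives every nonzero singular value of X toward 1.
    """
    # Scale by the Frobenius norm so all singular values start at or below 1.
    norm = math.sqrt(sum(v * v for row in g for v in row)) or 1.0
    x = [[v / norm for v in row] for row in g]
    for _ in range(steps):
        xxt_x = matmul(matmul(x, transpose(x)), x)
        x = [[1.5 * u - 0.5 * w for u, w in zip(rx, rw)] for rx, rw in zip(x, xxt_x)]
    return x


def muon_like_step(weight, grad, momentum, lr=0.02, beta=0.95):
    """One simplified Muon-style update for a single 2D weight matrix.

    Assumed form: accumulate momentum, orthogonalize it, then take a step.
    lr and beta are illustrative values, not taken from the paper.
    """
    momentum = [[beta * m + g for m, g in zip(rm, rg)] for rm, rg in zip(momentum, grad)]
    update = orthogonalize(momentum)
    weight = [[w - lr * u for w, u in zip(rw, ru)] for rw, ru in zip(weight, update)]
    return weight, momentum
```

Because the orthogonalized update has all singular values close to 1, every direction of the weight matrix moves by a comparable amount regardless of the gradient's scale, which is the property that distinguishes this kind of update from an element-wise rule such as AdamW.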


Scaling reinforcement learning with LLMs

In their technical report on the Kimi k1.5 model, Moonshot researchers outline their reinforcement learning methods, which they claim enabled the model to achieve state-of-the-art reasoning capabilities on par with OpenAI's o1 model. The researchers identify long-context scaling and improved policy optimization methods as the key ingredients, and note that the approach does not rely on more complex techniques such as Monte Carlo tree search, value functions, or process reward models.


See also

* Baichuan
* MiniMax
* Zhipu AI

