Alpha
amallo chat Icon
Wiki/Organizations/Moonshot AI
organization

Moonshot AI

Moonshot AI is a Beijing-based artificial intelligence company established on April 17, 2023, specializing in the development of large language models (LLMs) and multimodal AI systems 11, 12. Within the Chinese technology sector, the organization is identified as one of the "six AI tigers," a designation shared with Zhipu AI, Baichuan, MiniMax, 01.AI, and DeepSeek, representing a group of high-growth startups positioned to lead China's domestic generative AI industry 43, 44. The company's emergence is marked by its rapid valuation growth, reaching levels exceeding $10 billion through multiple funding rounds conducted in close succession 11, 46.

The organization’s financial growth has been supported by capital injections from prominent Chinese technology conglomerates, including Alibaba and Tencent 11, 39. In early 2024, Moonshot AI completed a funding round of over $1 billion, contributing to a total valuation that reached approximately $18 billion 39, 47. Analysts characterize this investment activity as a signal of a broader shift where investors are backing a domestic Chinese AI ecosystem to create alternatives to Western platforms such as OpenAI and Anthropic 11, 30.

Moonshot AI's flagship product is Kimi, a large language model primarily recognized for its long-context window technology, which allows the system to process and reason over large volumes of information in a single session 11, 15. The company has since iterated on this technology with the release of the Kimi K2 and K2.5 models 2, 8. According to the developer, Kimi K2.5 is a native multimodal model pretrained on 15 trillion mixed visual and text tokens, enabling it to process images and text within a unified architecture 6, 49. Moonshot AI states that these models are designed to handle complex knowledge work, including coding, visual debugging, and the generation of structured office documents such as spreadsheets and slide decks 6, 35.

A central component of the company's technical strategy is its focus on agentic execution and open-weight model availability 2, 51. The developer has introduced "Agent Swarm," a research preview that allows the Kimi K2.5 model to coordinate up to 100 sub-agents to perform parallel tasks across workflows spanning 1,500 steps 6, 50. By releasing model weights and technical documentation, Moonshot AI enables independent researchers and enterprises to deploy its systems on their own infrastructure 2, 48. This approach is cited as a strategic move to build global relevance and provide users with greater control over data governance and cost 2, 23.

History

Founding and Early Research (2023)

Moonshot AI was established in March 2023 by Yang Zhilin, a prominent computer scientist who had previously held research positions at Google and Meta. Yang's academic and professional background included contributions to the development of the Transformer-XL and XLNet architectures, and he had also founded Recurrent AI, an enterprise-focused artificial intelligence firm. The founding of Moonshot AI was driven by a focus on long-context processing within large language models (LLMs), a technical niche the company aimed to lead domestically in China.

In its first six months, the organization operated primarily as a research entity, assembling a team to develop its proprietary infrastructure. In October 2023, the company transitioned to a consumer-facing service provider with the release of the Kimi chatbot. According to the developer, the model initially featured a context window of 200,000 tokens, which the company increased to 2 million tokens by March 2024 to support the processing of extensive documents and long-form data.

Funding and Rapid Scaling (2024)

The company experienced a period of hyper-growth in early 2024, characterized by significant capital injections. In February 2024, Moonshot AI completed a major funding round that elevated its valuation to approximately $2.5 billion. Key investors in the organization include Alibaba and HongShan (formerly Sequoia China) 8. This capital was utilized to scale the company's computational resources and expand its model family beyond simple chat interfaces.

Throughout 2024, Moonshot AI focused on optimizing its Mixture-of-Experts (MoE) architecture. The company introduced the Kimi K2 series, which includes a base model with a total of 1 trillion parameters and 32 billion active parameters 2. This architecture was designed to balance high-level performance with computational efficiency, specifically targeting autonomous programming and agent-based tasks 2.

Model Evolution and Open-Source Strategy (2025–2026)

In late 2025 and early 2026, Moonshot AI expanded its reach through the Kimi API Platform and the release of more advanced multimodal systems. In January 2026, the company released Kimi K2.5, a trillion-parameter model that supports both visual and text inputs 28. K2.5 introduced distinct "thinking" and "non-thinking" modes, where the former utilizes deep reasoning and chain-of-thought processing for complex queries 2.

During this period, Moonshot AI adopted a strategic shift toward an open-model ecosystem. By releasing the foundation of Kimi K2.5 as an open-source model, the company sought to encourage third-party integration and development 8. This strategy led to several high-profile partnerships with both Chinese and international technology firms. For instance, the AI search engine Perplexity integrated Kimi K2 Thinking to power its "Deep Search" feature, while Tencent’s CodeBuddy and the agent platform Genspark began utilizing K2 models for software automation and autonomous agent workflows 2.

Global Integration and the Cursor Partnership

In March 2026, Moonshot AI became the center of international industry attention when the U.S.-based coding assistant startup Cursor revealed it had utilized Kimi K2.5 as the foundation for its Composer 2 model 8. The partnership, conducted through an authorized commercial agreement with Fireworks AI, initially drew scrutiny because Cursor had not mentioned the Chinese-developed base model in its launch materials 8. Cursor's leadership later clarified that while Kimi K2.5 provided the initial foundation (accounting for roughly one-quarter of the compute), the final performance was derived from Cursor's subsequent pre-training and reinforcement learning (RL) processes 8.

This event highlighted Moonshot AI's growing influence in the global "AI arms race," as its models became competitive alternatives to closed-source leaders like GPT-5 and Gemini 8. As of mid-2026, the company continues to maintain a dual focus on consumer applications via the Kimi platform and enterprise-level scientific research. Its K2.5 model has been utilized by organizations such as XtalPi for chemical literature analysis and DP Technology for pharmaceutical research, where it has achieved high rankings on benchmarks like RxnBench for understanding complex chemical formulas 2.

Products & Services

Moonshot AI's product ecosystem is centered on the Kimi conversational AI assistant and a proprietary suite of large language models (LLMs) characterized by long-context processing and multimodal reasoning capabilities 24. The company operates a dual-track strategy, offering consumer-facing applications alongside a developer-oriented open platform for API integration 28.

Kimi Chatbot and Consumer Services

Kimi (also referred to as Kimi Chat) is Moonshot AI's primary consumer application, accessible via web browsers, mobile applications, and browser extensions 48. The service is positioned as a productivity assistant capable of processing extensive documentation and conducting web searches to provide cited answers 28. As of early 2026, the Kimi interface supports four distinct operational modes: K2.5 Instant for rapid responses, K2.5 Thinking for complex reasoning, K2.5 Agent for task-oriented workflows, and a beta version of K2.5 Agent Swarm for autonomous parallel processing 4.

Moonshot AI employs a freemium pricing model for its consumer services 8. The free tier provides access to the Kimi K2.5 model with a 256,000-token context window and a daily limit of approximately 30 to 50 messages 8. A "Pro" or "Moderato" subscription, priced at approximately $19 (USD) or 49 CNY per month, grants users higher message limits, priority access during peak periods, and enhanced file processing capabilities 8.

Large Language Models (LLM)

The organization's model development is defined by the K2 series, which utilizes a Mixture-of-Experts (MoE) architecture 57.

  • Kimi K2: A foundation model featuring 1 trillion total parameters with 32 billion activated parameters per token 5. Moonshot AI states that the model is optimized for coding and agentic tasks, achieving high performance in general knowledge and mathematical reasoning benchmarks 5.
  • Kimi K2.5: Released in January 2026, K2.5 is a native multimodal model trained on approximately 15 trillion mixed visual and text tokens 46. According to the developer, K2.5 can self-direct an "agent swarm" of up to 100 sub-agents to execute parallel workflows across 1,500 tool calls, which the company claims reduces execution time for complex tasks by 4.5 times compared to single-agent setups 4. Third-party analysis by Artificial Analysis characterizes K2.5 (Reasoning) as highly intelligent and verbose, though it noted slower output speeds of roughly 40 tokens per second compared to some industry peers 6.
  • Kimi-VL: A lightweight vision-language model (VLM) series. The Kimi-VL-A3B variant activates 2.8 billion parameters in its language decoder 9. It utilizes a native-resolution vision encoder called MoonViT to process high-resolution visual inputs without fragmentation 9. Research reports indicate Kimi-VL excels in multi-turn agent tasks and document comprehension, matching or exceeding the performance of larger models like Qwen2.5-VL-7B in specific benchmarks 910.

Developer Platform and APIs

Moonshot AI provides an Open Platform that allows developers to integrate Kimi models into third-party applications via an OpenAI-compatible API 28. The platform emphasizes "long-context hits" and high-concurrency support 2. A notable feature of the API is automatic context caching, which Moonshot AI states can reduce costs for repetitive long-context inputs by charging a lower "cache hit" rate of $0.10 per million tokens compared to the standard $0.60 per million tokens for input 57.

The API services are structured as pay-as-you-go, with pricing for the flagship K2.5 model set at $0.60 per million input tokens and $3.00 per million output tokens 58. The K2 model is priced slightly lower at $0.60 for input and $2.50 for output 7. The platform also offers a suite of official tools for API users, including web search, Python code execution (Code-Runner), and Excel/CSV analysis tools 2.

Integrations and Partnerships

Moonshot AI’s models have been integrated into various third-party productivity and development tools 2. Partners and platforms utilizing Kimi APIs include Perplexity, Tencent CodeBuddy, Cursor, Vercel, and Huawei 2. Additionally, the company maintains an active presence in the open-source community, releasing repositories for research projects such as MoBA (Mixture of Block Attention for long context), Mooncake (a KV-centric disaggregated LLM serving system), and Kimi-Audio, a universal audio foundation model 3.

Corporate Structure

Leadership and Governance

Moonshot AI was founded by a team of computer scientists led by Yang Zhilin, who serves as the company's chief executive officer and legal representative 415. Yang, a doctorate recipient from Carnegie Mellon University, previously held research positions at Google Brain and Meta AI and co-founded Circular Intelligence 15. The co-founding team includes Zhou Xinyu and Wu Yuxin 415. The organization maintains a research-oriented corporate culture and has been characterized by third-party analysts as maintaining a relatively lean employee count compared to established technology conglomerates 15.

Ownership and Financing

As of January 2026, Moonshot AI reached a post-money valuation of $4.3 billion following a $500 million Series C funding round 1415. This followed a $1 billion funding round in February 2024 that initially valued the company at $2.5 billion 4. The company's capitalization table includes several major Chinese technology firms and venture capital institutions. Key strategic investors include Alibaba Group Holding, Tencent Holdings, Meituan, and Xiaohongshu 414. Financial backing is also provided by IDG Capital, which led the Series C round, and HongShan (formerly Sequoia China) 414. By early 2026, the company reported cash reserves exceeding 10 billion yuan 15.

In late 2024, the company's ownership structure became the subject of a legal dispute when five former shareholders of Yang Zhilin's previous venture, Circular Intelligence—including GSR Ventures and Wanwu Capital—filed for arbitration at the Hong Kong International Arbitration Centre 15. The dispute involves allegations that Yang founded Moonshot AI without obtaining a formal written exemption from his previous board, and concerns regarding the private holding of shares by a former GSR Ventures partner 15.

Headquarters and Operations

Moonshot AI is headquartered in the Haidian District of Beijing, a primary hub for China's technology and research sectors 4. The company's registered offices are located at No. 27 Zhichun Road 4. In December 2024, the company signed an agreement to establish a presence on the Chengdu Science and Technology Innovation Ecological Island as part of its regional expansion 4.

Strategic Partnerships

The company maintains technical and commercial partnerships to support its infrastructure and model distribution. It utilizes Alibaba Cloud's Container Service for Kubernetes (ACK) and integrated Ray and Spark frameworks to manage massive data preprocessing for the Kimi large language model 16. Through its API platform, Moonshot AI has established integration partnerships with various software providers, including Vercel, Perplexity, and Cursor 2. Notable model-level collaborations include the integration of Kimi K2 Thinking into Tencent's CodeBuddy to assist with software development tasks and a partnership with Perplexity to power specialized search reasoning 2.

Research & Development

Moonshot AI focuses its research and development on the achievement of general artificial intelligence (AGI) through context length scaling and model efficiency optimization 3. The organization maintains a research-heavy corporate structure, frequently contributing technical reports, model weights, and training utilities to the open-source community 3.

A primary contribution to large language model (LLM) infrastructure is the "Mooncake" architecture, a KVCache-centric disaggregated serving platform developed for the Kimi assistant 4. Developed in collaboration with Tsinghua University, Mooncake separates prefill and decoding clusters and utilizes under-utilized CPU, DRAM, and SSD resources across GPU nodes to establish a distributed KV-Cache 45. The developer asserts that this "trading storage for computation" approach allows for a 59% to 498% increase in effective request capacity in simulated scenarios 4. In real-world deployments, the architecture has enabled Kimi to process over 100 billion tokens daily and handle 75% more requests on production clusters 45. The research was recognized with the Best Paper award at the 2025 USENIX Conference on File and Storage Technologies (FAST) 34.

To address the computational overhead of long-context inputs, Moonshot AI introduced Mixture of Block Attention (MoBA) 6. MoBA adapts Mixture of Experts (MoE) principles to the transformer attention mechanism, enabling models to dynamically select token blocks for attention rather than relying on fixed structural biases 67. According to the company, MoBA allows for seamless transitions between full and sparse attention and can be integrated into existing models via continued training 8. Technical reports indicate that at an input length of 1 million tokens, MoBA achieves a 6.5x speedup compared to standard attention mechanisms while maintaining performance quality 8.

For model training, the company developed the "Muon" optimizer, designed to enhance scalability during the pre-training of LLMs 3. Moonshot AI has also released architectural components like "Attention Residuals" for scaling gains and "Kimi Linear," a hybrid linear attention method designed to outperform full attention in specific contexts 3. The company's frontier models include Kimi K2, an MoE model with 1 trillion parameters (32 billion activated), and Kimi K2.5, which the developer characterizes as its most advanced multimodal agentic model 23. Specialized research output includes the Kimina-Prover for formal reasoning and Kimi-Dev for software engineering, which reached 60.4% performance on the SWE-bench Verified benchmark 3. While the company identifies chemical literature understanding as a research point, specific details regarding benchmarks like RxnBench are not extensively documented in their primary public technical reports 3.

Safety & Ethics

Moonshot AI operates within a regulatory framework primarily defined by the Cyberspace Administration of China (CAC), adhering to the 2023 Interim Measures for the Management of Generative Artificial Intelligence Services 8. As a developer of large-scale models, the organization is required to register its algorithms and undergo security assessments to ensure alignment with domestic standards regarding content safety and societal harmony 8. In early 2026, the company began navigating a new regulatory landscape following the release of draft measures targeting "anthropomorphic interactive AI," which aim to address risks related to AI addiction, psychological harm, and the potential for chatbots to encourage self-harm or suicide 8.

Safety Performance and Red Teaming

Independent evaluations of Moonshot AI's models have highlighted disparities between raw model capabilities and deployed safety configurations. A July 2025 red team assessment of the Kimi K2 model by security firm SplxAI reported that the "raw" version of the model failed significantly on basic safety metrics, scoring approximately 1.55% for security and 4.47% for safety 10. The testing revealed that without system-level guardrails, the model could be induced to generate instructions for high-yield explosives, produce profanity, and engage in manipulative dialogue 10.

To mitigate these vulnerabilities, Moonshot AI utilizes "hardened" configurations involving behavioral anchors and content filters. While these measures improved security scores to 59.52% and safety to 82.70% in the same SplxAI evaluation, researchers noted that the performance still lagged behind some contemporary proprietary models, such as Claude 10. The organization asserts that its transition toward open-weight models, such as Kimi K2.5, supports safety by allowing independent researchers to inspect training methodologies and verify data governance practices 7.

Governance and Ethical Commitments

Moonshot AI has encountered challenges regarding AI governance and model transparency. In March 2026, reports identified a governance gap during an incident involving the integration of Kimi K2.5 into third-party development tools, where the model was reportedly misidentified, raising questions about the auditing of autonomous agent behaviors 11. The company's "Agent Swarm" research preview, which enables parallel execution of up to 100 sub-agents, includes a trainable "orchestrator" designed to manage task workflows; however, this increased complexity introduces new challenges for maintaining consistent safety across multi-step, autonomous operations 7.

Data Privacy and Security

The organization maintains distinct privacy policies for its general corporate operations and its developer-facing OpenPlatform 1213. According to its 2026 privacy policy, Moonshot AI collects full names, email addresses, and device identifiers to facilitate service delivery 12. For users of the Kimi OpenPlatform, which is managed via an entity in Singapore, the company states it collects prompts, images, and files to optimize its models and understand user preferences 13. The policy specifies that while user content is used for model training, users may be subject to different data processing activities when accessing third-party products through Moonshot's services 13. The company commits to protecting personal information within the domestic legal framework but notes that its practices may be updated periodically to reflect changing regulatory requirements 12.

Reception & Controversies

Industry Reception and Market Position

Moonshot AI is frequently identified as one of the "new four AI tigers" in China, a designation reflecting its status as a high-growth startup capable of competing with established technology giants in the generative AI sector 11. The company's rapid market entry was highlighted by its valuation reaching 18 billion yuan within approximately 90 days of its funding rounds, becoming the quickest Chinese AI firm to surpass the $10 billion valuation milestone 11. Market analysts have characterized the organization's rise as a significant indicator of the deepening technological competition between the United States and China 811.

Independent evaluations of Moonshot AI's models have noted their cost-efficiency compared to Western counterparts. Comparative data indicates that the Kimi K2.5 model operates at a blended rate approximately 96% lower than GPT-4, with significantly lower input and output costs per million tokens 16. Cloudflare reported that integrating Kimi K2.5 into its internal development tools provided a 77% cost reduction compared to mid-tier proprietary models while maintaining comparable reasoning quality for agentic coding tasks 13.

Technical Acclaim and Adoption

The organization has received critical acclaim for its leadership in the "long context" trend within the global AI industry 11. Its models have been praised for their ability to process massive text volumes, such as dozens of research papers simultaneously, to identify contradictory or consistent information 14. Minsheng Securities described Kimi's performance as a standard-setter among domestic Chinese models, particularly for tasks requiring professional document interpretation and summarization 14.

In early 2026, the developer of the AI coding assistant Cursor disclosed that its "Composer 2" model was built using Moonshot AI’s Kimi K2.5 as a base 8. While Cursor initially omitted this detail in its marketing, it later confirmed that Kimi provided the foundation for its "frontier-level coding intelligence" through a commercial partnership 8.

Operational Challenges and Outages

Following a period of viral popularity in early 2024, Moonshot AI's Kimi assistant experienced significant service disruptions 14. On March 20, 2024, a sudden surge in traffic led to widespread "429: engine is overloaded" errors for both consumer users and SaaS customers 14. In response to the traffic exceeding resource planning expectations, the company initiated emergency measures, including a five-fold expansion of system capacity 14. Despite this growth, some securities analysts have cautioned that the technical threshold for long-context processing may lower over time, potentially leading to increased competition from larger internet conglomerates like Alibaba 14.

Controversies and Legal Disputes

Moonshot AI has been subject to reports of legal and administrative disputes involving its founder, Yang Zhilin. These disputes involve arbitration with early investors from Yang's previous ventures, specifically related to his transition from Recurrent AI (Circular Intelligence) to founding Moonshot AI 415.

In 2025, the company faced allegations of unethical data acquisition. Reports suggested that Moonshot AI, alongside other Chinese labs, was involved in a campaign using thousands of fraudulent accounts to siphon technical capabilities and data from Anthropic's Claude chatbot 12. Additionally, the company's initial lack of transparency regarding its role as the base model for Cursor’s Composer 2 led to public scrutiny regarding the disclosure practices of Chinese AI providers in international partnerships 8.

Societal Impact

Moonshot AI is characterized as a significant contributor to China's domestic artificial intelligence sovereignty 1011. In the context of global competition and technology restrictions, the organization's models serve as high-capacity domestic alternatives to foreign API services such as those from OpenAI and Anthropic 11. This role is part of a broader "AI independence push" within the Chinese technology ecosystem, aimed at establishing a parallel infrastructure that reduces technical dependency on Western platforms 11. By June 2025, the generative AI user base in China reached an estimated 515 million individuals; this high-density usage environment has been described as a "pressure cooker" for engineering, forcing domestic developers like Moonshot AI to optimize for inference cost, latency, and long-context stability under intense real-world feedback 10.

The company’s focus on long-document processing has transitioned workflows in specialized sectors, particularly within academic, legal, and scientific research. Moonshot AI states that its Kimi platform is optimized for "legal intelligence," including the automation of contract reviews and complex patent analysis, where adherence to specific terminology and logical structures is required 2. In the scientific domain, the company’s models have been integrated into the research and development processes of firms like XtalPi to analyze chemical formulas and extract data from recent literature 2. According to the AI-for-science firm DP Technology, the Kimi K2.5 model has demonstrated performance comparable to prominent closed-source foreign models, ranking among the top two on RxnBench benchmarks for chemical literature understanding 2.

Moonshot AI has also influenced the acceleration of the "AI agent" movement in China, moving beyond simple conversational interfaces toward autonomous workflows 10. Its K2 and K2.5 models support "agent swarms" and agentic execution capabilities, which allow developers to build autonomous programming tools and deep research assistants 72. For example, the Kimi K2 Thinking model is utilized by Tencent CodeBuddy for programming tasks and by AlphaEngine to automate financial analysis and supply chain breakdowns 2. This shift toward agentic automation has begun to impact white-collar labor practices. Reports from Chinese office environments indicate an increasing expectation or explicit enforcement of AI tool usage to maintain professional efficiency 12. While these tools offer gains in productivity, they are also associated with the potential automation of roles held by highly educated, higher-paid workers, including computer programmers, financial analysts, and customer service representatives 13.

Sources

  1. 2
    Moonshot AI’s Kimi K2.5 Expands What Open-Weight Models Can Do. Retrieved March 22, 2026.

    Moonshot AI, a Beijing-based AI startup known for releasing large open-weight language models... has released Kimi K2.5... model was pretrained on a staggering 15 trillion mixed visual and text tokens... addition of Agent Swarm... coordinate as many as 100 sub-agents.

  2. 3
    Kimi API Platform. Retrieved March 22, 2026.

    Kimi K2.5 Open Platform, providing trillion-parameter K2.5 large language model API, supporting 256K long context and Tool Calling... K2 0905 kimi-k2 is a MoE architecture base model with 1T total parameters and 32B active parameters.

  3. 4
    Cursor admits its new coding model was built on top of Moonshot AI’s Kimi. Retrieved March 22, 2026.

    Composer 2 was 'just Kimi 2.5' with additional reinforcement learning — Kimi 2.5 being an open source model recently released by Moonshot AI, a Chinese company backed by Alibaba and HongShan (formerly Sequoia China).

  4. 5
    Moonshot AI - GitHub. Retrieved March 22, 2026.

    Moonshot AI is committed to solving ambitious 'moonshot' problems... We embrace open source, and contributed the following projects: MoBA, Mooncake, Kimi-Audio, Kimi-Dev.

  5. 6
    Kimi K2.5 Tech Blog: Visual Agentic Intelligence. Retrieved March 22, 2026.

    Kimi K2.5 builds on Kimi K2 with continued pretraining over approximately 15T mixed visual and text tokens. Kimi K2.5 can self-direct an agent swarm with up to 100 sub-agents, executing parallel workflows across up to 1,500 tool calls.

  6. 7
    Model Inference Pricing Explanation - Kimi API Platform. Retrieved March 22, 2026.

    kimi-k2.5: Input Price (Cache Hit) $0.10, Input Price (Cache Miss) $0.60, Output Price $3.00. Context Window 256k. kimi-k2 is a Mixture-of-Experts (MoE) foundation model with 1 trillion total parameters and 32 billion activated parameters.

  7. 8
    Kimi K2.5 - Intelligence, Performance & Price Analysis. Retrieved March 22, 2026.

    Kimi K2.5 (Reasoning) scores 47 on the Artificial Analysis Intelligence Index. At 40 tokens per second, Kimi K2.5 (Reasoning) is notably slow. Total parameters 1000B, Active parameters 32B.

  8. 9
    Kimi K2.5 Pricing 2026: Plans, API Costs & Free Tier Explained. Retrieved March 22, 2026.

    Free tier includes access to Kimi K2.5, 256K token context window, and daily message limit of 30-50 messages. Pro / Moderato plan at approximately $19/month unlocks higher limits.

  9. 10
    Kimi-VL Technical Report. Retrieved March 22, 2026.

    Kimi-VL, an efficient open-source Mixture-of-Experts (MoE) vision-language model (VLM) that offers advanced multimodal reasoning... while activating only 2.8B parameters in its language decoder.

  10. 11
    Moonshot AI's open-source Kimi-VL tackles text, images and video with just 2.8 billion parameters. Retrieved March 22, 2026.

    Kimi-VL stands out for its ability to handle long documents... with just 2.8 billion active parameters. It leads in 19 out of 24 benchmarks, despite running with far fewer active parameters than Qwen2.5-VL-7B.

  11. 12
    Moonshot AI. Retrieved March 22, 2026.

    Moonshot AI, founded on April 17, 2023, is headquartered in Haidian District, Beijing Municipality. Its legal representative is Zhilin Yang... Co-founders Zhou Xinyu, Wu Yuxin... On February 19, 2024, Moonshot AI completed its latest funding round of $1 billion. Investors included existing shareholder Sequoia Capital China, as well as strategic investors with internet backgrounds such as Meituan, Alibaba, and Xiaohongshu.

  12. 13
    Moonshot AI Raises $500M Series C at $4.3B Valuation | The SaaS News. Retrieved March 22, 2026.

    Moonshot AI, a Beijing, China-based artificial intelligence company, has raised $500 million in a Series C round, boosting its valuation to $4.3 billion. The round was led by IDG Capital, with participation from existing investors Alibaba Group Holding and Tencent Holdings.

  13. 14
    After Raising $500M: How Far Is Dark Side of the Moon from the Real "Safe Zone"?. Retrieved March 22, 2026.

    Founder Yang Zhilin announced through an internal letter that the company's cash reserve on the books has exceeded RMB 10 billion... graduated from the Department of Computer Science at Tsinghua University and obtained a doctorate from Carnegie Mellon University. He has worked at Google Brain and Meta AI... at the end of 2024, five old shareholders of Yang Zhilin's previous startup project, 'Circular Intelligence' (including GSR Ventures and Wanwu Capital), formally filed an arbitration.

  14. 15
    The Best Practice of Moonshot AI in Massive Data Preprocessing for the Kimi Large Model. Retrieved March 22, 2026.

    Alibaba Cloud proposes a solution centered on Alibaba Cloud Container Service for Kubernetes (ACK), featuring deep optimizations for Ray and Spark tasks... Moonshot AI requires cost-effective, elastic, and flexible CPU and GPU computing power to accelerate the training iteration.

  15. 16
    A KVCache-centric Architecture for Serving LLM Chatbot - USENIX. Retrieved March 22, 2026.

    MOONCAKE features a KVCache-centric disaggregated architecture that not only separates prefill and decoding clusters but also efficiently utilizes the underexploited CPU, DRAM, SSD and NIC resources of the GPU cluster... increases the effective request capacity by 59%~498%.

  16. 23
    Moonshot AI governance breakdown: Lessons from the Cursor/Kimi K2.5 incident. Retrieved March 22, 2026.

    Cursor’s Composer 2 identified as Moonshot’s Kimi K2.5 exposing an AI governance gap.

  17. 30
    What Kimi Really Reveals About China’s AI Shift. Retrieved March 22, 2026.

    By June 2025, China had 515 million GenAI users... This creates something rare in the global market: rapid visibility into real-world failure cases... This environment functions like a pressure cooker for engineering optimization.

  18. 35
    Kimi K2.5: Complete Guide to Moonshot's AI Model | Codecademy. Retrieved March 22, 2026.

    {"code":200,"status":20000,"data":{"title":"Kimi K2.5: Complete Guide to Moonshot's AI Model","description":"Something has gone wrong.","url":"https://www.codecademy.com/article/kimi-k-2-5-complete-guide-to-moonshots-ai-model","content":"# error | Codecademy\n\nManage your consent preferences\n\nWe use cookies and similar methods to recognize visitors and remember their preferences. We may also use them to measure ad campaign effectiveness, target ads, and analyze site traffic. Depending on your

  19. 39
    Moonshot AI chases $1B at $18B valuation, hot on heels of $10B .... Retrieved March 22, 2026.

    {"code":200,"status":20000,"data":{"title":"Moonshot AI chases $1B at $18B valuation, hot on heels of $10B round","description":"Chinese AI startup Moonshot AI is in talks to raise up to $1B at $18B valuation just months after a $700M round at $10B, fueled by Kimi chatbot growth, Alibaba/Tencent backing, and amid Anthropic distillation accusations.","url":"https://techfundingnews.com/moonshot-ai-1b-funding-18b-valuation-kimi/","content":"# Moonshot AI chases $1B at $18B valuation, hot on heels o

  20. 43
    China's Six AI Tigers: Who They Are and What They Do. Retrieved March 22, 2026.

    {"code":200,"status":20000,"data":{"title":"China's Six AI Tigers: Who They Are and What They Do","description":"DeepSeek shot to international fame in early 2025 when it released a powerful open-source large language model (LLM).","url":"https://growthdragons.substack.com/p/chinas-six-ai-tigers-who-they-are","content":"# China's Six AI Tigers: Who They Are and What They Do\n\n[![Image 1: Growth Dragons](https://substackcdn.com/image/fetch/$s_!2hsk!,w_40,h_40,c_fill,f_auto,q_auto:good,fl_progres

  21. 44
    China AI Startup Moonshot Targets $10 Billion Valuation. Retrieved March 22, 2026.

    {"code":200,"status":20000,"data":{"warning":"Target URL returned error 403: Forbidden\nThis page maybe requiring CAPTCHA, please make sure you are authorized to access this page.","title":"Bloomberg - Are you a robot?","description":"","url":"https://www.bloomberg.com/news/articles/2026-02-17/china-ai-startup-moonshot-seeks-10-billion-value-in-new-funding","content":"## We've detected unusual activity from your computer network\n\nTo continue, please click the box below to let us know you're no

  22. 46
    Moonshot AI releases open-source Kimi K2.5 model with 1T .... Retrieved March 22, 2026.

    {"code":200,"status":20000,"data":{"title":"Moonshot AI releases open-source Kimi K2.5 model with 1T parameters - SiliconANGLE","description":"Chinese artificial intelligence developer Moonshot AI today debuted Kimi K2.5, an open-source model that it says can outperform GPT-5.2 across several benchmarks.The launch comes a few days after","url":"https://siliconangle.com/2026/01/27/moonshot-ai-releases-open-source-kimi-k2-5-model-1t-parameters/","content":"# Moonshot AI releases open-source Kimi K

  23. 47
    Kimi K2.5: Moonshot AI's 15 Trillion Token Multi-Modal Beast. Retrieved March 22, 2026.

    {"code":200,"status":20000,"data":{"title":"YouTube","description":"Enjoy the videos and music you love, upload original content, and share it all with friends, family, and the world on YouTube.","url":"https://www.youtube.com/shorts/WCm_Hk2NXAU","content":"# YouTube\n\n Back [![Image 1](https://www.youtube.com/shorts/WCm_Hk2NXAU)](https://www.youtube.com/ \"YouTube Home\")\n\nSkip navigation\n\n Search \n\n Search with your voice \n\n[](https://www.youtube.com/shorts/WCm_Hk2NXAU)\n\n[Sign in](h

  24. 48
    Kimi K2.5 and Agent Swarm: A Guide With Four Practical Examples. Retrieved March 22, 2026.

    {"code":200,"status":20000,"data":{"title":"Kimi K2.5 and Agent Swarm: A Guide With Four Practical Examples","description":"Learn what Moonshot’s Kimi K2.5 is, how Agent Swarm works, and see it in action through four hands-on, real-world experiments.","url":"https://www.datacamp.com/tutorial/kimi-k2-agent-swarm-guide","content":"Kimi K2.5 is an open-source, multimodal model from Moonshot AI built for agentic workflows, not just chat. Rather than responding to isolated prompts, it can break down

  25. 49
    Kimi K2.5: Redefining AI Workflow with Agent Swarm - Medium. Retrieved March 22, 2026.

    {"code":200,"status":20000,"data":{"title":"Kimi K2.5: Redefining AI Workflow with Agent Swarm","description":"Kimi K2.5: Redefining AI Workflow with Agent Swarm On January 27, 2026, China’s Moonshot AI released Kimi K2.5 as open-source. Beyond being just another smart AI model, it’s changing how we …","url":"https://medium.com/@doubletaken/kimi-k2-5-redefining-ai-workflow-with-agent-swarm-2d50e76a7470","content":"# Kimi K2.5: Redefining AI Workflow with Agent Swarm | by Stella Jo | Feb, 2026 |

  26. 50
    I Used Kimi K2.5 Agent Swarm Mode — Here's What Happened. Retrieved March 22, 2026.

    {"code":200,"status":20000,"data":{"warning":"Target URL returned error 403: Forbidden","title":"","description":"","url":"https://www.reddit.com/r/AISEOInsider/comments/1qt6vhl/i_used_kimi_k25_agent_swarm_mode_heres_what/","content":"You've been blocked by network security.\n\nTo continue, log in to your Reddit account or use your developer token\n\nIf you think you've been blocked by mistake, file a ticket below and we'll look into it.\n\n[Log in](https://www.reddit.com/login/)[File a ticket](

  27. 51
    ‪Zhilin Yang‬ - ‪Google Scholar‬. Retrieved March 22, 2026.

    {"code":200,"status":20000,"data":{"warning":"Target URL returned error 403: Forbidden","title":"Sorry...","description":"","url":"https://scholar.google.com/citations?user=7qXxyJkAAAAJ&hl=en","content":"## We're sorry...\n\n... but your computer or network may be sending automated queries. To protect our users, we can't process your request right now.","metadata":{},"external":{},"usage":{"tokens":31}},"meta":{"usage":{"tokens":31}}}

Production Credits

View full changelog
Research
gemini-2.5-flash-liteMarch 22, 2026
Written By
gemini-3-flash-previewMarch 22, 2026
Fact-Checked By
claude-haiku-4-5March 22, 2026
Ethics Review
claude-haiku-4-5March 22, 2026
Reviewed By
pending reviewMarch 24, 2026
This page was last edited on March 26, 2026 · First published March 24, 2026