
On November 7, 2025, Chutes announced that Kimi K2 Thinking, a newly released open-source model from Moonshot AI, is now available on its network.
In a post on X, Chutes highlighted the model’s features, including a 256K context window, advanced reasoning, and multi-step tool use—all powered by decentralized inference. Users can access the model directly through the Chutes app.
The update underscores how Chutes’ serverless infrastructure allows users to deploy and scale sophisticated models without managing hardware, in line with its goal to democratize access to computation.
About Kimi K2 Thinking
Kimi K2 Thinking is a Mixture-of-Experts model built by Moonshot AI, a startup backed by Alibaba. Released on November 6, it has one trillion total parameters, of which roughly 32 billion are activated per token, ranking it among the largest open-weight models available.
Designed as a “thinking agent,” it shows strong performance in reasoning, search, coding, and long-horizon tasks, with leading benchmark results such as 44.9% on Humanity’s Last Exam (HLE), 60.2% on BrowseComp, and 93% on τ²-Bench Telecom.
The model can execute 200 to 300 sequential tool calls without human intervention and supports test-time scaling of both thinking tokens and tool usage. It runs in INT4 precision for efficient inference and was trained with the Muon optimizer. Kimi K2 Thinking is available through Moonshot’s API, as open weights on Hugging Face, and now via Chutes.
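For readers who want to try the model programmatically, hosted endpoints for models like this typically follow the OpenAI-compatible chat-completions convention. The sketch below shows what such a request could look like; the base URL, API key, and served model name are illustrative assumptions, not values from this announcement (the model identifier is borrowed from the Hugging Face repo id):

```python
# Hypothetical sketch of an OpenAI-compatible chat request to a host
# serving Kimi K2 Thinking. BASE_URL and the model name are assumptions;
# consult the provider's documentation for the real values.
import json
import urllib.request

BASE_URL = "https://example-host.ai/v1"      # assumed, not a documented endpoint
API_KEY = "YOUR_API_KEY"                     # placeholder

payload = {
    "model": "moonshotai/Kimi-K2-Thinking",  # Hugging Face repo id; serving name may differ
    "messages": [
        {"role": "user", "content": "Outline a three-step plan to research MoE routing."}
    ],
    "max_tokens": 1024,
}

def build_request() -> urllib.request.Request:
    """Assemble the HTTP request without sending it."""
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": "application/json",
        },
    )

# With a real endpoint and key, the call would be:
# with urllib.request.urlopen(build_request()) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

Because the request is only assembled, not sent, the snippet runs offline; swapping in a real base URL and key is all that a live call would require.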
Analysts have described the release as a breakthrough for open innovation in China, comparing its impact to DeepSeek’s earlier success. The training reportedly cost around $4.6 million and follows earlier Kimi K2 instruct versions from July and September 2025.
About Chutes
Chutes operates as Subnet 64 on the Bittensor network, providing decentralized serverless compute for deploying and scaling large models. It handles trillions of tokens monthly and is used by applications such as Squad and Chutes Chat.
The addition of Kimi K2 Thinking represents a step toward connecting frontier models with decentralized infrastructure, reducing dependence on centralized providers, and expanding the reach of open computation.
