New · Kimi K2.7 + Agent OS The framework · same model, three gears

The Gear Shift™ + Kimi K2.7.

One model. Three speed modes. Here's what it builds in a single shot — and every box below is live, so drag them around.

3D SOLAR SYSTEM
3D solar system built by Kimi K2.7● LIVE
PLAYABLE BREAKOUT
Playable neon Breakout built by Kimi K2.7● LIVE
50,000-PARTICLE GALAXY
50,000-particle galaxy built by Kimi K2.7● LIVE

Three of the nine builds in this guide — a solar system, a game, a galaxy. Each written in one shot. Each running live right now. Below: the same prompts run through all three gears, timed to the second.

Quality · K2.7 CodeFast · K2.7 HighSpeedNo-think · HighSpeed, thinking off

Most people run one model at one speed and take whatever they get. I wanted to know what the three gears actually buy you. So I gave all three the exact same three prompts — one shot, single HTML file, no follow-ups — and timed every run to the second.

I am not scoring these for you. The numbers below are measured. The builds are live and embedded — they run right in the page. You look, you decide.

3,200+Founders inside AIPB
$100k+/moAIPB MRR
38Countries · live members
Straight from the source

Moonshot's own HighSpeed announcement.

This is the news the whole guide is about. Moonshot just shipped a HighSpeed mode for Kimi K2.7 — they claim up to 6× faster. It's the lane the Fast and No-think gears run on. Watch it, then scroll down and see what it actually builds.

open the original tweet ↗

Start here · what the three gears are

One model. Three ways to run it.

Before the test, the plain-English version. Kimi K2.7 is one coding model. What changes between the modes is two switches — which lane it runs on, and whether it thinks out loud first.

QUALITY
The full model, thinking on. It works the problem out first, then writes. The careful one. kimi-for-coding
FAST ⚡
The HighSpeed lane — a faster way to serve the same model — with thinking still on. kimi-for-coding-highspeed
NO-THINK ⚡
HighSpeed lane, but the "think out loud" step switched off. It answers straight. The terse one. highspeed · thinking off
The three tests: a drag-to-rotate 3D solar system (Three.js, 8 planets + orbits + starfield) · a complete playable neon Breakout game (paddle, ball physics, bricks, score, lives, particles, start + game-over screens) · a 50,000-particle spiral galaxy (gold core, cyan + magenta arms, mouse orbit). Identical prompt to each mode. Nine builds total.
How it works · the three paths

What actually changes under the hood.

Your prompt is the same every time. After that, the three modes send it down three different paths. Here is the whole machine on one screen.

Your prompt one HTML file QUALITY Thinks first hidden scratchpad Standard lane normal serving speed BUILD FAST ⚡ Thinks first hidden scratchpad HighSpeed lane ~180 tok/s · up to 260 BUILD NO-THINK ⚡ No scratchpad reasons inline HighSpeed lane same fast serving BUILD
Two switches. The lane (standard vs HighSpeed) and the thinking step (on vs off). That's the whole difference.

See it? Quality and Fast both think first — they just run on different lanes. Fast and No-think both ride the HighSpeed lane — one keeps thinking, one drops it. Same model the whole way through.

Plain English · the two switches

What "thinking" and "HighSpeed" actually mean.

Two bits of jargon do all the work here. Here's each one in normal words — no hand-waving.

Switch 1 · the thinking step

Thinking on vs off

With thinking on, the model works the problem out on a hidden scratchpad first — plans the layout, the physics, the maths — then writes the code.

With thinking off, there's no scratchpad. It starts writing straight away.

The catch most people get wrong: off does not mean it skips the thinking. The reasoning just moves into the answer itself. For K2.7 that often costs time instead of saving it.

Switch 2 · the lane

Standard lane vs HighSpeed

HighSpeed is a faster way to serve the exact same model. Moonshot quotes around 180 tokens a second, up to 260 on short jobs — words landing on the page noticeably quicker.

It's the same brain. Same training. Just pushed through a faster pipe.

The honest part: HighSpeed is new and capacity-limited right now. When the servers are quiet it flies. When they're busy, it crawls. More on that below — it's the single biggest surprise in the data.

THINKING ON prompt scratchpad plans it out first answer THINKING OFF prompt answer reasons inline Off doesn't delete the thinking — it folds it into the answer. for K2.7, that often means the same work, just slower-felt — not a turbo button
Switching thinking off moves the reasoning, it doesn't remove it.
Old way vs new way

Why three gears beats one speed.

Here's the shift. Most people never touch the gear. They run one model at one speed for everything. The new way is to match the gear to the job.

Old way · one speed
~ every build, same wait
  • Run one model at one fixed speed for everything
  • Quick throwaway script? You wait the full careful run anyway
  • Important hero build? Same speed as a junk test — no extra care dialled in
  • No idea if "faster" mode would've built the same thing
  • When the server is busy, you just sit there guessing
  • Result: you take whatever the one setting gives you
New way · The Gear Shift™
~ match the gear to the job
  • Three gears on the same model — shift per task
  • Throwaway test? Floor it on No-think, move on
  • Daily driver work? Cruise on Fast — quicker when capacity's there
  • The hero build that ships? Shift down to Quality, let it think
  • One toggle in the Agent OS chat box — no config, no restart
  • Result: you control the trade between speed and polish, every single build

That's the whole idea behind the framework below. Same brain, three gears, and a simple rule for which one to grab.

The facts · part 1 · speed

How fast each gear built. Timed, not guessed.

Median time to write a complete, working single-file build across the three tests — start of prompt to file on disk. Lower is faster. These are real timings from my own runs.

Median build time — three one-shot builds

Measured on the coding-plan endpoint. Lower = faster.
No-think ⚡
47s
Fast ⚡
65s
Quality
85s

On the median, the fast gears pull ahead — No-think quickest, Fast second, Quality slowest. So far, no surprise. But the median hides the real story. Look what happens when you plot the full range of every run.

The spread — fastest to slowest run, each gear

Bar = range across all 3 builds. White tick = median. Scale 0–200s.
Quality
71–98s
No-think ⚡
39–125s
Fast ⚡
53–188s
0s50s100s150s200s

Here's the truth the median buries:

Quality was the steadiest gear of the three. Every run landed between 71 and 98 seconds. Tight.

The "fast" gears were the wild ones. Fast swung from 53 seconds all the way to 188. No-think from 39 to 125. Same prompt, same gear, triple the wait — because the HighSpeed lane was busy on the slow runs.

So the real headline isn't "fast is faster." It's: fast is faster when capacity is there, and a coin-flip when it isn't. Quality is slower but you can set your watch by it.

The facts · part 2 · every run

All nine builds. Nothing hidden.

Here is every single run — build time, file size, and whether it ran. No averages, no spin. The raw card for each of the nine.

SOLAR · Quality71s
10.5 KB written · ran ✓
SOLAR · Fast65s
7.5 KB written · ran ✓
SOLAR · No-think47s
7.3 KB written · ran ✓
GAME · Quality85s
19.9 KB written · ran ✓
GAME · Fast188s
17.5 KB written · ran ✓
GAME · No-think125s
18.6 KB written · ran ✓
GALAXY · Quality98s
8.9 KB written · ran ✓
GALAXY · Fast53s
8.6 KB written · ran ✓
GALAXY · No-think39s
8.8 KB written · ran ✓

Two facts worth pulling out of that grid, because they cut against the hype:

Every build ran on the first try. Nine for nine. No mode shipped a broken file. Whatever else changes, K2.7 lands working code in one shot across all three gears.

The fast gears didn't write less code. Quality wrote 38 KB total across its three builds. Fast wrote 34 KB. No-think wrote 35 KB. Nearly the same. So "faster" here isn't "lazier" — it's the same amount of work, served quicker (when the lane's clear).

✦ ✦ ✦
See for yourself · test 1 · 3D solar system

Three.js solar system, one shot.

Sun plus all eight planets orbiting at different speeds, an orbit ring each, a starfield, drag to rotate, scroll to zoom. Same prompt to all three gears.

"Build a 3D solar system: the sun plus all 8 planets orbiting at different speeds with small labels, an orbit ring per planet, and a starfield. Let me drag to rotate the camera and scroll to zoom. Use three.js from a CDN."
QUALITY · 71s
Quality solar system● LIVE
FAST ⚡ · 65s
Fast solar system● LIVE
NO-THINK ⚡ · 47s
No-think solar system● LIVE

drag to rotate · scroll to zoom · each one is the live build, not a video

QUALITY
71s
10.5 KB · ran ✓
FAST ⚡
65s
7.5 KB · ran ✓
NO-THINK ⚡
47s
7.3 KB · ran ✓

What's on screen, plainly: all three built a working solar system — sun, eight planets, orbit rings, labels, a ringed Saturn, drag-to-rotate. Quality spaced the orbits the most evenly. No-think put the most planet labels on screen at once. Fast came out a touch sparser. On this one, you'd struggle to pick which gear built which — and the fastest build (No-think, 47s) is right in the mix.

See for yourself · test 2 · playable neon Breakout

A complete playable game, one shot.

Paddle, real ball physics, a grid of neon bricks, score, lives, a particle burst on every break, a start screen and a game-over screen. The hardest test — it has to actually be fun.

"Build a COMPLETE polished playable neon Breakout game in one HTML file: a paddle, real ball physics, a grid of colourful bricks, score, lives, particle burst when a brick breaks, a start screen and a game-over screen. Mouse or arrow keys move the paddle."
QUALITY · 85s
Quality breakout game● LIVE
FAST ⚡ · 188s
Fast breakout game● LIVE
NO-THINK ⚡ · 125s
No-think breakout game● LIVE

click to start · move the paddle with your mouse · break every brick

QUALITY
85s
19.9 KB · ran ✓
FAST ⚡
188s
17.5 KB · ran ✓
NO-THINK ⚡
125s
18.6 KB · ran ✓

What's on screen, plainly: all three are genuinely playable — paddle, ball physics, neon brick grid, score, lives, start and game-over screens. Fast and No-think both wrapped the play-field in a 3D framed border that adds depth. Quality kept it flat and clean. Play all three — they feel the same in your hands.

Now look at the clock. This is the run where Fast took 188 seconds — slower than Quality's 85, on an identical prompt — because the HighSpeed lane was throttled mid-build. Same playable result. More than double the wait. That single number is the whole capacity story, caught in one test.

See for yourself · test 3 · 50,000-particle galaxy

A spiral galaxy, one shot.

50,000 glowing particles, a warm gold core fading to cyan and magenta arms, a faint starfield, slow auto-rotate, mouse to orbit. The prettiest test — and the one where the gears stop agreeing.

"Build a mesmerising 3D spiral galaxy of 50,000 glowing particles that slowly auto-rotates, with a bright warm gold core fading to cyan and magenta arms, plus a faint starfield behind it. Let me orbit with the mouse. Use three.js from a CDN."
QUALITY · 98s
Quality spiral galaxy● LIVE
FAST ⚡ · 53s
Fast spiral galaxy● LIVE
NO-THINK ⚡ · 39s
No-think spiral galaxy● LIVE

drag to orbit · scroll to zoom · 50,000 live particles each

QUALITY
98s
8.9 KB · ran ✓
FAST ⚡
53s
8.6 KB · ran ✓
NO-THINK ⚡
39s
8.8 KB · ran ✓

What's on screen, plainly: this is where the builds diverge. Quality drew a defined multi-arm spiral — you can trace distinct magenta and cyan lanes winding out from the core. Fast and No-think both made a softer, diffuse spiral with a bright centre: pretty, but the arms blur together. Same prompt, visibly different result. Don't take my word for it — they're running side by side right above. Look at the three and judge for yourself.

✦ ✦ ✦
State the truth · what the nine builds show

The honest read. No scores from me.

I'm not handing you a winner out of ten. Here's what the measured data and the live builds actually show — you've seen all of it above.

Speed is real but noisy. The fast gears win the median. They also swing the hardest. Quality is the slowest and the steadiest. If you need a predictable wait, that matters more than a fast average.

No-think is not a turbo button. It came out quickest on the median here — but switching thinking off doesn't reliably speed K2.7 up. It reasons inline instead. The fast number is partly luck of the draw on the lane, not the switch doing magic.

On simple, mechanical builds, the gears tie. The solar system and the Breakout game came out the same across all three. You genuinely cannot tell which gear built which. For that kind of work, grab the fast gear and move on.

On builds that lean on taste, thinking shows. The galaxy is the tell. The thinking gears drew defined spiral arms. The diffuse one came from the fastest run. When detail and look are the whole point, the extra thinking is exactly what you're paying for — and you can see it with your own eyes.

So the truth is simple: there's no best gear. There's a right gear for the job. Which is the whole point of the framework.

The framework · The Gear Shift™

The Gear Shift™.

One model. Three gears. A dead-simple rule for which one to grab. Stop running everything at one speed — shift to match the job in front of you.

i.

Shift down → Quality

For the hero build. The thing that ships. Anything where the look or the detail is the whole point.

It thinks first, so it lands the polish. Slower — but rock-steady, 71–98s every time.

use for: the build that matters
ii.

Shift up → Fast

The daily driver. Iterating, prototyping, "just get me a working version."

HighSpeed with thinking still on — quicker when the lane's clear, a coin-flip when it's busy. Same code quality.

use for: most of your day
iii.

Floor it → No-think

Quick jobs. Throwaway tests. Simple utilities where polish doesn't matter.

HighSpeed, thinking off — fastest on the median. Don't ask it to think hard about a hard problem.

use for: get-it-out-the-door work

That's it. Three gears, one shift rule. You stop taking whatever one setting gives you, and you start driving.

Should you switch gears · the three beliefs holding you back

What stops people using all three.

Three things keep people stuck on one gear. All three are wrong. Here's the swap.

Wrong: "Faster mode means worse code. I'll just always run the careful one."

Right: On simple, mechanical builds the fast gears tied the careful one — same working result, less wait. The gap only showed up on the one build that leaned on taste. For most of your day, fast costs you nothing.

Wrong: "No-think is the turbo button. Switch thinking off and everything's faster."

Right: Switching thinking off doesn't reliably speed K2.7 up — it reasons inline instead. It won the median here partly on luck of the lane. Treat it as the terse gear for simple jobs, not a magic speed-up for hard ones.

Wrong: "One model has one speed. There's nothing to tune."

Right: One model, three gears, one toggle. The same brain can run careful-and-steady or fast-and-loose depending on the job. The people who learn to shift get the best of both — and never wait longer than they have to.

Don't take my word for it

3,200+ founders inside the Boardroom are already building with this stack. Their wins — real businesses, real results — are documented here.

Read the member wins doc →
The verdict · which gear, when

So which gear do you actually grab?

Speed says No-think. Steadiness says Quality. Reality says it depends what you're building. Here's the simple call.

Quality

The hero shot. The demo, the landing page, the build you'll ship. Anything where the look or the detail is the point. Worth the steady extra 20–40s.

Fast ⚡

The daily driver. Iterating and prototyping. When HighSpeed has capacity it's genuinely quicker with no quality drop. When it doesn't, you'll feel it — but the code's the same.

No-think ⚡

The terse one. Quick utilities, transforms, simple builds. Fastest on the median — just don't lean on it for a problem that needs real thought.

Do it yourself · the setup

How to get all three gears one click apart.

Two ways to run this. The command line for tinkerers, or the toggle in the Agent OS for everyone else.

The command line

Each gear is a model alias in your Kimi config. One line in the chat box flips the lane:

# shift down — careful build kimi -m kimi-code/kimi-for-coding "build the thing" # shift up — HighSpeed, thinking on kimi -m kimi-code/highspeed "build the thing" # floor it — HighSpeed, thinking off kimi -m kimi-code/highspeed-nothink "build the thing" # or a one-off HighSpeed run without changing your default khs "build the thing"

The Agent OS toggle

Inside the Agent OS, the same three gears live as a toggle right in the chat box — Quality, Fast, No-think. Click the gear, type the build, watch it preview live as it writes. No config, no restart, no command to remember. That's the version 3,200+ members run.

Read this before you quote any of it ↓
Run your own

Want all three gears in one dashboard?

All three Kimi gears live inside the Agent OS — a Quality / Fast / No-think toggle right in the chat box, builds previewing live as they write. Run your own head-to-heads, time them yourself, keep the winners.

Get the Agent OS →

link in the description ↗