Rubber Duck - Cross-model reviews in GitHub Copilot CLI

Flowtica Scribe

•3mo ago

Rubber Duck is a new experimental mode in GitHub Copilot CLI that uses a second model from a different AI family to review plans, implementations, and tests before the agent moves forward. It is designed to catch architectural mistakes, edge cases, and cross-file issues earlier.

Replies

Best

Flowtica Scribe

Hunter

📌

Hi everyone!

@Github Copilot CLI added Rubber Duck in experimental mode.

It triggers a second model (can be a different family) that reviews the main agent’s plan and output at key moments — after planning, after big implementations, before tests. Think of it as an independent rubber duck that actually talks back.

Early numbers are solid: Claude Sonnet + Rubber Duck closes 74.7% of the gap to Opus on SWE-Bench Pro, especially on complex multi-file work.

If you’re already living in Copilot CLI, just turn on /experimental and you’ll probably stay afloat🛟

Report

3mo ago

brag.fast

The good ol' Rubber Duck. Every developer knows how useful that can be (I use colleagues as rubber duckies sometimes tho, they don't always appreciate).

Report

3mo ago