Gemini CLI Review 2026: Pricing, Benchmarks, Community

Overview

Google quietly turned Gemini 3 Pro into a paid-only feature on March 25, 2026. The free tier is Flash-only now. If you were riding the free tier for the Pro-quality experience, that's gone.

What's left is still the most generous free tier in the category -- 1,000 Flash requests per day with a personal Google account -- and that's genuinely useful. Gemini 3.1 Pro is a real upgrade when you pay for it, competitive with Claude on a good day and a coin toss on a bad one. The community is pretty clear: good for free and prototyping, Claude still wins for production.

The rest of the package is solid: Jules for async background work, Conductor for Markdown-based project plans and automated reviews, Plan Mode on by default, native Google Workspace hooks. 102K GitHub stars, 100+ contributors, Apache 2.0. Actually open source, not "source-available open."

Key Differentiators

Biggest free tier in the category -- 1,000 Flash requests/day, no credit card. Pro models now cost money (since March 25, 2026).
Jules -- Async agent that clones the repo, works in a VM, submits PRs. Fire-and-forget for bugs and refactors.
Conductor -- Markdown plans + specs, automated reviews (guidelines, tests, security). Closest thing to spec-driven workflow in a vendor CLI.
1M token context -- Tied for largest in the category.
Apache 2.0, actually -- Permissive license, 100+ contributors. Qwen Code CLI forked from it.
Plan Mode on by default -- Read-only analysis before it touches your code.

Pricing

Plan	Price	Details
Free	$0	60 req/min, 1,000 req/day with personal Google account
Google AI Pro	Subscription	Higher quotas, bundled with Google One
Google AI Ultra	Subscription	Highest quotas
Pay-as-you-go	Variable	Via Google AI API pricing

Strengths

Cheapest real path into agentic coding -- nothing else beats 1,000 free Flash requests/day
Gemini 3.1 Pro is a real step up, competitive with Claude on most tasks
1M context handles large codebases without chunking gymnastics
Jules for background async work
Conductor for Markdown-driven project management and automated reviews
Native Google Workspace hooks (Gmail, Drive, Calendar)
Apache 2.0, active development, multiple releases a month
gVisor sandboxing by default, experimental LXC

Weaknesses

Gemini only -- no model swap
429s are the top complaint, and the March 25 "paid users get priority" fix hasn't cleaned it up; paying customers are still hitting the same errors
Quality is inconsistent -- "either great or garbage and it's a coin toss"
Tool-call and formatting errors in agent workflows
Pricing is a maze -- Google One AI Pro, Workspace, API pricing all overlap
Workspace plan users sometimes can't even access the latest models
Antigravity IDE is still unstable in preview
Verdict from the community: fine for free and prototyping, reach for Claude for production

Community Sentiment

What People Love

The free tier is absurd -- 60 req/min, 1,000 req/day, no card. People cancel competing subscriptions over it.
Google One bundle -- 2TB + AI Pro + NotebookLM + Antigravity + Gemini CLI. "I decided to test antigravity with Gemini 3.1 Pro and... it's really good! I would not put it above Opus but... it doesn't need to actually." -- u/razer54, r/Bard
Gemini 3.1 Pro quality jump -- Crosses the "good enough" threshold for personal coding (95 upvotes on launch thread).
Plan Mode on by default -- Read-only analysis before it touches code.
Actually open source -- Apache 2.0, 100+ contributors, Qwen Code CLI forked from it.

Common Complaints

429s everywhere -- "Constant 429s"; "servers are so overloaded that all models are practically not usable" -- r/Bard, multiple threads. The top complaint, bar none.
Tool calls break -- "Gemini often runs into tool call errors, formatting issues, or retries tasks multiple times" -- u/mugeshrao142, r/vibecoding
Coin-toss quality -- "Gemini however is either great or garbage and it's a coin toss" -- u/ZeidLovesAI, r/vibecoding
Pricing maze -- "there's so many different ways/plans to pay for gemini it's very confusing" -- u/Just_Lingonberry_352, r/Bard
Still behind Claude for agentic work -- "Gemini is good at a lot of things, agentic coding isn't one of them" -- u/mxforest, r/Bard

Notable Quotes

"I decided to test antigravity with Gemini 3.1 Pro and... it's really good! I would not put it above Opus but... it doesn't need to actually." -- u/razer54, r/Bard (65 upvotes)

"Gemini however is either great or garbage and it's a coin toss" -- u/ZeidLovesAI, r/vibecoding

"I have danced this dance before. You will be back to Claude before your subscription renews. Gemini is good at a lot of things, agentic coding isn't one of them." -- u/mxforest, r/Bard

"I hope they don't nerf it like they did with 3.0" -- u/cryptobrant, r/Bard

Performance Notes

A note on benchmarks: SWE-bench measures models and their scaffolding, not the CLI tools developers use. Gemini CLI as a product has never been submitted to SWE-bench. There is no widely-adopted benchmark for comparing coding agents head-to-head.

Community consensus: The free tier is the draw. Gemini 3.1 Pro is a real quality jump and crosses the "good enough" bar for personal coding. But quality is inconsistent -- "either great or garbage" -- and 429s remain the #1 complaint. Good for free and prototyping, Claude for production.

Recent Changes (2025-2026)

The big one

Pro models went paid-only (March 25, 2026) -- Free tier is Flash only now. Gemini 3 Pro / 3.1 Pro require a paid subscription (GitHub Discussion #22970). Paid accounts also got capacity priority -- though paid users still report 429s through rollout.

March-April 2026

v0.39.0-preview (April 14) -- Unified subagent tooling (single invoke_subagent replaces legacy tools); /memory inbox for reviewing extracted skills.
v0.38.2 (April 17) -- Context Compression Service, Persistent Policy Approvals (stop confirming the same tool every session), Background Process Monitoring.
v0.37.0 (April 8) -- Dynamic Sandbox Expansion with worktree support on Linux and Windows.
v0.36.0 (April 1) -- Native macOS/Windows sandboxing for subagents; native Git worktree support.
v0.35.0 (March 24) -- SandboxManager (bubblewrap/seccomp on Linux); customizable keybindings (Kitty protocol).
v0.34.0-preview.1 (Mar 12) -- Plan Mode on by default, experimental LXC sandbox, native gVisor.
v0.31.0 (Feb 27) -- Gemini 3.1 Pro Preview support, experimental browser agent.

Earlier

Conductor automated reviews (Mar 2026) -- Guideline enforcement, test validation, security scanning.
Jules extension -- Async agent delegation for background coding.
Gemini 3 Flash in CLI (late 2025) -- Brought the free tier up to competitive coding quality.
Google Workspace integration -- Gmail, Drive, Calendar natively in CLI.

Integration Ecosystem

MCP: Full server support, admin allowlisting, Google published Codelab ("Build an MCP Server with Gemini CLI and Go")
Extensions: Conductor (project mgmt + automated reviews), Jules (async agents), Google Workspace, Maestro (community multi-agent orchestration)
Extension directory: geminicli.com/extensions
IDE: Antigravity (Google's VS Code fork, unstable preview), VS Code integration
Context files: GEMINI.md for directory-specific instructions (analogous to CLAUDE.md)
Sandboxing: Docker with gVisor (runsc), experimental LXC/LXD containers
Skills: Experimental support in recent versions

CodeMySpec Integration

The free tier and Conductor make Gemini CLI a strong fit for anyone exploring spec-driven development without opening a wallet.

Context files: GEMINI.md is the native project-context file, same idea as CLAUDE.md. CodeMySpec generates GEMINI.md from specs, and Gemini CLI picks it up automatically.
MCP support: Full server support with admin allowlisting. CodeMySpec serves specs via MCP. Google even published a Codelab on building MCP servers -- the protocol investment is real.
Hooks support: No pre/post hook system. Conductor's automated reviews (guidelines, tests, security) are the closest thing and can carry spec-compliance checks.
Subagent support: Jules handles async autonomous work (clones repo, works in VM, submits PRs). Plan Mode research subagents can analyze a spec before anyone writes code. Thinner than Claude Code's Agent Teams but usable.
Skills/commands: Experimental skills support in recent versions. Conductor's Markdown plans line up philosophically with CodeMySpec -- specs feed directly into Conductor.
Memory/persistence: No persistent memory. GEMINI.md + MCP + Conductor's Markdown plans are your persistence layer.

Gemini CLI in 2026: Features, Pricing, Benchmarks, and Community Sentiment