How Flask Creator Armin Ronacher Uses Claude Code [With Examples]

Armin Ronacher — An open-source developer known for the Python web framework Flask, the templating engine Jinja2, and the error-monitoring service Sentry, among others. He continually shares his insights on agentic development through his blog lucumr.pocoo.org.

Armin Ronacher is known for treating Claude Code (Claude CLI) as "a fast but instruction-faithful junior engineer," and for optimizing the environment itself for the agent's work loop. In this article, we dig into his practices, quoting concrete examples from his blog.

This article is our own research-based summary of Armin Ronacher's blog posts. The quotations are excerpts from the original posts, with sources listed at the end of the article. Please always check the original posts for the precise nuance of the wording.

1. A Setup That Is "Cheap, Isolated, and Full Throttle"

The first surprise is his sense of cost. He states clearly that he uses not the expensive Opus but only the Sonnet model on the $100-a-month Max plan.

I predominently use Claude Code with the cheaper Max subscription for $100 a month. I exclusively use the cheaper Sonnet model.
— Agentic Coding Recommendations (2025-06-12)

He then uses a claude-yolo alias that fully skips permission prompts (it is really claude --dangerously-skip-permissions), and neutralizes its danger with environment isolation via Docker. The idea is: "Instead of running full throttle in the open, run it inside a box where any damage stays contained."

# Run at full throttle without showing any permission prompts (conceptual)
alias claude-yolo='claude --dangerously-skip-permissions'

# But seal off accidents by confining the whole dev environment inside Docker

Caution: --dangerously-skip-permissions is a dangerous option, just as its name suggests. What he emphasizes is that "if you run at full throttle, always pair it with isolation (such as Docker)." It should not be used carelessly in production environments or in environments that hold credentials.

2. Logging Practices That Let the Agent "Self-Diagnose"

One of his especially practical tricks is to record all output to a file as well. This lets the agent re-read the logs and pin down the cause on its own.

I am not only printing the output to my terminal, I'm also always logging it to a file.
— Agentic Coding Recommendations (2025-06-12)

Specifically, he starts the services with make dev and makes the logs followable with make tail-log. For the process manager he uses a fork of shoreman, which writes a pidfile to prevent double startup. He even outputs the sign-in emails (confirmation links) to the logs so that the agent can complete the authentication flow by itself, and he documents that procedure in CLAUDE.md — that thorough.

# Output is also kept in a file with timestamps, so the agent can read it and self-diagnose
make dev        # Start services via the process manager (output also goes to a file)
make tail-log   # Check the latest logs

3. Writing "Code That an LLM Can Read Easily"

His most distinctive argument is the stance of designing code with the agent's ease of understanding as the top priority. His blog lists concrete rules (all quoted verbatim).

Prefer functions with clear, descriptive and longer than usual function names over classes.

Avoid inheritance and overly clever hacks.

Use plain SQL. I mean it.

Keep important checks local.
— Agentic Coding Recommendations (2025-06-12)

Longer, descriptive function names over classes — easy for both humans and agents to find with grep
Avoid inheritance and overly clever hacks
"Use plain SQL. I mean it." — trust the quality of the raw SQL the agent writes over an ORM
Keep important checks local — make permission and security checks visible right next to the code they relate to

The same logic runs through his language choices. He prefers Go for the backend, citing "explicit context.Context," "cached test results that make the iteration loop efficient," "structural interfaces that are easy for an LLM to interpret," and "strong backward compatibility with little churn in the ecosystem." And he notes that by 2026, for the same reasons, he had started reaching for TypeScript over Python.

I'm now routinely reaching for JavaScript in places where I would have used Python. Not because I love it or the ecosystem is better, but because the agent does much better with TypeScript.
— A Language For Agents (2026-02-09)

4. His Thesis: "Code (CLI) Over MCP"

He is skeptical of MCP (Model Context Protocol) servers and argues that agents should write and run code rather than call tools. The test he recommends to readers is this.

Try completing a GitHub task with the GitHub MCP, then repeat it with the gh CLI tool. You'll almost certainly find the latter uses context far more efficiently.
— Tools: Code Is All You Need (2025-07-03)

A large number of dedicated tools eats up context. In his words, "the more tools you add, the more you contribute to context rot." Instead, he argues, it is more composable and efficient to have the agent program against a single stateful "ubertool" (effectively a Python interpreter). It is an extension of the old maxim "replace yourself with a shell script" to delegation to LLM-generated code.

5. Publishing What "Didn't Work" Too

He frankly records not only his successes but also the techniques that had no effect ("Things that didn't work"). This is extremely valuable for practitioners.

Custom slash commands (such as /fix-bug, /commit, /add-tests) — they never stuck, because "it ends up being no different from pasting an issue URL" and "they don't fit my own style"
Hooks — "I tried hard to use them, but I saw no improvement in efficiency." Instead he steers tool selection with a PATH interceptor
Print mode (-p) — a good concept, but "slow and hard to debug"
Sub-agents — starting a new session or writing his thoughts out to Markdown often gave better results

In short, his style is to determine through experiments that "a feature touted as a best practice does not necessarily work for your own workflow." He also notes that by the end of 2025 he had shifted from Cursor-centric work to a largely hands-off, Claude Code-centric approach, describing himself as wanting to be "an engineering lead directing a virtual programmer intern rather than the one hitting the keyboard."

Our take: Ronacher's practices show that "polishing the loop (the environment)" matters more than "polishing the prompt." Logging to a file, writing code the agent can read easily, and setting up the CLI — these have been considered good design even before AI, and they tell us that good engineering translates directly into agent productivity.

Series: Learning Claude Code from Renowned Engineers

Overview: Learning Claude Code from Renowned Engineers
Armin Ronacher (this article)
Simon Willison — Designing the Agent Loop
Boris Cherny — How Claude Code's Own Developer Uses It
Mitchell Hashimoto — Harness Engineering

References

Armin Ronacher, "Agentic Coding Recommendations" (June 12, 2025) — https://lucumr.pocoo.org/2025/6/12/agentic-coding/
Armin Ronacher, "Tools: Code Is All You Need" (July 3, 2025) — https://lucumr.pocoo.org/2025/7/3/tools/
Armin Ronacher, "Your MCP Doesn't Need 30 Tools: It Needs Code" (August 18, 2025) — https://lucumr.pocoo.org/2025/8/18/code-mcps/
Armin Ronacher, "Things that didn't work" (July 30, 2025) — https://lucumr.pocoo.org/2025/7/30/things-that-didnt-work/
Armin Ronacher, "A Language For Agents" (February 9, 2026) — https://lucumr.pocoo.org/2026/2/9/a-language-for-agents/
Armin Ronacher, "A Year of Vibes" (December 22, 2025) — https://lucumr.pocoo.org/2025/12/22/a-year-of-vibes/

Frequently Asked Questions (FAQ)

Why does Armin Ronacher use claude-yolo together with Docker?

claude-yolo is an alias for claude --dangerously-skip-permissions, which runs the agent at full throttle without showing any permission prompts. Ronacher neutralizes the danger of this by isolating the environment with Docker, running everything inside a box where any damage stays contained. This lets him have both full speed and safety.

What does Armin Ronacher mean by code that an LLM can read easily?

It means code that follows rules such as: prefer longer, descriptive function names over classes; avoid inheritance and overly clever hacks; use plain SQL instead of an ORM; and keep important permission and security checks right next to the code they protect. The top priority is making it easy for the agent to find code with grep and to interpret it.

Why does Armin Ronacher recommend CLI over MCP?

Because a large number of dedicated tools eats up context and causes context rot. He points out that comparing the GitHub MCP with the gh CLI shows the latter uses context far more efficiently, and argues that having the agent write and run code is more composable and efficient than making it call tools.