Question 1

How does Terse reduce Claude Code token costs?

Accepted Answer

Terse auto-detects Claude Code sessions via process scanning and attaches a floating optimizer bar. It compresses every prompt you type (removing filler, typos, and redundancy), detects when Claude Code makes duplicate tool calls, flags redundant file reads, and tracks per-turn token costs in real time. In a typical 2-hour session, this reduces total token usage by 40-89%.

Question 2

Does Terse work with Claude Code automatically?

Accepted Answer

Yes. Terse scans running processes every 5 seconds and auto-detects Claude Code as soon as it launches. A floating optimization bar appears above your terminal. You do not need to configure anything — detection is fully automatic.

Question 3

What is CLAUDE.md rule generation?

Accepted Answer

After monitoring a Claude Code session, Terse analyzes patterns — repeated tool calls, common prompt structures, redundant context — and generates CLAUDE.md rules that prevent these inefficiencies in future sessions. Placing these rules in your project's CLAUDE.md file trains Claude Code to behave more efficiently from the start of each session.

Question 4

How much does a Claude Code session cost without Terse?

Accepted Answer

A typical 2-hour Claude Code session with Claude Opus generates approximately 210,000 tokens of CLI output noise (file reads, tool calls, repeated context). At Claude Opus pricing of $15-$75 per million tokens, that is $3–$15 per session. Heavy users doing 3-4 sessions per day spend $200-500/month. Terse compresses this output by 40-89%, directly reducing that cost.

Question 5

Does Terse intercept or send my Claude Code prompts anywhere?

Accepted Answer

No. Terse is 100% on-device. It reads text from your terminal using macOS Accessibility APIs, processes it locally using a Rust engine, and writes the optimized text back to the same terminal. No text is sent to any server. Your code and prompts never leave your machine.

Question 6

What is a duplicate tool call in Claude Code?

Accepted Answer

A duplicate tool call happens when Claude Code reads the same file multiple times in one session, or runs the same search query twice. Each tool call consumes tokens for the call itself plus the result in context. Terse's Agent Monitor detects these patterns in real time and alerts you, so you can interrupt and redirect Claude before the wasted tokens accumulate.

Cut Your Claude Code Bill by 40–89%

How Terse Works with Claude Code

Auto Session Detection

Prompt Compression

Duplicate Tool Call Detection

Redundant File Read Flagging

CLAUDE.md Rule Generation

Per-Turn Cost Tracking

The Claude Code Token Problem

Frequently Asked Questions

Also Works With

Stop Burning Tokens on Noise