"Max subscribers are hitting their 5 hour usage limits in 30-40 minutes with a s...

namelosw · 2026-02-06T04:55:38 1770353738

This hasn't been my experience either. I personally find the max plan is very generous for day-to-day usage. And I don't even use compact manually.

However, when I tried out the SuperPower skill and had multiple agents working on several projects at the same time, it did hit the 5-hour usage limit. But SuperPower hasn't been very useful for me and wastes a lot of tokens. When you want to trade longer running time for high token consumption, you only get a marginal increase in performance.

So people, if you are finding yourself using up tokens too quickly, you probably want to check your skills or MCPs etc.

bicepjai · 2026-02-06T16:46:58 1770396418

As a regular user, I hit these walls so often. I am experimenting with local model and open code. I am hoping to see some good results with qwen3 coder

mcast · 2026-02-06T02:29:44 1770344984

It's known that Anthropic's $20 Pro subscription is a gateway plan to their $100 Max subscription, since you'll easily burn your token rate on a single prompt or two. Meanwhile, I've had ample usage testing out Codex on the basic $20 ChatGPT Plus plan without a problem.

As for Anthropic's $100 Max subscription, it's almost always better to start new sessions for tasks since a long conversation will burn your 5-hour usage limit with just a few prompts (assuming they read many files). It's also best to start planning first with Claude, providing line numbers and exact file paths prior, and drilling down the requirements before you start any implementation.

deaux · 2026-02-06T03:07:43 1770347263

> It's known that Anthropic's $20 Pro subscription is a gateway plan to their $100 Max subscription, since you'll easily burn your token rate on a single prompt or two.

I genuinely have no idea what people mean when I read this kind of thing. Are you abusing the word "prompt" to mean "conversation"? Or are you providing a huge prompt that is meant to spawn 10 subagents and write multiple new full-stack features in one go?

For most users, the $20 Pro subscription, when used with Opus, does not hit the 5-hour limit on "a single prompt or two", i.e. 1-2 user messages.

pastel8739 · 2026-02-06T07:28:37 1770362917

Today I literally gave Claude a single prompt, asking it to make a plan to implement a relatively simple feature that spanned a couple different codebases. It churned for a long time, I asked a couple very simple follow up questions, and then I was out of tokens. I do not consider myself to be any kind of power user at all.

deaux · 2026-02-06T08:18:38 1770365918

The only time I've ever seen this happen is when you give it a massive codebase, without any meaningful CLAUDE.md to help make sense of it and no explicitly @ mentioning of files/folders to guide, and then ask it for something with huge cross-cutting.

> spanned a couple different codebases

There you go.

If you're looking to prevent this issue I really recommend you set up a number of AGENTS.md files, at least top-level and potentially nested ones for huge, sprawling subfolders. As well as @ mentioning the most relevant 2-3 things, even if it's folder level rather than file.

Not just for Claude, it greatly increases speed and reduces context rot for any model if they have to search less and more quickly understand where things live and how they work together.

visarga · 2026-02-06T10:55:04 1770375304

I have a tool that scans all code files in a repo and prints the symbols (AST based), it makes orienting around easy, it can be scoped to a file or folder.

fragmede · 2026-02-06T14:19:37 1770387577

> spanned a couple different codebases

It's either that, or you have a lot of skills loaded or something. I use Claude for hours a day and usually don't run out of tokens.

mcast · 2026-02-06T15:28:05 1770391685

I should note this only happens in Claude Code, not the web UI. Since CC is agentic and spawns subagents on prompts that require a lot of thinking.

It will spend a lot of time grokking the codebase, which would consume more tokens on larger projects.

Foobar8568 · 2026-02-06T06:52:28 1770360748

I am on $100 max subscription, and I rarely hit the limit, I used to but not anymore, but then again, I stopped building two products at the same time and concentrate to finish up the first/"easiest" one.

8cvor6j844qw_d6 · 2026-02-06T14:02:03 1770386523

I'm considering dropping the Max plan for the API.

Using the Max plan with tools like OpenClaw violates Anthropic's ToS [1].

The API gives you the same flexibility without the risk of getting your account suspended.

[1] https://www.anthropic.com/legal/consumer-terms (Section 3-7)

ipaddr · 2026-02-06T17:50:42 1770400242

At ten times the price. At least you keep the ability for them to charge you a hundred a month.

runako · 2026-02-06T03:25:22 1770348322

> you'll easily burn your token rate on a single prompt or two

My experience has been that I can usually work for a few hours before hitting a rate limit on the $20 subscription. My work time does not frequently overlap with core business hours in PDT, however. I wonder whether there is an aspect of this that is based on real-time dynamic usage.

cantalopes · 2026-02-06T08:51:36 1770367896

i never had these issues with gemini cli using google vertex endpoint, and i never even reached $50 per month

i don't want to think about how to hack a tool i'm paying for not locking me out because "i promped wrong"

daliusd · 2026-02-06T13:22:02 1770384122

I wonder what do you mean by "if you hit compact". Claude Code does not show used tokens.

ben_w · 2026-02-06T13:49:38 1770385778

When I used it before Christmas (free trial), it very visibly paused for a bit every so often, telling me that it was compressing/summarising its too-full context window.

I forget the exact phrasing, but it was impossible to miss unless you'd put everything in the equivalent of a Ralph loop and gone AFK or put the terminal in the background for extended periods.

bavell · 2026-02-06T16:21:14 1770394874

Run /usage or configure your statusline

novaleaf · 2026-02-06T14:47:10 1770389230

if you enable verbose mode, it does.

However I run like 3x concurrent sessions that do multiple compacts throughout, for like 8hrs/day, and I go through a 20x subscription in about 1/2 week. So I'm extremely skeptical of these negative claims.

Edit: However I stay on top of my prompting efficiency, maybe doing some incredibly wasteful task is... wasteful?

darqis · 2026-02-06T19:31:20 1770406280

It's my experience however.