a9284923-141a-434a-bfbb-52de7329861d d48d5a68-82cd-4988-b95c-c8c034003cd0 5c236e...

bcherny · 2026-04-06T23:02:57 1775516577

Thanks for the feedback IDs — read all 5 transcripts.

On the model behavior: your sessions were sending effort=high on every request (confirmed in telemetry), so this isn't the effort default. The data points at adaptive thinking under-allocating reasoning on certain turns: the specific turns where it fabricated (Stripe API version, git SHA suffix, apt package list) had zero reasoning emitted, while the turns with deep reasoning were correct. We're investigating with the model team. Interim workaround: CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING=1 forces a fixed reasoning budget instead of letting the model decide per-turn.

nayroclade · 2026-04-07T09:18:04 1775553484

Hey bcherny, I'm confused as to what's happening here. The linked issue was closed, with you seeming to imply there's no actual problem, people are just misunderstanding the hidden reasoning summaries and the change to the default effort level.

But here you seem to be saying there is a bug, with adaptive reasoning under-allocating. Is this a separate issue from the linked one? If not, wouldn't it help to respond to the linked issue acknowledging a model issue and telling people to disable adaptive reasoning for now? Not everyone is going to be reading comments on HN.

unsupp0rted · 2026-04-07T10:25:48 1775557548

It's better PR to close issues and tell users they're holding it wrong, and meanwhile quietly fix the issue in the background. Also possibly safer for legal reasons.

liamsfr · 2026-04-12T02:02:17 1775959337

Isn’t that what they just did here? Close Stella’s issue, cross-post to HN, then completely sidestep an observation users are making, and attack the analyst of the transcripts with a straw man blaming… thinking summaries…

kenmacd · 2026-04-07T13:28:47 1775568527

There's a 5 hour difference between the replies, and new data that came in, so the posts aren't really in conflict.

Also, it doesn't sound like they know "there's a model issue", so reopening it now would be premature. Maybe they just read it wrong; better to let a few others verify first, then reopen.

diavelguru · 2026-04-07T01:04:32 1775523872

Love this. Responding to users. Detailed info. Investigating. Action being taken (at least it seems so).

gilrain · 2026-04-07T12:21:38 1775564498

And all hidden in the comments of a niche forum, while the actual issue is closed and whitewashed? You got played.

jojobas · 2026-04-07T07:24:24 1775546664

Surely you realize it's AI responding? (not sure if /s)

allisdust · 2026-04-07T11:35:30 1775561730

I cannot provide the session IDs, but I have tried the above flag and can confirm it makes a huge difference. You should treat this as a bug and make this the default behavior. Clearly the adaptive thinking is making the model plain stupid and useless. It is time you guys take this seriously and stop messing with the performance with every damn release.

JamesSwift · 2026-04-07T14:33:54 1775572434

Just set that flag and I'm already getting similarly poor results. New one: 93b9f545-716c-4335-b216-bf0c758dff7c

JamesSwift · 2026-04-07T19:42:52 1775590972

And another where Claude gets into a long cycle of "wait that's not right.. hold on... actually..." correcting itself in its train of thought. It found the answer eventually but wasted a lot of cycles getting there (reporting because this is a regression in my experience vs a couple weeks ago): 28e1a9a2-b88c-4a8d-880f-92db0e46ffe8

JamesSwift · 2026-04-08T16:05:06 1775664306

Another: 1395b7d6-f2f1-4e24-a815-73852bcdeed2

It fails to answer my initial question and tells me what I need to do to check. Then it hallucinates the answer without researching anything, then it comes to an inaccurate conclusion, and only when I prompt it further does it finally reach a (maybe) correct answer.

I haven't submitted a few more, but I think it's safe to say that disabling adaptive thinking isn't the answer here.

tomaskafka · 2026-04-08T06:24:19 1775629459

My guess is there isn't enough hardware, so Anthropic is trying to limit how much soup the buffet serves. Did I guess right? And I would absolutely bet the enterprise accounts with millions in spend get priority, while retail will be first to get throttled.

onoesworkacct · 2026-04-07T02:09:07 1775527747

This kind of thing is harder for regular end-users to understand following the change removing reasoning details.

mangatmodi · 2026-04-07T08:13:18 1775549598

I am curious. Are you able to see our session text based on the session ID? That was a big no in some of the tier-1 places I worked. No employee could see user texts.

rkangel · 2026-04-07T11:00:01 1775559601

IIRC for Enterprise, using /feedback or /bug is an exception to the "we promise not to use your data" agreement.

gilrain · 2026-04-07T12:19:21 1775564361

> The data points at adaptive thinking under-allocating reasoning on certain turns

Will you reopen the issue you incorrectly closed, then…? Or are you just playacting concern?

alexchen_dev · 2026-04-07T02:20:21 1775528421

[flagged]

pcjones1 · 2026-04-07T13:40:15 1775569215

Have you set effort to high or max?

ghusbands · 2026-04-07T14:55:27 1775573727

Even with high effort, the adaptive thinking can just choose no thinking. See bcherny's post they were replying to: https://news.ycombinator.com/item?id=47668520

pcjones1 · 2026-04-08T09:13:12 1775639592

Yeah, I know, but you can disable it, as we saw.