I find it amusingly ironic that one comment under yours points out a mistake in the model output, while the other comment trusts that the output is correct but says it isn't "real reasoning" anyway because the model knows the algorithm. There's probably something to be said here about moving goalposts.
If both criteria A and B need to be satisfied for something to be true, it's not moving the goalposts for one person to point out that A is not true and for another person to point out that B is not true.
What will really bake your noodle is when you realize that just because the model's answer is wrong doesn't mean it didn't use reasoning to reach it.
Is your reasoning always perfect? No? Ever get partial credit on a test question in school? Yes? Well, maybe don't expect perfection from a model that didn't exist 5 years ago, that was considered impossible 10 years ago, and that would have gotten you burned as a witch 15 years ago.
Nobody claims that o3-pro is AGI, or even that it will lead to AGI.
Being able to manually write out hundreds of steps of the Towers of Hanoi problem is not a requirement for AGI, in much the same way that being able to manually multiply 50-digit numbers is not a requirement to be a successful mathematician.
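For context, the algorithm in question fits in a few lines. Here's a minimal recursive sketch in Python, purely for illustration (the function name and peg labels are my own); the point is that knowing it is trivial, while transcribing every move by hand is pure bookkeeping:

```python
def hanoi(n, source="A", target="C", spare="B"):
    """Print the moves that solve Towers of Hanoi for n disks.

    The solution always takes 2**n - 1 moves, so even a modest
    n = 10 means writing out 1023 individual steps.
    """
    if n == 0:
        return
    hanoi(n - 1, source, spare, target)   # park the top n-1 disks on the spare peg
    print(f"move disk {n} from {source} to {target}")
    hanoi(n - 1, spare, target, source)   # stack them back on top of the largest disk


hanoi(10)  # 1023 moves; the idea is ten lines, the transcript is a thousand
```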