Hacker Newsnew | past | comments | ask | show | jobs | submit | airspresso's commentslogin

Spend time building a test harness and evaluations of whether the solution meets the requirements. Then you don't need to look at the code because those other pieces will bring the necessary guarantees and trust.


I've never actually seen a test plan thorough enough to allow that kind of behaviour, and I doubt one exists.


Because we all prefer it over Gemini and Codex. Anthropic knows that and needs to get as much out of it as possible while they can. Not saying the others will catch up soon. But at some point other models will be as capable as Opus and Sonnet are now, and then it's easier to let price guide the choice of provider.


Still do. Great for workloads where it's okay to bundle a bunch of requests and wait some hours (up to 24h, usually done faster) for all of them to complete.


Thank you, was confused there for a second XD


You’re ignoring Jevons paradox. Everyone, both people and companies, will be making exponentially more software with these tools. Software that both needs to get created, debugged and updated to realize the intention of it. That’s what our time will be spent on as programmers.


Do you have any evidence that the demand for developers is largely price elastic?

People are already struggling to find work from oversupply of talent and not enough demand


At the same time ability to write software is exploding we are watching large entities in the market consolidated and small businesses end up on the down side of the K shaped economy. Programmers demand and pay should go down as supply increases just like every other person in an economy.


Apple has a solid hardware business and massive profits from their App Store tax, they are not dependent on ad business in the way Google is. Very different incentives.


They might not be dependent on ad revenue, but they are a greedy company that will not leave any money on the table. Next year, more ads are coming to the App Store that already generates a profit of over $10 billion/year: https://9to5mac.com/2025/12/17/apple-announces-more-ads-are-...


Sounds neat but what kind of range limits would that impose on each trip? Switching from one means of transportation to another, even if both are buses, increases the total travel time significantly. Not to mention all the hassle involved for passengers.


This 100%. It should be seen as critical infrastructure because of everything it can enable when run well.


That's how I read it XD "oh no, RL is dead too"


Sounds like you're fighting the weights. What would it take to align the setup with what the LLM expects?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: