Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

In my experience LLMs often have really solid insights in the thinking chains then vomit a nonsense score that doesn't make sense.

Now I'm not sure if this is actually an LLM only thing. Because I think people probably do similar when you ask them to give a number to things without providing a concrete grading rubric...

 help



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: