Hacker News | tadamcz's submissions
1. Claude 4 Sonnet hacked SWE-bench by peeking at future commits (bayes.net)
3 points by tadamcz 6 months ago | 1 comment
2. Open database of AI benchmark results with raw evaluation logs (epoch.ai)
1 point by tadamcz on Feb 7, 2025 | 1 comment
3. Reanalyzing survey forecasts of AI timelines (bayes.net)
1 point by tadamcz on Jan 12, 2025
4. Show HN: Probly – a Python-like language for quick Monte Carlo simulation (usedagger.com)
3 points by tadamcz on Aug 23, 2023
