Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

"What do y'all think about these types of exploits, will LLM's always be vulnerable to this sort of attack?"

They release patches around this. GPT-4 has been about half as vulnerable to it. Downvoting helps to train it, and they do have an internal team specialised towards breaking it.

Part of the reason there is a very public release is to gather up all these exploits as well.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: