Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's the RLHF training to make them squeaky clean and preternaturally helpful. Pretty sure without those filters and with the right fine-tuning you could have it reliably clone any writing style.


One only need to go to the dirtier corners of the llm forums to find some _very_ interesting voices there.

To quote someone from a tor bb board: my chat history is illegal in 142 countries and carries the death penalty in 9.


But without the RLHF aren’t they less useful “products”?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: