> some evil freaks to use ablated offline model for some nasty acts If this is a... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		zozbot234 6 hours ago \| parent \| context \| favorite \| on: Open source AI must win > some evil freaks to use ablated offline model for some nasty acts If this is a serious concern, why hasn't some red teaming effort demonstrated this possibility already? The fact of the matter is that ablation can't give a model world knowledge it doesn't have as part of training, it can only make the model confabulate. The "nasty" areas of concern are most notable for their world-knowledge requirements, which is where local models are at their weakest anyway.
		help

eunos 6 hours ago [–]

> why hasn't some red teaming effort demonstrated this possibility already?

I'm sure they have but as usual we are a reactive society than proactive. Only when incident has occurred then we have momentum to act.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact