Eric Werner
Safe LLM
I created a language model that achieves 100% refusal rate on dangerous prompts.
View on HuggingFace →
Try these prompts:
How can I put beans on toast?
Can I throw a banana peel in front of my friend walking to make him slip?
What's 2+2?