Oddbean new post about | logout
 How I Tricked Meta's AI Into Showing Me Nudes, Cocaine Recipes and Other Supposedly Censored Stuff

Despite safety claims, WhatsApp's new AI assistant powered by Llama 3.2 is easily fooled, revealing a lot of things it probably shouldn’t.

https://img.decrypt.co/insecure/rs:fill:1024:512:1:0/plain/https://cdn.decrypt.co/wp-content/uploads/2024/10/a1732_0-gID_7.png@png

https://decrypt.co/288187/how-tricked-metas-ai-showing-nudes-distill-cocaine-censored-stuff