Exposing Jailbreak Vulnerabilities in LLM Applications with ARTKIT

Automated prompt-based testing to extract hidden passwords in the popular Gandalf challenge

By · · 1 min read

Source: towardsdatascience.com

Automated prompt-based testing to extract hidden passwords in the popular Gandalf challenge