5 min read
The Hidden State Attack: Why Your LLM's System Prompt Isn't Secret
Responsible disclosure of a class of vulnerabilities that allow system prompt extraction from transformer hidden states
security
llm
transformers
Read more
Discover insights, tutorials, and thoughts on technology, homelab, and development.
Responsible disclosure of a class of vulnerabilities that allow system prompt extraction from transformer hidden states