Kubnal Bridge

Techniques & Methods

Prompt Injection

Prompt injection exploits the fact that LLMs cannot reliably distinguish between instructions from trusted sources (system prompts) and untrusted inputs (user data or web content). An attacker embeds instructions in retrieved content that override system-level directives.

Direct injection attacks user-controlled inputs; indirect injection hides instructions in external content the AI retrieves (web pages, documents). It is a critical security concern for AI applications that process external data.

Authority Links

Related Terms