SemanticScuttle - klotz.me » klotz: prompt injection

klotz: prompt injection*

Prompt Injection Detection and Mitigation via AI Multi-Agent NLP Frameworks

This paper introduces a multi-agent NLP framework to address prompt injection vulnerabilities in generative AI systems. The framework utilizes specialized agents for generating responses, sanitizing outputs, and enforcing policy compliance, evaluated using novel metrics like Injection Success Rate (ISR), Policy Override Frequency (POF), Prompt Sanitization Rate (PSR), and Compliance Consistency Score (CCS). The system employs OVON for inter-agent communication.

2025-03-17 Tags: conversational ai, prompt injection, agents, nlp, explainability, deborah a. dahl, ovon by klotz

LLMs’ Data-Control Path Insecurity

An analysis of Large Language Models' (LLMs) vulnerability to prompt injection attacks and potential risks when used in adversarial situations, like on the Internet. The author notes that, similar to the old phone system, LLMs are vulnerable to prompt injection attacks and other security risks due to the intertwining of data and control paths.

2024-06-18 Tags: llm, prompt injection, security, bruce schneier, acm by klotz

GitHub Copilot Chat: From Prompt Injection to Data Exfiltration

This post highlights how the GitHub Copilot Chat VS Code Extension was vulnerable to data exfiltration via prompt injection when analyzing untrusted source code.

2024-06-16 Tags: github, copilot, chat, prompt injection, llm, security, wunderwuzzi by klotz

Accidental prompt injection against RAG applications

Simon Willison explains an accidental prompt injection attack on RAG applications, caused by concatenating user questions with documentation fragments in a Retrieval Augmented Generation (RAG) system.

2024-06-06 Tags: llm, prompt injection, rag, simon willison by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: prompt injection*

Linked Tags

Related Tags