HackerNews中文版

我们构建了 SafeBrowse——一个用于 AI 系统的开源提示词注入防火墙。 SafeBrowse 不依赖于更完善的提示词，而是在不受信任的 Web 内容和大型语言模型（LLM）之间建立严格的安全边界。它在 AI 看到任何内容之前，就阻止隐藏指令、违反策略的行为和被污染的数据。功能： • 提示词注入检测（50+ 种模式） • 策略引擎（登录/支付屏蔽） • 默认安全设计 • 审计日志和请求 ID • Python SDK（同步 + 异步） • RAG 清理 PyPI: pip install safebrowse 我们正在寻求来自 AI 基础设施、安全和智能体构建者的反馈。

查看原文

We built SafeBrowse — an open-source prompt-injection firewall for AI systems.Instead of relying on better prompts, SafeBrowse enforces a hard security boundary between untrusted web content and LLMs.It blocks hidden instructions, policy violations, and poisoned data before the AI ever sees it.Features: • Prompt injection detection (50+ patterns) • Policy engine (login/payment blocking) • Fail-closed by design • Audit logs & request IDs • Python SDK (sync + async) • RAG sanitizationPyPI: pip install safebrowseLooking for feedback from AI infra, security, and agent builders.

Show HN：针对 AI Agent 和 RAG 管道的 Prompt 注入防火墙