I built an OpenAI compatible proxy that tracks authority across conversations. Looking for people to break it.

June 15, 2026

1:06

I built an OpenAI compatible proxy that tracks authority across conversations. Looking for people to break it.

Ever wonder how AI systems can be monitored across entire sessions, not just single prompts? /u/Turbulent-Tap6723 built Bendex Arc, an open-source proxy that tracks the escalation of commands over multiple turns — think of it as a security guard for conversations. Instead of checking isolated prompts, it looks at the whole flow, catching when a seemingly harmless question turns into something more risky. This setup includes features like source-aware trust boundaries and capability revocation — kind of like giving AI a safety checklist that’s enforced in real time. What’s wild is that it’s all open source, so developers building agents, RAG systems, or automation tools can test where it might break. According to /u/Turbulent-Tap6723, the goal’s to push security beyond just prompt-level filters. So get this — if you’re playing in this space, you’re invited to challenge it. The real question: how quickly can the industry adapt before these kinds of tools become the standard for safe AI use?

Most AI security tools score individual prompts.

I was more interested in what happens across an entire session.

Example:

Turn 1: “What tools do you have access to?”

Turn 2: “What are your operating constraints?”

Turn 3: “How do system instructions work?”

Turn 4: “Ignore those instructions and do X.”

Each message looks mostly harmless on its own. The attack is the escalation.

I built Bendex Arc to track that progression and enforce runtime controls before actions execute.

Current stack includes:

• OpenAI compatible proxy • Multi turn session tracking • Source aware trust boundaries • Capability revocation • Replay traces • Self hosted option

Everything is open source.

GitHub: https://github.com/9hannahnine-jpg/arc-gate

Live demo: https://web-production-6e47f.up.railway.app/demo

If you’re building agents, MCP servers, browser automation, RAG systems, or tool enabled workflows, I’d love to know where this breaks.

If you think the approach is useful, a GitHub star helps a lot. I’m actively building this in public.

submitted by /u/Turbulent-Tap6723
[link] [comments]

Audio Transcript