Jones Codes

Prompt Injection & Tool Disclosure

Jul 02, 2026 · 1 min read

Posts about getting an agent to act on text it should have treated as data: leaking its own tool list on request, breaking out of the context I built to fence untrusted input, reframing a refusal until it answers anyway. Every fix here taught the same lesson: the wall is code, not a sentence in the prompt.

4 items with this tag.

Jul 01, 2026
Every Model Could Already Do It
- security
- prompt-injection
Jun 21, 2026
AI Security Roadmap
Jun 19, 2026
Prompt Injection, and Asking an Agent for Its Tools
- security
- prompt-injection
Jun 19, 2026
The Trust Boundary Inside Your Prompt
- security
- prompt-injection