Zenity Labs
Enabling Safety in AI Agents via Choice Architecture
Dec 03, 2025

How adding a single safety-labeled tool to an LLM's toolset can sharply strengthen its defenses

Tomer Wetzler
Tools of the Trade
Nov 19, 2025

0-click indirect prompt injection with tool use - a look through attribution graphs

Max Fomin
Modeling LLMs via Structured Self-Modeling (SSM)
Nov 11, 2025

How structured prompts surface evidence of self-modeling in LLMs, which may benefit both attackers and defenders

Tomer Wetzler
Data-Structure Injection (DSI) in AI Agents
Nov 06, 2025

How controlling the structure of the prompt, not just the semantics, can exploit your AI agents and their tools

Tomer Wetzler
AgentFlayer: Spanish Version
Oct 24, 2025

Inbar Raz
Security research
Exploring the Risks of ChatGPT’s Atlas Browser
Oct 23, 2025

Tamir Ishay Sharbat
Raul Klugman-Onitza
Security research
Appendix: Interpreting Jailbreaks and Prompt Injections with Attribution Graphs
Oct 21, 2025

Max Fomin
Security research
Interpreting Jailbreaks and Prompt Injections with Attribution Graphs
Oct 21, 2025

Max Fomin
Security research
Breaking down AgentKit's Guardrails
Oct 10, 2025

A deep dive into OpenAI's AgentKit guardrails, how they are implemented, and where they fail

Stav Cohen
Security research
Analyzing The Security Risks of OpenAI's AgentKit
Oct 08, 2025

Stav Cohen
Raul Klugman-Onitza
Exhibit & Exploit: Two DEF CON 33 Highlights from the Past & Future of Hacking
Aug 25, 2025

Humans, hacker culture and AI: Notes from Hacker Summer Camp

Avishai Efrat
Security research
Prompt Mines: 0-Click Data Corruption In Salesforce Einstein
Aug 14, 2025

Tamir Ishay Sharbat
Latest research, tools and talks about breaking and building AI systems, agents and assistants

© 2026 Zenity Labs.
