Zenity Labs logo
Zenity Labs
AI Agent Security Summit (On Demand)Join Us
Subscribe
  • Zenity Labs
  • Archive
  • Page 3
Security researchSecurity research
Appendix: Interpreting Jailbreaks and Prompt Injections with Attribution Graphs
Oct 21, 2025

Appendix: Interpreting Jailbreaks and Prompt Injections with Attribution Graphs

Max Fomin
Max Fomin
Security researchSecurity research
Interpreting Jailbreaks and Prompt Injections with Attribution Graphs
Oct 21, 2025

Interpreting Jailbreaks and Prompt Injections with Attribution Graphs

Max Fomin
Max Fomin
Security researchSecurity research
Breaking down AgentKit's Guardrails
Oct 10, 2025

Breaking down AgentKit's Guardrails

A deep dive into OpenAI's AgentKit guardrails, how they are implemented, and where they fail

Stav Cohen
Stav Cohen
Security researchSecurity research
Analyzing The Security Risks of OpenAI's AgentKit
Oct 08, 2025

Analyzing The Security Risks of OpenAI's AgentKit

Stav Cohen
Raul Klugman-Onitza
Stav Cohen, +1
Exhibit & Exploit: Two DEF CON 33 Highlights from the Past & Future of Hacking
Aug 25, 2025

Exhibit & Exploit: Two DEF CON 33 Highlights from the Past & Future of Hacking

Humans, hacker culture and AI: Notes from Hacker Summer Camp

Avishai Efrat
Avishai Efrat
Security researchSecurity research
Prompt Mines: 0-Click Data Corruption In Salesforce Einstein
Aug 14, 2025

Prompt Mines: 0-Click Data Corruption In Salesforce Einstein

Tamir Ishay Sharbat
Tamir Ishay Sharbat
Security researchSecurity research
AgentFlayer: Minimum Clicks, Maximum Leaks: Tilling ChatGPT’s Attack Surface
Aug 08, 2025

AgentFlayer: Minimum Clicks, Maximum Leaks: Tilling ChatGPT’s Attack Surface

Exploiting ChatGPT with Language Alone: A Deep Dive into 0Click and 1Click Attacks.

Dmitry Lozovoy
Dmitry Lozovoy
Security researchSecurity research
AgentFlayer: ChatGPT Connectors 0click Attack
Aug 06, 2025

AgentFlayer: ChatGPT Connectors 0click Attack

Tamir Ishay Sharbat
Tamir Ishay Sharbat
AI Enterprise Compromise - 0click Exploit Methods
Aug 06, 2025

AI Enterprise Compromise - 0click Exploit Methods

Michael Bargury
Michael Bargury
Security researchSecurity research
AgentFlayer: When a Jira Ticket Can Steal Your Secrets
Aug 01, 2025

AgentFlayer: When a Jira Ticket Can Steal Your Secrets

TL;DR: A 0click attack through a malicious Jira ticket can cause Cursor to exfiltrate secrets from the repository or local file system.

Marina Simakov
Marina Simakov
Why Aren’t We Making Any Progress In Security From AI
Jul 31, 2025

Why Aren’t We Making Any Progress In Security From AI

Guardrails Are Soft Boundaries. Hard Boundaries Do Exist.

Michael Bargury
Michael Bargury
Reconstructing a timeline for Amazon Q prompt infection
Jul 30, 2025

Reconstructing a timeline for Amazon Q prompt infection

How a rogue GitHub commit, automation missteps, and a deceptive AI assistant led to one of the most bizarre prompt injection cases in recent memory.

Michael Bargury
Michael Bargury
FirstBack
1234567
Next Last
Latest research, tools and talks about breaking and building AI systems, agents and assistants

Zenity Labs

Latest research, tools and talks about breaking and building AI systems, agents and assistants

Home

Posts

Authors

© 2026 Zenity Labs.

Privacy policy

Terms of use

Powered by beehiiv