Zenity Labs

Tomer Wetzler

Security researcher and engineer. I started my security journey researching open source software security and developing tools to scan and defend it. I then pivoted to full-scope research on cloud apps and identity, where I studied complex attacks and the use of various attack tools and created rules to mitigate them. Now I'm focusing on AI security, where I work to defend AI agents everywhere!

Moving The Decision Boundary of LLM Safety Classifiers
Jan 04, 2026

How a new fine-tuning approach can mitigate the problem of inaccurate safety paths

The Geometry of Safety Failures in Large Language Models
Dec 28, 2025

A deep dive into the activation space of prompts in safety classifiers, showing not why but where safety fails in LLM classifiers meant to detect malicious prompts.

Enabling Safety in AI Agents via Choice Architecture
Dec 03, 2025

How adding a single safety-labeled tool to an LLM's toolset can sharply improve its defenses

Modeling LLMs via Structured Self-Modeling (SSM)
Nov 11, 2025

How structured prompts reveal self-modeling in LLMs, with findings that may benefit both attackers and defenders

Data-Structure Injection (DSI) in AI Agents
Nov 06, 2025

How controlling the structure of the prompt, not just the semantics, can exploit your AI agents and their tools

Latest research, tools and talks about breaking and building AI systems, agents and assistants

© 2026 Zenity Labs.
