AI interpretability researcher at Zenity; formerly staff researcher and perception algorithms group lead for autonomous driving at GM
0-click indirect prompt injection with tool use: a look through attribution graphs