Skip to main content
Registration has reached capacity. Join the waitlist

All Accepted Papers

Securing Agents With Tracked Capabilities

Martin Odersky (EPFL), Yaoyu Zhao (EPFL), Yichen Xu (EPFL), Oliver Bračevac (EPFL), Cao Nguyen Pham (EPFL)

Security & Privacy Architectural Patterns & Composition

A type-system-based safety harness for AI agents that uses Scala 3's capture checking to statically track which resources and effects an agent can access, preventing prompt injection, data leakage, and unintended side effects at the programming-language level rather than at runtime heuristics.

Presentation

Talk

Paper Session 5: Security & Governance

Thursday, May 28 · 11:40 AM – 11:50 AM

Bayshore Ballroom

Poster

Thursday, May 28 · 4:30 PM – 6:00 PM

Carmel

Abstract

AI agents that interact with the real world through tool calls pose fundamental safety challenges: agents might leak private information, cause unintended side effects, or be manipulated through prompt injection. To address these challenges, we propose to put the agent in a programming-language-based “safety harness”: instead of calling tools directly, agents express their intentions as code in a capability-safe language, Scala 3 with capture checking. Capabilities are program variables that regulate access to effects and resources of interest. Scala’s type system tracks capabilities statically, providing fine-grained control over what an agent can do. In particular, it enables local purity, the ability to enforce that sub-computations are side-effect-free, preventing information leakage when agents process classified data. We demonstrate that extensible agent safety harnesses can be built by leveraging a strong type system with tracked capabilities. Our experiments show that agents can generate capability-safe code with no significant loss in task performance, while the type system reliably prevents unsafe behaviors such as information leakage and malicious side effects.

ACM CAIS 2026 Sponsors