AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total

2,077

Attack

809

Benchmark

603

Defense

272

Tool

226

Survey

113

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 161–180 of 974 papers

Clear filters

Attack MEDIUM

The LLMbda Calculus: AI Agents, Conversations, and Information Flow

Zac Garby, Andrew D. Gordon, David Sands

A conversation with a large language model (LLM) is a sequence of prompts and responses, with each response generated from the preceding...

1 months ago cs.PL cs.AI cs.CR PDF

Attack MEDIUM

Agents of Chaos

Natalie Shapira, Chris Wendler, Avery Yen +35 more

We report an exploratory red-teaming study of autonomous language-model-powered agents deployed in a live laboratory environment with persistent...

1 months ago cs.AI cs.CY PDF

Attack MEDIUM

vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models

Xunzhuo Liu, Huamin Chen, Samzong Lu +27 more

As large language models (LLMs) diversify across modalities, capabilities, and cost profiles, the problem of intelligent request routing -- selecting...

1 months ago cs.NI cs.AI PDF

Tool MEDIUM

LLM-enabled Applications Require System-Level Threat Monitoring

Yedi Zhang, Haoyu Wang, Xianglin Yang +2 more

LLM-enabled applications are rapidly reshaping the software ecosystem by using large language models as core reasoning components for complex task...

1 months ago cs.CR cs.AI cs.SE PDF

Attack MEDIUM

Efficient Multi-Party Secure Comparison over Different Domains with Preprocessing Assistance

Kaiwen Wang, Xiaolin Chang, Yuehan Dong +1 more

Secure comparison is a fundamental primitive in multi-party computation, supporting privacy-preserving applications such as machine learning and data...

1 months ago cs.CR PDF

Benchmark MEDIUM

CIBER: A Comprehensive Benchmark for Security Evaluation of Code Interpreter Agents

Lei Ba, Qinbin Li, Songze Li

LLM-based code interpreter agents are increasingly deployed in critical workflows, yet their robustness against risks introduced by their code...

1 months ago cs.CR PDF

Benchmark MEDIUM

CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions

Jingwei Shi, Xinxiang Yin, Jing Huang +2 more

The evaluation of Large Language Models (LLMs) for code generation relies heavily on the quality and robustness of test cases. However, existing...

1 months ago cs.SE cs.AI cs.CR PDF

Tool MEDIUM

ILION: Deterministic Pre-Execution Safety Gates for Agentic AI Systems

Florin Adrian Chitan

The proliferation of autonomous AI agents capable of executing real-world actions - filesystem operations, API calls, database modifications,...

1 months ago cs.AI cs.CR PDF

Survey MEDIUM

LLM Scalability Risk for Agentic-AI and Model Supply Chain Security

Kiarash Ahi, Vaibhav Agrawal, Saeed Valizadeh

Large Language Models (LLMs) & Generative AI are transforming cybersecurity, enabling both advanced defenses and new attacks. Organizations now use...

1 months ago cs.CR PDF

Tool MEDIUM

AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems

Emmanuel Bamidele

Long-running LLM agents require persistent memory to preserve state across interactions, yet most deployed systems manage memory with age-based...

1 months ago cs.DC cs.AI cs.LG PDF

Benchmark MEDIUM

LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings

Abdullah Caglar Oksuz, Anisa Halimi, Erman Ayday

Membership inference attacks (MIAs) threaten the privacy of machine learning models by revealing whether a specific data point was used during...

1 months ago cs.LG cs.CR PDF

Defense MEDIUM

MANATEE: Inference-Time Lightweight Diffusion Based Safety Defense for LLMs

Chun Yan Ryan Kan, Tommy Tran, Vedant Yadav +4 more

Defending LLMs against adversarial jailbreak attacks remains an open challenge. Existing defenses rely on binary classifiers that fail when...

1 months ago cs.CR cs.AI cs.CL PDF

Attack MEDIUM

AndroWasm: an Empirical Study on Android Malware Obfuscation through WebAssembly

Diego Soi, Silvia Lucia Sanna, Lorenzo Pisu +2 more

In recent years, stealthy Android malware has increasingly adopted sophisticated techniques to bypass automatic detection mechanisms and harden...

1 months ago cs.CR PDF

Benchmark MEDIUM

Asking Forever: Universal Activations Behind Turn Amplification in Conversational LLMs

Zachary Coalson, Bo Fang, Sanghyun Hong

Multi-turn interaction length is a dominant factor in the operational costs of conversational LLMs. In this work, we present a new failure mode in...

1 months ago cs.LG cs.CR PDF

Benchmark MEDIUM

What Makes a Good LLM Agent for Real-world Penetration Testing?

Gelei Deng, Yi Liu, Yuekang Li +5 more

LLM-based agents show promise for automating penetration testing, yet reported performance varies widely across systems and benchmarks. We analyze 28...

1 months ago cs.CR cs.SE PDF

Survey MEDIUM

What Breaks Embodied AI Security:LLM Vulnerabilities, CPS Flaws,or Something Else?

Boyang Ma, Hechuan Guo, Peizhuo Lv +5 more

Embodied AI systems (e.g., autonomous vehicles, service robots, and LLM-driven interactive agents) are rapidly transitioning from controlled...

1 months ago cs.CR cs.AI PDF

Defense MEDIUM

Fail-Closed Alignment for Large Language Models

Zachary Coalson, Beth Sohler, Aiden Gabriel +1 more

We identify a structural weakness in current large language model (LLM) alignment: modern refusal mechanisms are fail-open. While existing approaches...

1 months ago cs.LG cs.CR PDF

Tool MEDIUM

Mind the GAP: Text Safety Does Not Transfer to Tool-Call Safety in LLM Agents

Arnold Cartagena, Ariane Teixeira

Large language models deployed as agents increasingly interact with external systems through tool calls--actions with real-world consequences that...

1 months ago cs.AI cs.SE PDF

Attack MEDIUM

DeepContext: Stateful Real-Time Detection of Multi-Turn Adversarial Intent Drift in LLMs

Justin Albrethsen, Yash Datta, Kunal Kumar +1 more

While Large Language Model (LLM) capabilities have scaled, safety guardrails remain largely stateless, treating multi-turn dialogues as a series of...

1 months ago cs.AI cs.ET cs.LG PDF

Defense MEDIUM

NeST: Neuron Selective Tuning for LLM Safety

Sasha Behrouzi, Lichao Wu, Mohamadreza Rostami +1 more

Safety alignment is essential for the responsible deployment of large language models (LLMs). Yet, existing approaches often rely on heavyweight...

1 months ago cs.CR cs.LG PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial