AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total

2,077

Attack

809

Benchmark

603

Defense

272

Tool

226

Survey

113

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 161–180 of 355 papers

Clear filters

Benchmark MEDIUM

Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements

Yiming Liang, Yizhi Li, Yantao Du +14 more

Benchmarks play a crucial role in tracking the rapid advancement of large language models (LLMs) and identifying their capability boundaries....

2 months ago cs.CL cs.AI PDF

Benchmark MEDIUM

PriceSeer: Evaluating Large Language Models in Real-Time Stock Prediction

Bohan Liang, Zijian Chen, Qi Jia +3 more

Stock prediction, a subject closely related to people's investment activities in fully dynamic and live environments, has been widely studied....

2 months ago q-fin.ST cs.LG PDF

Benchmark MEDIUM

Safe in the Future, Dangerous in the Past: Dissecting Temporal and Linguistic Vulnerabilities in LLMs

Muhammad Abdullahi Said, Muhammad Sammani Sani

As Large Language Models (LLMs) integrate into critical global infrastructure, the assumption that safety alignment transfers zero-shot from English...

2 months ago cs.CL cs.AI cs.CY PDF

Benchmark MEDIUM

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Zhe Huang, Hao Wen, Aiming Hao +6 more

Multimodal Large Language Models (MLLMs) have made remarkable progress in video understanding. However, they suffer from a critical vulnerability: an...

2 months ago cs.CV cs.AI PDF

Benchmark MEDIUM

Enhanced Web Payload Classification Using WAMM: An AI-Based Framework for Dataset Refinement and Model Evaluation

Heba Osama, Omar Elebiary, Youssef Qassim +4 more

Web applications increasingly face evasive and polymorphic attack payloads, yet traditional web application firewalls (WAFs) based on static rule...

2 months ago cs.CR PDF

Benchmark MEDIUM

It's a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents

Karolina Korgul, Yushi Yang, Arkadiusz Drohomirecki +7 more

Web-based agents powered by large language models are increasingly used for tasks such as email management or professional networking. Their reliance...

2 months ago cs.HC cs.AI cs.MA PDF

Benchmark MEDIUM

Casting a SPELL: Sentence Pairing Exploration for LLM Limitation-breaking

Yifan Huang, Xiaojun Jia, Wenbo Guo +4 more

Large language models (LLMs) have revolutionized software development through AI-assisted coding tools, enabling developers with limited programming...

3 months ago cs.CR cs.AI cs.SE PDF

Benchmark MEDIUM

LLM Swiss Round: Aggregating Multi-Benchmark Performance via Competitive Swiss-System Dynamics

Jiashuo Liu, Jiayun Wu, Chunjie Wu +5 more

The rapid proliferation of Large Language Models (LLMs) and diverse specialized benchmarks necessitates a shift from fragmented, task-specific...

3 months ago cs.LG cs.AI cs.PF PDF

Benchmark MEDIUM

Evasion-Resilient Detection of DNS-over-HTTPS Data Exfiltration: A Practical Evaluation and Toolkit

Adam Elaoumari

The purpose of this project is to assess how well defenders can detect DNS-over-HTTPS (DoH) file exfiltration, and which evasion strategies can be...

3 months ago cs.CR cs.AI cs.NI PDF

Benchmark MEDIUM

Optimistic TEE-Rollups: A Hybrid Architecture for Scalable and Verifiable Generative AI Inference on Blockchain

Aaron Chan, Alex Ding, Frank Chen +3 more

The rapid integration of Large Language Models (LLMs) into decentralized physical infrastructure networks (DePIN) is currently bottlenecked by the...

3 months ago cs.CR PDF

Benchmark MEDIUM

GuardEval: A Multi-Perspective Benchmark for Evaluating Safety, Fairness, and Robustness in LLM Moderators

Naseem Machlovi, Maryam Saleki, Ruhul Amin +5 more

As large language models (LLMs) become deeply embedded in daily life, the urgent need for safer moderation systems, distinguishing between naive from...

3 months ago cs.CL cs.AI cs.HC PDF

Benchmark MEDIUM

A Multi-Perspective Benchmark and Moderation Model for Evaluating Safety and Adversarial Robustness

Naseem Machlovi, Maryam Saleki, Ruhul Amin +5 more

As large language models (LLMs) become deeply embedded in daily life, the urgent need for safer moderation systems that distinguish between naive and...

3 months ago cs.CL cs.AI cs.HC PDF

Benchmark MEDIUM

Multi-Agent LLM Committees for Autonomous Software Beta Testing

Sumanth Bharadwaj Hachalli Karanam, Dhiwahar Adhithya Kennady

Manual software beta testing is costly and time-consuming, while single-agent large language model (LLM) approaches suffer from hallucinations and...

3 months ago cs.SE cs.AI cs.MA PDF

Benchmark MEDIUM

SecureCode: A Production-Grade Multi-Turn Dataset for Training Security-Aware Code Generation Models

Scott Thornton

AI coding assistants produce vulnerable code in 45\% of security-relevant scenarios~\cite{veracode2025}, yet no public training dataset teaches both...

3 months ago cs.CR cs.AI cs.CL PDF

Benchmark MEDIUM

Towards Benchmarking Privacy Vulnerabilities in Selective Forgetting with Large Language Models

Wei Qian, Chenxu Zhao, Yangyi Li +1 more

The rapid advancements in artificial intelligence (AI) have primarily focused on the process of learning from data to acquire knowledgeable learning...

3 months ago cs.LG cs.CR PDF

Benchmark MEDIUM

Attention Distance: A Novel Metric for Directed Fuzzing with Large Language Models

Wang Bin, Ao Yang, Kedan Li +5 more

In the domain of software security testing, Directed Grey-Box Fuzzing (DGF) has garnered widespread attention for its efficient target localization...

3 months ago cs.SE cs.AI PDF

Benchmark MEDIUM

Practical Framework for Privacy-Preserving and Byzantine-robust Federated Learning

Baolei Zhang, Minghong Fang, Zhuqing Liu +5 more

Federated Learning (FL) allows multiple clients to collaboratively train a model without sharing their private data. However, FL is vulnerable to...

3 months ago cs.CR cs.DC cs.LG PDF

Benchmark MEDIUM

MemoryGraft: Persistent Compromise of LLM Agents via Poisoned Experience Retrieval

Saksham Sahai Srivastava, Haoyu He

Large Language Model (LLM) agents increasingly rely on long-term memory and Retrieval-Augmented Generation (RAG) to persist experiences and refine...

3 months ago cs.CR cs.AI cs.LG PDF

Benchmark MEDIUM

BashArena: A Control Setting for Highly Privileged AI Agents

Adam Kaufman, James Lucassen, Tyler Tracy +2 more

Future AI agents might run autonomously with elevated privileges. If these agents are misaligned, they might abuse these privileges to cause serious...

3 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

Xuanjun Zong, Zhiqi Shen, Lei Wang +2 more

Large language models (LLMs) are evolving into agentic systems that reason, plan, and operate external tools. The Model Context Protocol (MCP) is a...

3 months ago cs.CL cs.AI PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial