AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total: 2,077
Attack: 809
Benchmark: 603
Defense: 272
Tool: 226
Survey: 113


Showing 201–220 of 603 papers


Benchmark MEDIUM

VoxPrivacy: A Benchmark for Evaluating Interactional Privacy of Speech Language Models

Yuxiang Wang, Hongyu Liu, Dekun Chen +2 more

As Speech Language Models (SLMs) transition from personal devices to shared, multi-user environments such as smart homes, a new challenge emerges:...

1 month ago eess.AS cs.AI cs.SD PDF

Benchmark LOW

Do Images Speak Louder than Words? Investigating the Effect of Textual Misinformation in VLMs

Chi Zhang, Wenxuan Ding, Jiale Liu +3 more

Vision-Language Models (VLMs) have shown strong multimodal reasoning capabilities on Visual-Question-Answering (VQA) benchmarks. However, their...

1 month ago cs.CL PDF

Benchmark MEDIUM

Malicious Repurposing of Open Science Artefacts by Using Large Language Models

Zahra Hashemi, Zhiqiang Zhong, Jun Pang +1 more

The rapid evolution of large language models (LLMs) has fuelled enthusiasm about their role in advancing scientific discovery, with studies exploring...

1 month ago cs.CL PDF

Benchmark MEDIUM

$α^3$-SecBench: A Large-Scale Evaluation Suite of Security, Resilience, and Trust for LLM-based UAV Agents over 6G Networks

Mohamed Amine Ferrag, Abderrahmane Lakas, Merouane Debbah

Autonomous unmanned aerial vehicle (UAV) systems are increasingly deployed in safety-critical, networked environments where they must operate...

1 month ago cs.CR cs.AI PDF

Benchmark MEDIUM

A Generative AI-Driven Reliability Layer for Action-Oriented Disaster Resilience

Geunsik Lim

As climate-related hazards intensify, conventional early warning systems (EWS) disseminate alerts rapidly but often fail to trigger timely protective...

1 month ago cs.AI cs.SI eess.SY PDF

Benchmark MEDIUM

From Transcripts to AI Agents: Knowledge Extraction, RAG Integration, and Robust Evaluation of Conversational AI Assistants

Krittin Pachtrachai, Petmongkon Pornpichitsuwan, Wachiravit Modecrua +1 more

Building reliable conversational AI assistants for customer-facing industries remains challenging due to noisy conversational data, fragmented...

1 month ago cs.CL PDF

Benchmark MEDIUM

MalURLBench: A Benchmark Evaluating Agents' Vulnerabilities When Processing Web URLs

Dezhang Kong, Zhuxi Wu, Shiqi Liu +8 more

LLM-based web agents have become increasingly popular for their utility in daily life and work. However, they exhibit critical vulnerabilities when...

1 month ago cs.CR cs.AI PDF

Benchmark HIGH

Prompt Injection Evaluations: Refusal Boundary Instability and Artifact-Dependent Compliance in GPT-4-Series Models

Thomas Heverin

Prompt injection evaluations typically treat refusal as a stable, binary indicator of safety. This study challenges that paradigm by modeling refusal...

1 month ago cs.CR PDF

Benchmark MEDIUM

An Effective and Cost-Efficient Agentic Framework for Ethereum Smart Contract Auditing

Xiaohui Hu, Wun Yu Chan, Yuejie Shi +5 more

Smart contract security is paramount, but identifying intricate business logic vulnerabilities remains a persistent challenge because existing...

1 month ago cs.CR PDF

Benchmark HIGH

Multi-Agent End-to-End Vulnerability Management for Mitigating Recurring Vulnerabilities

Zelong Zheng, Jiayuan Zhou, Xing Hu +2 more

Software vulnerability management has become increasingly critical as modern systems scale in size and complexity. However, existing automated...

2 months ago cs.SE PDF

Benchmark MEDIUM

Improving User Privacy in Personalized Generation: Client-Side Retrieval-Augmented Modification of Server-Side Generated Speculations

Alireza Salemi, Hamed Zamani

Personalization is crucial for aligning Large Language Model (LLM) outputs with individual user preferences and background knowledge....

2 months ago cs.CL cs.AI cs.CR PDF

Benchmark MEDIUM

Unintended Memorization of Sensitive Information in Fine-Tuned Language Models

Marton Szep, Jorge Marin Ruiz, Georgios Kaissis +4 more

Fine-tuning Large Language Models (LLMs) on sensitive datasets carries a substantial risk of unintended memorization and leakage of Personally...

2 months ago cs.LG cs.AI cs.CL PDF

Benchmark LOW

Beyond Outcome Verification: Verifiable Process Reward Models for Structured Reasoning

Massimiliano Pronesti, Anya Belz, Yufang Hou

Recent work on reinforcement learning with verifiable rewards (RLVR) has shown that large language models (LLMs) can be substantially improved using...

2 months ago cs.CL cs.AI PDF

Benchmark MEDIUM

SycoEval-EM: Sycophancy Evaluation of Large Language Models in Simulated Clinical Encounters for Emergency Care

Dongshen Peng, Yi Wang, Austin Schoeffler +2 more

Large language models (LLMs) show promise in clinical decision support yet risk acquiescing to patient pressure for inappropriate care. We introduce...

2 months ago cs.AI cs.HC PDF

Benchmark MEDIUM

NOIR: Privacy-Preserving Generation of Code with Open-Source LLMs

Khoa Nguyen, Khiem Ton, NhatHai Phan +6 more

Although boosting software development performance, large language model (LLM)-powered code generation introduces intellectual property and data...

2 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

Machine-Assisted Grading of Nationwide School-Leaving Essay Exams with LLMs and Statistical NLP

Andres Karjus, Kais Allkivi, Silvia Maine +3 more

Large language models (LLMs) enable rapid and consistent automated evaluation of open-ended exam responses, including dimensions of content and...

2 months ago cs.CL cs.AI PDF

Benchmark MEDIUM

Improving Methodologies for LLM Evaluations Across Global Languages

Akriti Vij, Benjamin Chua, Darshini Ramiah +43 more

As frontier AI models are deployed globally, it is essential that their behaviour remains safe and reliable across diverse linguistic and cultural...

2 months ago cs.AI PDF

Benchmark MEDIUM

TempoNet: Learning Realistic Communication and Timing Patterns for Network Traffic Simulation

Kristen Moore, Diksha Goel, Cody James Christopher +5 more

Realistic network traffic simulation is critical for evaluating intrusion detection systems, stress-testing network protocols, and constructing...

2 months ago cs.CR cs.AI cs.LG PDF

Benchmark LOW

Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models

Anmol Goel, Cornelius Emde, Sangdoo Yun +2 more

We identify a novel phenomenon in language models: benign fine-tuning of frontier models can lead to privacy collapse. We find that diverse, subtle...

2 months ago cs.CL PDF

Benchmark MEDIUM

Knowledge Restoration-driven Prompt Optimization: Unlocking LLM Potential for Open-Domain Relational Triplet Extraction

Xiaonan Jing, Gongqing Wu, Xingrui Zhuo +2 more

Open-domain Relational Triplet Extraction (ORTE) is the foundation for mining structured knowledge without predefined schemas. Despite the impressive...

2 months ago cs.CL cs.AI PDF
