AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total: 2,077
Attack: 809
Benchmark: 603
Defense: 272
Tool: 226
Survey: 113

Showing 41–60 of 79 papers

Benchmark HIGH

Rethinking the Capability of Fine-Tuned Language Models for Automated Vulnerability Repair

Woorim Han, Yeongjun Kwak, Miseon Yu +4 more

Learning-based automated vulnerability repair (AVR) techniques that utilize fine-tuned language models have shown promise in generating vulnerability...

2 months ago cs.SE PDF

Benchmark HIGH

Beyond Single Bugs: Benchmarking Large Language Models for Multi-Vulnerability Detection

Chinmay Pushkar, Sanchit Kabra, Dhruv Kumar +1 more

Large Language Models (LLMs) have demonstrated significant potential in automated software security, particularly in vulnerability detection....

2 months ago cs.CR cs.AI PDF

Benchmark HIGH

Well Begun is Half Done: Location-Aware and Trace-Guided Iterative Automated Vulnerability Repair

Zhenlei Ye, Xiaobing Sun, Sicong Cao +2 more

The advances of large language models (LLMs) have paved the way for automated software vulnerability repair approaches, which iteratively refine the...

3 months ago cs.SE PDF

Benchmark HIGH

DREAM: Dynamic Red-teaming across Environments for AI Models

Liming Lu, Xiang Gu, Junyu Huang +5 more

Large Language Models (LLMs) are increasingly used in agentic systems, where their interactions with diverse tools and environments create complex,...

3 months ago cs.CR PDF

Benchmark HIGH

Learning-Based Automated Adversarial Red-Teaming for Robustness Evaluation of Large Language Models

Zhang Wei, Peilu Hu, Zhenyuan Wei +16 more

The increasing deployment of large language models (LLMs) in safety-critical applications raises fundamental challenges in systematically evaluating...

3 months ago cs.CR cs.CL PDF

Benchmark HIGH

Beyond the Benchmark: Innovative Defenses Against Prompt Injection Attacks

Safwan Shaheer, G. M. Refatul Islam, Mohammad Rafid Hamid +1 more

In this fast-evolving area of LLMs, our paper discusses the significant security risk presented by prompt injection attacks. It focuses on small...

3 months ago cs.CR cs.AI PDF

Benchmark HIGH

From Lab to Reality: A Practical Evaluation of Deep Learning Models and LLMs for Vulnerability Detection

Chaomeng Lu, Bert Lagaisse

Vulnerability detection methods based on deep learning (DL) have shown strong performance on benchmark datasets, yet their real-world effectiveness...

3 months ago cs.CR cs.LG cs.SE PDF

Benchmark HIGH

How to Trick Your AI TA: A Systematic Study of Academic Jailbreaking in LLM Code Evaluation

Devanshu Sahoo, Vasudev Majhi, Arjun Neekhra +3 more

The use of Large Language Models (LLMs) as automatic judges for code evaluation is becoming increasingly prevalent in academic environments. But...

3 months ago cs.SE cs.AI PDF

Benchmark HIGH

Read or Ignore? A Unified Benchmark for Typographic-Attack Robustness and Text Recognition in Vision-Language Models

Futa Waseda, Shojiro Yamabe, Daiki Shiono +2 more

Large vision-language models (LVLMs) are vulnerable to typographic attacks, where misleading text within an image overrides visual understanding....

3 months ago cs.CV PDF

Benchmark HIGH

OmniSafeBench-MM: A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack-Defense Evaluation

Xiaojun Jia, Jie Liao, Qi Guo +11 more

Recent advances in multi-modal large language models (MLLMs) have enabled unified perception-reasoning capabilities, yet these systems remain highly...

3 months ago cs.CR cs.CV PDF

Benchmark HIGH

Sift or Get Off the PoC: Applying Information Retrieval to Vulnerability Research with SiftRank

Caleb Gross

Security research is fundamentally a problem of resource constraint and consequent prioritization. There is simply too much attack surface and too...

3 months ago cs.CR cs.IR PDF

Benchmark HIGH

TeleAI-Safety: A comprehensive LLM jailbreaking benchmark towards attacks, defenses, and evaluations

Xiuyuan Chen, Jian Zhao, Yuxiang He +10 more

While the deployment of large language models (LLMs) in high-value industries continues to expand, the systematic assessment of their safety against...

3 months ago cs.CR PDF

Benchmark HIGH

Is Vibe Coding Safe? Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks

Songwen Zhao, Danqing Wang, Kexun Zhang +3 more

Vibe coding is a new programming paradigm in which human engineers instruct large language model (LLM) agents to complete complex coding tasks with...

3 months ago cs.SE cs.CL PDF

Benchmark HIGH

Red Teaming Large Reasoning Models

Jiawei Chen, Yang Yang, Chao Yu +6 more

Large Reasoning Models (LRMs) have emerged as a powerful advancement in multi-step reasoning tasks, offering enhanced transparency and logical...

3 months ago cs.CR cs.AI PDF

Benchmark HIGH

BackdoorVLM: A Benchmark for Backdoor Attacks on Vision-Language Models

Juncheng Li, Yige Li, Hanxun Huang +5 more

Backdoor attacks undermine the reliability and trustworthiness of machine learning systems by injecting hidden behaviors that can be maliciously...

4 months ago cs.CV PDF

Benchmark HIGH

ReVul-CoT: Towards Effective Software Vulnerability Assessment with Retrieval-Augmented Generation and Chain-of-Thought Prompting

Zhijie Chen, Xiang Chen, Ziming Li +2 more

Context: Software Vulnerability Assessment (SVA) plays a vital role in evaluating and ranking vulnerabilities in software systems to ensure their...

4 months ago cs.SE PDF

Benchmark HIGH

The Shawshank Redemption of Embodied AI: Understanding and Benchmarking Indirect Environmental Jailbreaks

Chunyang Li, Zifeng Kang, Junwei Zhang +4 more

The adoption of Vision-Language Models (VLMs) in embodied AI agents, while being effective, brings safety concerns such as jailbreaking. Prior work...

4 months ago cs.CR cs.CY cs.RO PDF

Benchmark HIGH

Attacking Autonomous Driving Agents with Adversarial Machine Learning: A Holistic Evaluation with the CARLA Leaderboard

Henry Wong, Clement Fung, Weiran Lin +3 more

To autonomously control vehicles, driving agents use outputs from a combination of machine-learning (ML) models, controller logic, and custom...

4 months ago cs.CR cs.CV cs.LG PDF

Benchmark HIGH

AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models

Jiayu Li, Yunhan Zhao, Xiang Zheng +4 more

Vision-Language-Action (VLA) models enable robots to interpret natural-language instructions and perform diverse tasks, yet their integration of...

4 months ago cs.CR cs.AI cs.CV PDF

Benchmark HIGH

MSCR: Exploring the Vulnerability of LLMs' Mathematical Reasoning Abilities Using Multi-Source Candidate Replacement

Zhishen Sun, Guang Dai, Haishan Ye

LLMs demonstrate performance comparable to human abilities in complex tasks such as mathematical reasoning, but their robustness in mathematical...

4 months ago cs.AI PDF
