AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total

2,077

Attack

809

Benchmark

603

Defense

272

Tool

226

Survey

113

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 741–760 of 971 papers

Clear filters

Benchmark MEDIUM

On The Dangers of Poisoned LLMs In Security Automation

Patrick Karlsen, Even Eilertsen

This paper investigates some of the risks introduced by "LLM poisoning," the intentional or unintentional introduction of malicious or biased data...

4 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

ConneX: Automatically Resolving Transaction Opacity of Cross-Chain Bridges for Security Analysis

Hanzhong Liang, Yue Duan, Xing Su +5 more

As the Web3 ecosystem evolves toward a multi-chain architecture, cross-chain bridges have become critical infrastructure for enabling...

4 months ago cs.CR PDF

Other MEDIUM

Detecting Vulnerabilities from Issue Reports for Internet-of-Things

Sogol Masoumzadeh

Timely identification of issue reports reflecting software vulnerabilities is crucial, particularly for Internet-of-Things (IoT) where analysis is...

4 months ago cs.SE cs.AI cs.CR PDF

Other MEDIUM

Pay for The Second-Best Service: A Game-Theoretic Approach Against Dishonest LLM Providers

Yuhan Cao, Yu Wang, Sitong Liu +3 more

The widespread adoption of Large Language Models (LLMs) through Application Programming Interfaces (APIs) induces a critical vulnerability: the...

4 months ago cs.GT cs.AI PDF

Attack MEDIUM

ShadowLogic: Backdoors in Any Whitebox LLM

Kasimir Schulz, Amelia Kawasaki, Leo Ring

Large language models (LLMs) are widely deployed across various applications, often with safeguards to prevent the generation of harmful or...

4 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

Exploring and Mitigating Gender Bias in Encoder-Based Transformer Models

Ariyan Hossain, Khondokar Mohammad Ahanaf Hannan, Rakinul Haque +4 more

Gender bias in language models has gained increasing attention in the field of natural language processing. Encoder-based transformer models, which...

4 months ago cs.CL PDF

Defense MEDIUM

Reimagining Safety Alignment with An Image

Yifan Xia, Guorui Chen, Wenqian Yu +3 more

Large language models (LLMs) excel in diverse applications but face dual challenges: generating harmful content under jailbreak attacks and...

4 months ago cs.AI cs.CR PDF

Defense MEDIUM

Proactive DDoS Detection and Mitigation in Decentralized Software-Defined Networking via Port-Level Monitoring and Zero-Training Large Language Models

Mohammed N. Swileh, Shengli Zhang

Centralized Software-Defined Networking (cSDN) offers flexible and programmable control of networks but suffers from scalability and reliability...

4 months ago cs.CR cs.AI PDF

Attack MEDIUM

Diffusion LLMs are Natural Adversaries for any LLM

David Lüdke, Tom Wollschläger, Paul Ungermann +2 more

We introduce a novel framework that transforms the resource-intensive (adversarial) prompt optimization problem into an \emph{efficient, amortized...

4 months ago cs.LG stat.ML PDF

Survey MEDIUM

Prevalence of Security and Privacy Risk-Inducing Usage of AI-based Conversational Agents

Kathrin Grosse, Nico Ebert

Recent improvement gains in large language models (LLMs) have lead to everyday usage of AI-based Conversational Agents (CAs). At the same time, LLMs...

4 months ago cs.CR PDF

Attack MEDIUM

Measuring the Security of Mobile LLM Agents under Adversarial Prompts from Untrusted Third-Party Channels

Chenghao Du, Quanfeng Huang, Tingxuan Tang +3 more

Large Language Models (LLMs) have transformed software development, enabling AI-powered applications known as LLM-based agents that promise to...

4 months ago cs.CR PDF

Benchmark MEDIUM

Self-HarmLLM: Can Large Language Model Harm Itself?

Heehwan Kim, Sungjune Park, Daeseon Choi

Large Language Models (LLMs) are generally equipped with guardrails to block the generation of harmful responses. However, existing defenses always...

4 months ago cs.CL cs.AI PDF

Benchmark MEDIUM

Adapting Large Language Models to Emerging Cybersecurity using Retrieval Augmented Generation

Arnabh Borah, Md Tanvirul Alam, Nidhi Rastogi

Security applications are increasingly relying on large language models (LLMs) for cyber threat detection; however, their opaque reasoning often...

4 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

Reasoning Up the Instruction Ladder for Controllable Language Models

Zishuo Zheng, Vidhisha Balachandran, Chan Young Park +2 more

As large language model (LLM) based systems take on high-stakes roles in real-world decision-making, they must reconcile competing instructions from...

4 months ago cs.CL cs.AI PDF

Benchmark MEDIUM

Broken-Token: Filtering Obfuscated Prompts by Counting Characters-Per-Token

Shaked Zychlinski, Yuval Kainan

Large Language Models (LLMs) are susceptible to jailbreak attacks where malicious prompts are disguised using ciphers and character-level encodings...

4 months ago cs.CR cs.AI cs.CL PDF

Benchmark MEDIUM

SSCL-BW: Sample-Specific Clean-Label Backdoor Watermarking for Dataset Ownership Verification

Yingjia Wang, Ting Qiao, Xing Liu +3 more

The rapid advancement of deep neural networks (DNNs) heavily relies on large-scale, high-quality datasets. However, unauthorized commercial use of...

4 months ago cs.CR cs.AI PDF

Attack MEDIUM

PVMark: Enabling Public Verifiability for LLM Watermarking Schemes

Haohua Duan, Liyao Xiang, Xin Zhang

Watermarking schemes for large language models (LLMs) have been proposed to identify the source of the generated text, mitigating the potential...

4 months ago cs.CR cs.CL cs.LG PDF

Attack MEDIUM

PEEL: A Poisoning-Exposing Encoding Theoretical Framework for Local Differential Privacy

Lisha Shuai, Jiuling Dong, Nan Zhang +5 more

Local Differential Privacy (LDP) is a widely adopted privacy-protection model in the Internet of Things (IoT) due to its lightweight, decentralized,...

4 months ago cs.CR PDF

Defense MEDIUM

ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio-Language Models

Weifei Jin, Yuxin Cao, Junjie Su +5 more

Recent advances in Audio-Language Models (ALMs) have significantly improved multimodal understanding capabilities. However, the introduction of the...

4 months ago cs.SD cs.CR cs.LG PDF

Benchmark MEDIUM

LLMBisect: Breaking Barriers in Bug Bisection with A Comparative Analysis Pipeline

Zheng Zhang, Haonan Li, Xingyu Li +2 more

Bug bisection has been an important security task that aims to understand the range of software versions impacted by a bug, i.e., identifying the...

4 months ago cs.LG PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial