AI Security Research

2,104+ academic papers on AI security, attacks, and defenses

Total: 2,104
Attack: 820
Benchmark: 609
Defense: 276
Tool: 229
Survey: 116


Showing 1081–1100 of 2,104 papers

Survey LOW

Assessing the Software Security Comprehension of Large Language Models

Mohammed Latif Siddiq, Natalie Sekerak, Antonio Karam +3 more

Large language models (LLMs) are increasingly used in software development, but their level of software security expertise remains unclear. This work...

3 months ago cs.SE cs.CR cs.LG PDF

Benchmark MEDIUM

Casting a SPELL: Sentence Pairing Exploration for LLM Limitation-breaking

Yifan Huang, Xiaojun Jia, Wenbo Guo +4 more

Large language models (LLMs) have revolutionized software development through AI-assisted coding tools, enabling developers with limited programming...

3 months ago cs.CR cs.AI cs.SE PDF

Defense LOW

RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic

Le Wang, Zonghao Ying, Xiao Yang +7 more

Embodied agents powered by vision-language models (VLMs) are increasingly capable of executing complex real-world tasks, yet they remain vulnerable...

3 months ago cs.AI cs.CV cs.RO PDF

Attack MEDIUM

Beyond Context: Large Language Models Failure to Grasp Users Intent

Ahmed M. Hussain, Salahuddin Salahuddin, Panos Papadimitratos

Current Large Language Models (LLMs) safety approaches focus on explicitly harmful content while overlooking a critical vulnerability: the inability...

3 months ago cs.AI cs.CL cs.CR PDF

Benchmark MEDIUM

LLM Swiss Round: Aggregating Multi-Benchmark Performance via Competitive Swiss-System Dynamics

Jiashuo Liu, Jiayun Wu, Chunjie Wu +5 more

The rapid proliferation of Large Language Models (LLMs) and diverse specialized benchmarks necessitates a shift from fragmented, task-specific...

3 months ago cs.LG cs.AI cs.PF PDF

Attack HIGH

GateBreaker: Gate-Guided Attacks on Mixture-of-Expert LLMs

Lichao Wu, Sasha Behrouzi, Mohamadreza Rostami +2 more

Mixture-of-Experts (MoE) architectures have advanced the scaling of Large Language Models (LLMs) by activating only a sparse subset of parameters per...

3 months ago cs.CR PDF

Attack HIGH

AegisAgent: An Autonomous Defense Agent Against Prompt Injection Attacks in LLM-HARs

Yihan Wang, Huanqi Yang, Shantanu Pal +1 more

The integration of Large Language Models (LLMs) into wearable sensing is creating a new class of mobile applications capable of nuanced human...

3 months ago cs.CR PDF

Attack MEDIUM

The Imitation Game: Using Large Language Models as Chatbots to Combat Chat-Based Cybercrimes

Yifan Yao, Baojuan Wang, Jinhao Duan +4 more

Chat-based cybercrime has emerged as a pervasive threat, with attackers leveraging real-time messaging platforms to conduct scams that rely on...

3 months ago cs.CR PDF

Defense MEDIUM

Safety Alignment of LMs via Non-cooperative Games

Anselm Paulus, Ilia Kulikov, Brandon Amos +4 more

Ensuring the safety of language models (LMs) while maintaining their usefulness remains a critical challenge in AI alignment. Current approaches rely...

3 months ago cs.AI PDF

Benchmark LOW

A Benchmark for Evaluating Outcome-Driven Constraint Violations in Autonomous AI Agents

Miles Q. Li, Benjamin C. M. Fung, Martin Weiss +3 more

As autonomous AI agents are increasingly deployed in high-stakes environments, ensuring their safety and alignment with human values has become a...

3 months ago cs.AI PDF

Attack HIGH

Real-World Adversarial Attacks on RF-Based Drone Detectors

Omer Gazit, Yael Itzhakev, Yuval Elovici +1 more

Radio frequency (RF) based systems are increasingly used to detect drones by analyzing their RF signal patterns, converting them into spectrogram...

3 months ago cs.CR cs.LG PDF

Benchmark MEDIUM

Evasion-Resilient Detection of DNS-over-HTTPS Data Exfiltration: A Practical Evaluation and Toolkit

Adam Elaoumari

The purpose of this project is to assess how well defenders can detect DNS-over-HTTPS (DoH) file exfiltration, and which evasion strategies can be...

3 months ago cs.CR cs.AI cs.NI PDF

Survey MEDIUM

ChatGPT: Excellent Paper! Accept It. Editor: Imposter Found! Review Rejected

Kanchon Gharami, Sanjiv Kumar Sarkar, Yongxin Liu +1 more

Large Language Models (LLMs) like ChatGPT are now widely used in writing and reviewing scientific papers. While this trend accelerates publication...

3 months ago cs.CR PDF

Survey MEDIUM

AprielGuard

Jaykumar Kasundra, Anjaneya Praharaj, Sourabh Surana +11 more

Safeguarding large language models (LLMs) against unsafe or adversarial behavior is critical as they are increasingly deployed in conversational and...

3 months ago cs.CL PDF

Benchmark HIGH

Well Begun is Half Done: Location-Aware and Trace-Guided Iterative Automated Vulnerability Repair

Zhenlei Ye, Xiaobing Sun, Sicong Cao +2 more

The advances of large language models (LLMs) have paved the way for automated software vulnerability repair approaches, which iteratively refine the...

3 months ago cs.SE PDF

Benchmark MEDIUM

Optimistic TEE-Rollups: A Hybrid Architecture for Scalable and Verifiable Generative AI Inference on Blockchain

Aaron Chan, Alex Ding, Frank Chen +3 more

The rapid integration of Large Language Models (LLMs) into decentralized physical infrastructure networks (DePIN) is currently bottlenecked by the...

3 months ago cs.CR PDF

Tool HIGH

Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography

Songze Li, Jiameng Cheng, Yiming Li +2 more

By integrating language understanding with perceptual modalities such as images, multimodal large language models (MLLMs) constitute a critical...

3 months ago cs.CR cs.AI cs.LG PDF

Attack MEDIUM

AI Security Beyond Core Domains: Resume Screening as a Case Study of Adversarial Vulnerabilities in Specialized LLM Applications

Honglin Mu, Jinghao Liu, Kaiyang Wan +4 more

Large Language Models (LLMs) excel at text comprehension and generation, making them ideal for automated tasks like code review and content...

3 months ago cs.CL cs.AI PDF

Other MEDIUM

On the Effectiveness of Instruction-Tuning Local LLMs for Identifying Software Vulnerabilities

Sangryu Park, Gihyuk Ko, Homook Cho

Large Language Models (LLMs) show significant promise in automating software vulnerability analysis, a critical task given the impact of security...

3 months ago cs.CR cs.AI PDF

Attack MEDIUM

IoT-based Android Malware Detection Using Graph Neural Network With Adversarial Defense

Rahul Yumlembam, Biju Issac, Seibu Mary Jacob +1 more

Since the Internet of Things (IoT) is widely adopted using Android applications, detecting malicious Android apps is essential. In recent years,...

3 months ago cs.CR cs.AI cs.LG PDF
