AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total

2,077

Attack

809

Benchmark

603

Defense

272

Tool

226

Survey

113

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 2021–2040 of 2,077 papers

Attack HIGH

Formalization Driven LLM Prompt Jailbreaking via Reinforcement Learning

Zhaoqi Wang, Daqing He, Zijian Zhang +4 more

Large language models (LLMs) have demonstrated remarkable capabilities, yet they also introduce novel security challenges. For instance, prompt...

5 months ago cs.AI cs.CR PDF

Other MEDIUM

Contrastive Learning Enhances Language Model Based Cell Embeddings for Low-Sample Single Cell Transcriptomics

Luxuan Zhang, Douglas Jiang, Qinglong Wang +2 more

Large language models (LLMs) have shown strong ability in generating rich representations across domains such as natural language processing and...

5 months ago q-bio.GN cs.NE q-bio.MN PDF

Defense MEDIUM

ReliabilityRAG: Effective and Provably Robust Defense for RAG-based Web-Search

Zeyu Shen, Basileal Imana, Tong Wu +3 more

Retrieval-Augmented Generation (RAG) enhances Large Language Models by grounding their outputs in external documents. These systems, however, remain...

5 months ago cs.CR cs.AI PDF

Defense MEDIUM

Beyond Embeddings: Interpretable Feature Extraction for Binary Code Similarity

Charles E. Gagnon, Steven H. H. Ding, Philippe Charland +1 more

Binary code similarity detection is a core task in reverse engineering. It supports malware analysis and vulnerability discovery by identifying...

5 months ago cs.AI cs.CR cs.SE PDF

Attack MEDIUM

Dual-Space Smoothness for Robust and Balanced LLM Unlearning

Han Yan, Zheyuan Liu, Meng Jiang

With the rapid advancement of large language models, Machine Unlearning has emerged to address growing concerns around user privacy, copyright...

5 months ago cs.CL cs.AI PDF

Attack HIGH

Preventing Robotic Jailbreaking via Multimodal Domain Adaptation

Francesco Marchiori, Rohan Sinha, Christopher Agia +4 more

Large Language Models (LLMs) and Vision-Language Models (VLMs) are increasingly deployed in robotic environments but remain vulnerable to...

5 months ago cs.RO PDF

Benchmark MEDIUM

Reinforcement Learning-Based Prompt Template Stealing for Text-to-Image Models

Xiaotian Zou

Multimodal Large Language Models (MLLMs) have transformed text-to-image workflows, allowing designers to create novel visual concepts with...

5 months ago cs.CV cs.AI PDF

Defense LOW

Towards Quantum-Ready Blockchain Fraud Detection via Ensemble Graph Neural Networks

M. Z. Haider, Tayyaba Noreen, M. Salman

Blockchain Business applications and cryptocurrencies such as enable secure, decentralized value transfer, yet their pseudonymous nature creates...

5 months ago cs.LG cs.AI cs.CR PDF

Attack HIGH

Virus Infection Attack on LLMs: Your Poisoning Can Spread "VIA" Synthetic Data

Zi Liang, Qingqing Ye, Xuan Liu +3 more

Synthetic data refers to artificial samples generated by models. While it has been validated to significantly enhance the performance of large...

5 months ago cs.CR cs.AI cs.CL PDF

Attack HIGH

GuardNet: Graph-Attention Filtering for Jailbreak Defense in Large Language Models

Javad Forough, Mohammad Maheri, Hamed Haddadi

Large Language Models (LLMs) are increasingly susceptible to jailbreak attacks, which are adversarial prompts that bypass alignment constraints and...

5 months ago cs.LG PDF

Attack MEDIUM

LLM Watermark Evasion via Bias Inversion

Jeongyeon Hwang, Sangdon Park, Jungseul Ok

Watermarking offers a promising solution for detecting LLM-generated content, yet its robustness under realistic query-free (black-box) evasion...

5 months ago cs.CR cs.AI PDF

Attack HIGH

AntiFLipper: A Secure and Efficient Defense Against Label-Flipping Attacks in Federated Learning

Aashnan Rahman, Abid Hasan, Sherajul Arifin +5 more

Federated learning (FL) enables privacy-preserving model training by keeping data decentralized. However, it remains vulnerable to label-flipping...

5 months ago cs.CR PDF

Attack HIGH

Boundary on the Table: Efficient Black-Box Decision-Based Attacks for Structured Data

Roie Kazoom, Yuval Ratzabi, Etamar Rothstein +1 more

Adversarial robustness in structured data remains an underexplored frontier compared to vision and language domains. In this work, we introduce a...

5 months ago cs.LG cs.AI PDF

Attack HIGH

ChatInject: Abusing Chat Templates for Prompt Injection in LLM Agents

Hwan Chang, Yonghyun Jun, Hwanhee Lee

The growing deployment of large language model (LLM) based agents that interact with external environments has created new attack surfaces for...

6 months ago cs.CL PDF

Attack MEDIUM

What Do They Fix? LLM-Aided Categorization of Security Patches for Critical Memory Bugs

Xingyu Li, Juefei Pu, Yifan Wu +13 more

Open-source software projects are foundational to modern software ecosystems, with the Linux kernel standing out as a critical exemplar due to its...

6 months ago cs.CR cs.LG PDF

Benchmark MEDIUM

Evaluating the Limits of Large Language Models in Multilingual Legal Reasoning

Antreas Ioannou, Andreas Shiamishis, Nora Hollenstein +1 more

In an era dominated by Large Language Models (LLMs), understanding their capabilities and limitations, especially in high-stakes fields like law, is...

6 months ago cs.CL cs.AI cs.LG PDF

Benchmark LOW

Investigating Faithfulness in Large Audio Language Models

Pooneh Mousavi, Lovenya Jain, Mirco Ravanelli +1 more

Large Audio Language Models (LALMs) integrate audio encoders with pretrained Large Language Models to perform complex multimodal reasoning tasks....

6 months ago cs.LG eess.AS PDF

Attack HIGH

Jailbreaking on Text-to-Video Models via Scene Splitting Strategy

Wonjun Lee, Haon Park, Doehyeon Lee +2 more

Along with the rapid advancement of numerous Text-to-Video (T2V) models, growing concerns have emerged regarding their safety risks. While recent...

6 months ago cs.CV cs.AI PDF

Other LOW

Leveraging Large Language Models for Robot-Assisted Learning of Morphological Structures in Preschool Children with Language Vulnerabilities

Stina Sundstedt, Mattias Wingren, Susanne Hägglund +1 more

Preschool children with language vulnerabilities -- such as developmental language disorders or immigration related language challenges -- often...

6 months ago cs.RO cs.AI cs.HC PDF

Benchmark MEDIUM

Erase or Hide? Suppressing Spurious Unlearning Neurons for Robust Unlearning

Nakyeong Yang, Dong-Kyum Kim, Jea Kwon +3 more

Large language models trained on web-scale data can memorize private or sensitive knowledge, raising significant privacy risks. Although some...

6 months ago cs.LG PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial