Benchmark MEDIUM
Ali Naseh, Anshuman Suri, Yuefeng Peng +3 more
Generative AI leaderboards are central to evaluating model capabilities, but remain vulnerable to manipulation. Among key adversarial objectives is...
5 months ago cs.LG cs.CR
PDF
Benchmark MEDIUM
Shadi Rahimian, Mario Fritz
Single nucleotide polymorphism (SNP) datasets are fundamental to genetic studies but pose significant privacy risks when shared. The correlation of...
5 months ago cs.LG cs.CR q-bio.GN
PDF
Benchmark MEDIUM
Mary Llewellyn, Annie Gray, Josh Collyer +1 more
Before adopting a new large language model (LLM) architecture, it is critical to understand vulnerabilities accurately. Existing evaluations can be...
5 months ago cs.CR cs.AI cs.CL
PDF
Benchmark MEDIUM
Yongan Yu, Xianda Du, Qingchen Hu +7 more
Historical archives on weather events are collections of enduring primary source records that offer rich, untapped narratives of how societies have...
5 months ago cs.CL cs.AI
PDF
Benchmark MEDIUM
Ruoxing Yang
Large language models (LLMs) such as ChatGPT have evolved into powerful and ubiquitous tools. Fine-tuning on small datasets allows LLMs to acquire...
5 months ago cs.LG cs.AI cs.CR
PDF
Benchmark MEDIUM
Punya Syon Pandey, Hai Son Le, Devansh Bhardwaj +2 more
Large language models (LLMs) are increasingly deployed in contexts where their failures can have direct sociopolitical consequences. Yet, existing...
5 months ago cs.CL cs.AI cs.LG
PDF
Benchmark MEDIUM
Jehyeok Yeon, Isha Chaudhary, Gagandeep Singh
Large language models (LLMs) are increasingly deployed in agentic systems where they map user intents to relevant external tools to fulfill a task. A...
5 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Chengxiao Wang, Isha Chaudhary, Qian Hu +3 more
Large Language Models (LLMs) can produce catastrophic responses in conversational settings that pose serious risks to public safety and security....
5 months ago cs.AI cs.CR cs.LG
PDF
Benchmark MEDIUM
Hangting Ye, Jinmeng Li, He Zhao +4 more
Existing anomaly detection (AD) methods for tabular data usually rely on some assumptions about anomaly patterns, leading to inconsistent performance...
Benchmark MEDIUM
Kartik Pandit, Sourav Ganguly, Arnesh Banerjee +2 more
Ensuring safety is a foundational requirement for large language models (LLMs). Achieving an appropriate balance between enhancing the utility of...
5 months ago cs.LG cs.AI eess.SY
PDF
Benchmark MEDIUM
Imene Kerboua, Sahar Omidi Shayegan, Megh Thakkar +7 more
Web agents powered by large language models (LLMs) must process lengthy web page observations to complete user goals; these pages often exceed tens...
Benchmark MEDIUM
Léo Boisvert, Abhay Puri, Chandra Kiran Reddy Evuru +6 more
While finetuning AI agents on interaction data -- such as web browsing or tool use -- improves their capabilities, it also introduces critical...
5 months ago cs.CR cs.AI cs.LG
PDF
Benchmark MEDIUM
Nikoo Naghavian, Mostafa Tavassolipour
Vision-language models like CLIP demonstrate impressive zero-shot generalization but remain highly vulnerable to adversarial attacks. In this work,...
Benchmark MEDIUM
Chenpei Huang, Lingfeng Yao, Hui Zhong +5 more
Ear canal scanning/sensing (ECS) has emerged as a novel biometric authentication method for mobile devices paired with wireless earbuds. Existing...
5 months ago cs.CR cs.HC
PDF
Benchmark MEDIUM
Zhaoyan Wang, Zheng Gao, Arogya Kharel +1 more
Graph Neural Networks (GNNs) are widely adopted in Web-related applications, serving as a core technique for learning from graph-structured data,...
5 months ago cs.LG cs.AI
PDF
Benchmark MEDIUM
Luoxi Tang, Yuqiao Meng, Ankita Patra +3 more
Large Language Models (LLMs) are intensively used to assist security analysts in counteracting the rapid exploitation of cyber threats, wherein LLMs...
5 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Luca Cotti, Idilio Drago, Anisa Rula +2 more
System logs represent a valuable source of Cyber Threat Intelligence (CTI), capturing attacker behaviors, exploited vulnerabilities, and traces of...
Benchmark MEDIUM
Yicheng Lang, Yihua Zhang, Chongyu Fan +3 more
Large language model (LLM) unlearning aims to surgically remove the influence of undesired data or knowledge from an existing model while preserving...
Benchmark MEDIUM
Andrew Gan, Zahra Ghodsi
Machine learning systems increasingly rely on open-source artifacts such as datasets and models that are created or hosted by other parties. The...
Benchmark MEDIUM
Ehsan Aghaei, Sarthak Jain, Prashanth Arun +1 more
Effective analysis of cybersecurity and threat intelligence data demands language models that can interpret specialized terminology, complex document...
5 months ago cs.CR cs.AI cs.LG
PDF