AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total
2,077
Attack
809
Benchmark
603
Defense
272
Tool
226
Survey
113

Showing 481–500 of 598 papers

Clear filters
Benchmark LOW

Dynamic Evaluation for Oversensitivity in LLMs

Sophia Xiao Pu, Sitao Cheng, Xin Eric Wang +1 more

Oversensitivity occurs when language models defensively reject prompts that are actually benign. This behavior not only disrupts user interactions...

5 months ago cs.CL PDF
Benchmark LOW

VERA-MH Concept Paper

Luca Belli, Kate Bentley, Will Alexander +5 more

We introduce VERA-MH (Validation of Ethical and Responsible AI in Mental Health), an automated evaluation of the safety of AI chatbots used in mental...

5 months ago cs.CY cs.AI cs.HC PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial