AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total

2,077

Attack

809

Benchmark

603

Defense

272

Tool

226

Survey

113

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 141–160 of 272 papers

Clear filters

Defense MEDIUM

Safety Alignment of LMs via Non-cooperative Games

Anselm Paulus, Ilia Kulikov, Brandon Amos +4 more

Ensuring the safety of language models (LMs) while maintaining their usefulness remains a critical challenge in AI alignment. Current approaches rely...

3 months ago cs.AI PDF

Defense MEDIUM

Elevating Intrusion Detection and Security Fortification in Intelligent Networks through Cutting-Edge Machine Learning Paradigms

Md Minhazul Islam Munna, Md Mahbubur Rahman, Jaroslav Frnda +2 more

The proliferation of IoT devices and their reliance on Wi-Fi networks have introduced significant security vulnerabilities, particularly the KRACK...

3 months ago cs.CR cs.LG PDF

Defense MEDIUM

R-GenIMA: Integrating Neuroimaging and Genetics with Interpretable Multimodal AI for Alzheimer's Disease Progression

Kun Zhao, Siyuan Dai, Yingying Zhang +9 more

Early detection of Alzheimer's disease (AD) requires models capable of integrating macro-scale neuroanatomical alterations with micro-scale genetic...

3 months ago cs.LG cs.AI PDF

Defense LOW

"Even GPT Can Reject Me": Conceptualizing Abrupt Refusal Secondary Harm (ARSH) and Reimagining Psychological AI Safety with Compassionate Completion Standard (CCS)

Yang Ni, Tong Yang

Large Language Models (LLMs) and AI chatbots are increasingly used for emotional and mental health support due to their low cost, immediacy, and...

3 months ago cs.CY cs.HC PDF

Defense MEDIUM

Rubric-Conditioned LLM Grading: Alignment, Uncertainty, and Robustness

Haotian Deng, Chris Farber, Jiyoon Lee +1 more

Automated short-answer grading (ASAG) remains a challenging task due to the linguistic variability of student responses and the need for nuanced,...

3 months ago cs.CL cs.LG PDF

Defense LOW

Emergent Learner Agency in Implicit Human-AI Collaboration: How AI Personas Reshape Creative-Regulatory Interaction

Yueqiao Jin, Roberto Martinez-Maldonado, Dragan Gašević +1 more

Generative AI is increasingly embedded in collaborative learning, yet little is known about how AI personas shape learner agency when AI teammates...

3 months ago cs.HC PDF

Defense LOW

Distributional AGI Safety

Nenad Tomašev, Matija Franklin, Julian Jacobs +2 more

AI safety and alignment research has predominantly been focused on methods for safeguarding individual AI systems, resting on the assumption of an...

3 months ago cs.AI PDF

Defense LOW

From Personalization to Prejudice: Bias and Discrimination in Memory-Enhanced AI Agents for Recruitment

Himanshu Gharat, Himanshi Agrawal, Gourab K. Patro

Large Language Models (LLMs) have empowered AI agents with advanced capabilities for understanding, reasoning, and interacting across diverse tasks....

3 months ago cs.AI cs.IR PDF

Defense MEDIUM

From Essence to Defense: Adaptive Semantic-aware Watermarking for Embedding-as-a-Service Copyright Protection

Hao Li, Yubing Ren, Yanan Cao +3 more

Benefiting from the superior capabilities of large language models in natural language understanding and generation, Embeddings-as-a-Service (EaaS)...

3 months ago cs.CR cs.CL PDF

Defense LOW

PediatricAnxietyBench: Evaluating Large Language Model Safety Under Parental Anxiety and Pressure in Pediatric Consultations

Vahideh Zolfaghari

Large language models (LLMs) are increasingly consulted by parents for pediatric guidance, yet their safety under real-world adversarial pressures is...

3 months ago cs.AI PDF

Defense MEDIUM

Cloud Security Leveraging AI: A Fusion-Based AISOC for Malware and Log Behaviour Detection

Nnamdi Philip Okonkwo, Lubna Luxmi Dhirani

Cloud Security Operations Center (SOC) enable cloud governance, risk and compliance by providing insights visibility and control. Cloud SOC triages...

3 months ago cs.CR cs.LG PDF

Defense MEDIUM

C-ing Clearly: Enhanced Binary Code Explanations using C code

Teodor Poncu, Ioana Pintilie, Marius Dragoi +2 more

Large Language Models (LLMs) typically excel at coding tasks involving high-level programming languages, as opposed to lower-level programming...

3 months ago cs.CL cs.LG PDF

Defense MEDIUM

Auto-Tuning Safety Guardrails for Black-Box Large Language Models

Perry Abdulkadir

Large language models (LLMs) are increasingly deployed behind safety guardrails such as system prompts and content filters, especially in settings...

3 months ago cs.CR cs.CL cs.LG PDF

Defense MEDIUM

Taint-Based Code Slicing for LLMs-based Malicious NPM Package Detection

Dang-Khoa Nguyen, Gia-Thang Ho, Quang-Minh Pham +5 more

Software supply chain attacks targeting the npm ecosystem have become increasingly sophisticated, leveraging obfuscation and complex logic to evade...

3 months ago cs.CR PDF

Defense MEDIUM

Super Suffixes: Bypassing Text Generation Alignment and Guard Models Simultaneously

Andrew Adiletta, Kathryn Adiletta, Kemal Derya +1 more

The rapid deployment of Large Language Models (LLMs) has created an urgent need for enhanced security and privacy measures in Machine Learning (ML)....

3 months ago cs.CR cs.AI PDF

Defense MEDIUM

Challenges of Evaluating LLM Safety for User Welfare

Manon Kempermann, Sai Suresh Macharla Vasu, Mahalakshmi Raveenthiran +2 more

Safety evaluations of large language models (LLMs) typically focus on universal risks like dangerous capabilities or undesirable propensities....

3 months ago cs.AI cs.CY PDF

Defense MEDIUM

Phishing Email Detection Using Large Language Models

Najmul Hasan, Prashanth BusiReddyGari, Haitao Zhao +3 more

Email phishing is one of the most prevalent and globally consequential vectors of cyber intrusion. As systems increasingly deploy Large Language...

3 months ago cs.CR cs.IR PDF

Defense MEDIUM

Black-Box Behavioral Distillation Breaks Safety Alignment in Medical LLMs

Sohely Jahan, Ruimin Sun

As medical large language models (LLMs) become increasingly integrated into clinical workflows, concerns around alignment robustness, and safety are...

3 months ago cs.LG PDF

Defense MEDIUM

Secure and Privacy-Preserving Federated Learning for Next-Generation Underground Mine Safety

Mohamed Elmahallawy, Sanjay Madria, Samuel Frimpong

Underground mining operations depend on sensor networks to monitor critical parameters such as temperature, gas concentration, and miner movement,...

3 months ago cs.CR cs.LG PDF

Defense HIGH

Llama-based source code vulnerability detection: Prompt engineering vs Fine tuning

Dyna Soumhane Ouchebara, Stéphane Dupont

The significant increase in software production, driven by the acceleration of development cycles over the past two decades, has led to a steady rise...

3 months ago cs.SE cs.AI cs.CR PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial