Attack MEDIUM
Zaixi Zhang, Souradip Chakraborty, Amrit Singh Bedi +16 more
The rapid adoption of generative artificial intelligence (GenAI) in the biosciences is transforming biotechnology, medicine, and synthetic biology....
5 months ago cs.CR q-bio.BM
PDF
Attack MEDIUM
Tiarnaigh Downey-Webb, Olamide Jogunola, Oluwaseun Ajao
This paper presents a systematic security assessment of four prominent Large Language Models (LLMs) against diverse adversarial attack vectors. We...
5 months ago cs.CR cs.AI cs.CY
PDF
Benchmark MEDIUM
Mohan Zhang, Yihua Zhang, Jinghan Jia +3 more
Modern large reasoning models (LRMs) exhibit impressive multi-step problem-solving via chain-of-thought (CoT) reasoning. However, this iterative...
5 months ago cs.LG cs.AI cs.CR
PDF
Benchmark MEDIUM
Shaolun Liu, Sina Marefat, Omar Tsai +4 more
GraphQL's flexible query model and nested data dependencies expose APIs to complex, context-dependent vulnerabilities that are difficult to uncover...
5 months ago cs.CR cs.SE
PDF
Benchmark MEDIUM
Zonghao Ying, Yangguang Shao, Jianle Gan +9 more
Large vision-language model (LVLM)-based web agents are emerging as powerful tools for automating complex online tasks. However, when deployed in...
5 months ago cs.CR cs.CV
PDF
Defense MEDIUM
Yuyi Huang, Runzhe Zhan, Lidia S. Chao +2 more
As large language models (LLMs) are increasingly deployed for complex reasoning tasks, Long Chain-of-Thought (Long-CoT) prompting has emerged as a...
Benchmark MEDIUM
Ines Altemir Marinas, Anastasiia Kucherenko, Alexander Sternfeld +1 more
The performance of Large Language Models (LLMs) is determined by their training data. Despite the proliferation of open-weight LLMs, access to LLM...
Benchmark MEDIUM
Yongding Tao, Tian Wang, Yihong Dong +4 more
Data contamination poses a significant threat to the reliable evaluation of Large Language Models (LLMs). This issue arises when benchmark samples...
5 months ago cs.CL cs.AI cs.LG
PDF
Defense MEDIUM
MingSheng Li, Guangze Zhao, Sichen Liu
Large Vision-Language Models (LVLMs) have achieved remarkable progress in multimodal perception and generation, yet their safety alignment remains a...
5 months ago cs.AI cs.CR
PDF
Other MEDIUM
Sicheol Sung, Joonghyuk Hahn, Yo-Sub Han
Regular expressions (regexes) are foundational to modern computing for critical tasks like input validation and data parsing, yet their ubiquity...
5 months ago cs.AI cs.PL
PDF
Benchmark MEDIUM
Xiaonan Si, Meilin Zhu, Simeng Qin +7 more
Retrieval-augmented generation (RAG) systems enhance large language models (LLMs) with external knowledge but are vulnerable to corpus poisoning and...
5 months ago cs.CL cs.AI
PDF
Attack MEDIUM
Brandon Lit, Edward Crowder, Daniel Vogel +1 more
AI chatbots are an emerging security attack vector, vulnerable to threats such as prompt injection, and rogue chatbot creation. When deployed in...
Benchmark MEDIUM
Debeshee Das, Luca Beurer-Kellner, Marc Fischer +1 more
The increasing adoption of LLM agents with access to numerous tools and sensitive data significantly widens the attack surface for indirect prompt...
5 months ago cs.CR cs.AI cs.LG
PDF
Attack MEDIUM
Abhishek K. Mishra, Antoine Boutet, Lucas Magnana
Large Language Models (LLMs) are increasingly deployed across multilingual applications that handle sensitive data, yet their scale and linguistic...
5 months ago cs.CL cs.CR
PDF
Attack MEDIUM
Aofan Liu, Lulu Tang
Vision-Language Models (VLMs) have garnered significant attention for their remarkable ability to interpret and generate multimodal content. However,...
5 months ago cs.CR cs.AI
PDF
Attack MEDIUM
Jiyang Qiu, Xinbei Ma, Yunqing Xu +2 more
The rapid deployment of large language model (LLM)-based agents in real-world applications has raised serious concerns about their trustworthiness....
Defense MEDIUM
Xiangtao Meng, Tianshuo Cong, Li Wang +4 more
Large Language Models (LLMs) have shown remarkable performance across various applications, but their deployment in real-world settings faces several...
Benchmark MEDIUM
Eric Hanchen Jiang, Weixuan Ou, Run Liu +8 more
Safety alignment of large language models currently faces a central challenge: existing alignment techniques often prioritize mitigating responses to...
5 months ago cs.LG cs.AI cs.CL
PDF
Survey MEDIUM
Man Hu, Xinyi Wu, Zuofeng Suo +5 more
With the rise of advanced reasoning capabilities, large language models (LLMs) are receiving increasing attention. However, although reasoning...
5 months ago cs.CR cs.AI
PDF
Survey MEDIUM
Chongyu Fan, Changsheng Wang, Yancheng Huang +2 more
Machine unlearning for large language models (LLMs) aims to remove undesired data, knowledge, and behaviors (e.g., for safety, privacy, or copyright)...
5 months ago cs.LG cs.CL
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial