Attack HIGH
Zihan Wang, Guansong Pang, Wenjun Miao +2 more
Recent advances in Large Visual Language Models (LVLMs) have demonstrated impressive performance across various vision-language tasks by leveraging...
Benchmark LOW
Francis Rhys Ward, Teun van der Weij, Hanna Gábor +6 more
AI systems are increasingly able to autonomously conduct realistic software engineering tasks, and may soon be deployed to automate machine learning...
Defense MEDIUM
Jialin Wu, Kecen Li, Zhicong Huang +3 more
Many machine learning models are fine-tuned from large language models (LLMs) to achieve high performance in specialized domains like code...
4 months ago cs.CL cs.CR
PDF
Benchmark MEDIUM
Catherine Xia, Manar H. Alalfi
AI programming assistants have demonstrated a tendency to generate code containing basic security vulnerabilities. While developers are ultimately...
4 months ago cs.CR cs.AI
PDF
Survey MEDIUM
James Jin Kang, Dang Bui, Thanh Pham +1 more
The growing use of large language models in sensitive domains has exposed a critical weakness: the inability to ensure that private information can...
Survey MEDIUM
Gabrielle M Gauthier, Eesha Ali, Amna Asim +2 more
Human content moderators (CMs) routinely review distressing digital content at scale. Beyond exposure, the work context (e.g., workload, team...
Benchmark LOW
Yuankai He, Weisong Shi
CAR-Scenes is a frame-level dataset for autonomous driving that enables training and evaluation of vision-language models (VLMs) for interpretable,...
4 months ago cs.CV cs.RO
PDF
Defense MEDIUM
Daniyal Ganiuly, Nurzhau Bolatbek
The increasing virtualization of fifth generation (5G) networks expands the attack surface of the user plane, making spoofing a persistent threat to...
4 months ago cs.CR cs.NI
PDF
Benchmark LOW
Jiarui Liu, Kaustubh Dhole, Yingheng Wang +7 more
Deductive reasoning is the process of deriving conclusions strictly from the given premises, without relying on external knowledge. We define honesty...
Attack LOW
Xin Zhao, Xiaojun Chen, Bingshan Liu +3 more
Generative vision-language models like Stable Diffusion demonstrate remarkable capabilities in creative media synthesis, but they also pose...
4 months ago cs.AI cs.CR cs.CV
PDF
Benchmark MEDIUM
Zexu Wang, Jiachi Chen, Zewei Lin +7 more
Smart contracts have significantly advanced blockchain technology, and digital signatures are crucial for reliable verification of contract...
4 months ago cs.CR cs.SE
PDF
Attack HIGH
Shigeki Kusaka, Keita Saito, Mikoto Kudo +3 more
Large language models (LLMs) are increasingly deployed in real-world systems, making it critical to understand their vulnerabilities. While data...
4 months ago cs.LG cs.AI
PDF
Attack HIGH
Hongyi Li, Chengxuan Zhou, Chu Wang +5 more
Large Audio-language Models (LAMs) have recently enabled powerful speech-based interactions by coupling audio encoders with Large Language Models...
Benchmark LOW
Shengbo Wang, Hong Sun, Ke Li
Interactive preference elicitation (IPE) aims to substantially reduce human effort while acquiring human preferences in wide personalization systems....
Benchmark MEDIUM
Yunfei Yang, Xiaojun Chen, Yuexin Xuan +3 more
Model watermarking techniques can embed watermark information into the protected model for ownership declaration by constructing specific...
4 months ago cs.CR cs.LG
PDF
Benchmark MEDIUM
Kazuki Iwahana, Yusuke Yamasaki, Akira Ito +2 more
Backdoor attacks pose a critical threat to machine learning models, causing them to behave normally on clean data but misclassify poisoned data into...
4 months ago cs.LG cs.CR
PDF
Attack MEDIUM
Zixun Xiong, Gaoyi Wu, Qingyang Yu +5 more
Given the high cost of large language model (LLM) training from scratch, safeguarding LLM intellectual property (IP) has become increasingly crucial....
4 months ago cs.CR cs.AI
PDF
Other LOW
Jiahang He, Rishi Ramachandran, Neel Ramachandran +5 more
As large language models (LLMs) are adopted in an increasingly wide range of applications, user-model interactions have grown in both frequency and...
Attack HIGH
Tiago Machado, Maysa Malfiza Garcia de Macedo, Rogerio Abreu de Paula +5 more
This work aims to investigate how different Large Language Models (LLMs) alignment methods affect the models' responses to prompt attacks. We...
Defense LOW
Huzaifa Arif, Keerthiram Murugesan, Ching-Yun Ko +3 more
We propose patching for large language models (LLMs) like software versions, a lightweight and modular approach for addressing safety...
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial