Survey MEDIUM
Kiarash Ahi, Vaibhav Agrawal, Saeed Valizadeh
Large Language Models (LLMs) & Generative AI are transforming cybersecurity, enabling both advanced defenses and new attacks. Organizations now use...
Tool MEDIUM
Emmanuel Bamidele
Long-running LLM agents require persistent memory to preserve state across interactions, yet most deployed systems manage memory with age-based...
1 months ago cs.DC cs.AI cs.LG
PDF
Attack HIGH
Shenyang Chen, Liuwan Zhu
Standard evaluations of backdoor attacks on text-to-image (T2I) models primarily measure trigger activation and visual fidelity. We challenge this...
1 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Abdullah Caglar Oksuz, Anisa Halimi, Erman Ayday
Membership inference attacks (MIAs) threaten the privacy of machine learning models by revealing whether a specific data point was used during...
1 months ago cs.LG cs.CR
PDF
Defense MEDIUM
Chun Yan Ryan Kan, Tommy Tran, Vedant Yadav +4 more
Defending LLMs against adversarial jailbreak attacks remains an open challenge. Existing defenses rely on binary classifiers that fail when...
1 months ago cs.CR cs.AI cs.CL
PDF
Attack HIGH
Zafir Shamsi, Nikhil Chekuru, Zachary Guzman +1 more
Large Language Models (LLMs) are increasingly integrated into high-stakes applications, making robust safety guarantees a central practical and...
1 months ago cs.CL cs.AI
PDF
Benchmark LOW
Martin Bertran, Riccardo Fogliato, Zhiwei Steven Wu
Empirical conclusions depend not only on data but on analytic decisions made throughout the research process. Many-analyst studies have quantified...
1 months ago cs.AI cs.LG
PDF
Benchmark HIGH
Mirae Kim, Seonghun Jeong, Youngjun Kwak
Jailbreaking poses a significant risk to the deployment of Large Language Models (LLMs) and Vision Language Models (VLMs). VLMs are particularly...
1 months ago cs.CL cs.AI cs.DB
PDF
Benchmark LOW
Anna Babarczy, Andras Lukacs, Peter Vedres +1 more
The study explores whether current Large Language Models (LLMs) exhibit Theory of Mind (ToM) capabilities -- specifically, the ability to infer...
1 months ago cs.CL cs.AI
PDF
Attack MEDIUM
Diego Soi, Silvia Lucia Sanna, Lorenzo Pisu +2 more
In recent years, stealthy Android malware has increasingly adopted sophisticated techniques to bypass automatic detection mechanisms and harden...
Tool HIGH
Phan The Duy, Nghi Hoang Khoa, Nguyen Tran Anh Quan +3 more
The increasing deployment of Federated Learning (FL) in Intrusion Detection Systems (IDS) introduces new challenges related to data privacy,...
1 months ago cs.CR cs.AI
PDF
Defense LOW
Imgyeong Lee, Tayyib Ul Hassan, Abram Hindle
Artificial Intelligence (AI) increasingly automates various parts of the software development tasks. Although AI has enhanced the productivity of...
Attack HIGH
Jingkai Guo, Chaitali Chakrabarti, Deliang Fan
Large language models (LLMs) are increasingly deployed in safety and security critical applications, raising concerns about their robustness to model...
1 months ago cs.CR cs.CL cs.LG
PDF
Attack HIGH
Manuel Wirth
As Large Language Models (LLMs) are increasingly integrated into automated decision-making pipelines, specifically within Human Resources (HR), the...
1 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Zachary Coalson, Bo Fang, Sanghyun Hong
Multi-turn interaction length is a dominant factor in the operational costs of conversational LLMs. In this work, we present a new failure mode in...
1 months ago cs.LG cs.CR
PDF
Tool LOW
Leon Staufer, Kevin Feng, Kevin Wei +6 more
Agentic AI systems are increasingly capable of performing professional and personal tasks with limited human involvement. However, tracking these...
1 months ago cs.CY cs.AI
PDF
Benchmark MEDIUM
Gelei Deng, Yi Liu, Yuekang Li +5 more
LLM-based agents show promise for automating penetration testing, yet reported performance varies widely across systems and benchmarks. We analyze 28...
1 months ago cs.CR cs.SE
PDF
Attack LOW
Wyatt Benno, Alberto Centelles, Antoine Douchet +1 more
We present Jolt Atlas, a zero-knowledge machine learning (zkML) framework that extends the Jolt proving system to model inference. Unlike zkVMs...
1 months ago cs.CR cs.AI
PDF
Survey MEDIUM
Boyang Ma, Hechuan Guo, Peizhuo Lv +5 more
Embodied AI systems (e.g., autonomous vehicles, service robots, and LLM-driven interactive agents) are rapidly transitioning from controlled...
1 months ago cs.CR cs.AI
PDF
Benchmark LOW
Takyoung Kim, Jinseok Nam, Chandrayee Basu +5 more
Conversational agents powered by large language models (LLMs) with tool integration achieve strong performance on fixed task-oriented dialogue...
1 months ago cs.CL cs.AI
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial