Attack HIGH relevance

Detecting Prompt Injection Attacks Against Application Using Classifiers

Safwan Shaheer G. M. Refatul Islam Mohammad Rafid Hamid Md. Abrar Faiaz Khan Md. Omar Faruk Yaseen Nur

cs.CR cs.AI

Published

December 14, 2025

Updated

December 14, 2025

Links

PDF arxiv

Abstract

Prompt injection attacks can compromise the security and stability of critical systems, from infrastructure to large web applications. This work curates and augments a prompt injection dataset based on the HackAPrompt Playground Submissions corpus and trains several classifiers, including LSTM, feed forward neural networks, Random Forest, and Naive Bayes, to detect malicious prompts in LLM integrated web applications. The proposed approach improves prompt injection detection and mitigation, helping protect targeted applications and systems.

Metadata

Comment: 9 pages, X figures; undergraduate research project on detecting prompt injection attacks against LLM integrated web applications using classical machine learning and neural classifiers

Pro Analysis

Full threat analysis, ATLAS technique mapping, compliance impact assessment (ISO 42001, EU AI Act), and actionable recommendations are available with a Pro subscription.

Threat Deep-Dive

ATLAS Mapping

Compliance Reports

Actionable Recommendations

Start 14-Day Free Trial

Back to Research