Karolina Korgul, Yushi Yang, Arkadiusz Drohomirecki +7 more
Web-based agents powered by large language models are increasingly used for tasks such as email management or professional networking. Their reliance...
Large language models (LLMs) have revolutionized software development through AI-assisted coding tools, enabling developers with limited programming...
Ahmed M. Hussain, Salahuddin Salahuddin, Panos Papadimitratos
Current large language model (LLM) safety approaches focus on explicitly harmful content while overlooking a critical vulnerability: the inability...
Ensuring the safety of language models (LMs) while maintaining their usefulness remains a critical challenge in AI alignment. Current approaches rely...
Jaykumar Kasundra, Anjaneya Praharaj, Sourabh Surana +11 more
Safeguarding large language models (LLMs) against unsafe or adversarial behavior is critical as they are increasingly deployed in conversational and...
Naseem Machlovi, Maryam Saleki, Ruhul Amin +5 more
As large language models (LLMs) become deeply embedded in daily life, the urgent need for safer moderation systems, distinguishing naive from...
Naseem Machlovi, Maryam Saleki, Ruhul Amin +5 more
As large language models (LLMs) become deeply embedded in daily life, the urgent need for safer moderation systems that distinguish between naive and...