Safety, Efficiency, and Real-World Use of LLM Agents: Reading Four Recent arXiv Papers
Safety, Efficiency, and Real-World Use of LLM Agents: Reading Four Recent arXiv Papers This brief looks at four recent arXiv papers that approach LLM systems from different but connected angles: how agents should communicate with each other, how prompt injection and jailbreak attempts can be detected, how safety mechanisms can themselves create new attack surfaces, and what really explains gains in RAG rewriting. The papers are “What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems,” “GuardNet: Ensemble Strategies of Shallow Neural Networks for Robust Prompt Injection and Jailbreak Detection,” “Safety Paradox: How Enhanced Safety Awareness Leaves LLMs Vulnerable to Posterior Attack,” and “Answer Presence Drives RAG Rewriting Gains,” all introduced as new arXiv submissions in the selected source summaries. Taken together, they ask a practical question: when we build LLM systems for real use, what should we optimize first—communication efficiency, safety f...