Showing posts with the label "paper brief"

Four Recent Papers on Reliable LLM Agents: Verification, Runtime Policy, Memory, and Privacy

Why Do LLM Agent Memories Keep Failing? Three Recent Papers on the Core Problems

What Data Shapes LLM Performance? Why This Paper Proposes Data Probes

Two Axes for Reading LLM Agent Design: What the Agent Does and How It Runs

Why LLMs Lose Context in Multi-Turn Interaction: What Three New Papers Suggest About Causes and Responses

When Do Tools Help LLM Agents, and When Do They Backfire?

Two Ways to Stabilize LLM Agents on Complex Tasks: Hierarchical Planning and CAP-CoT

When Does LLM Self-Correction Actually Help? Papers on Iterative Refinement, Evaluation, and Reliability

Is LLM Reasoning Really a Chain of Thought? What a New Paper Questions

Rethinking LLM Reasoning as Internal State Change, Not Visible Chain-of-Thought