Posts
Showing posts with the label LLM agents
Recent Papers on LLM Agents: Memory, Negotiation, and Structural Failure
Three Recent Papers on Making LLM Agent Execution More Reliable: SDOF, SkillSmith, and STAR
Two Axes for Reading LLM Agent Design: What the Agent Does and How It Runs
Designing Safer LLM Agents: Key Issues from Recent Papers
How Conversational LLM Agents Choose the Next Question: BALAR and PRISM
When Do Tools Help LLM Agents, and When Do They Backfire?
LLM Agents and Scientific Discovery: What Four New arXiv Papers Suggest About the Next Wave of Automation
DreamProver and AGEL-Comp: What LLM Agents Need to Reason Better and Generalize Further
Three Recent Papers on Making LLM Agents More Stable in Planning and Reasoning
Two Ways to Stabilize LLM Agents on Complex Tasks: Hierarchical Planning and CAP-CoT