code_204

Posts

DreamProver and AGEL-Comp: What LLM Agents Need to Reason Better and Generalize Further

Three Recent Papers on Making LLM Agents More Stable in Planning and Reasoning

Two Ways to Stabilize LLM Agents on Complex Tasks: Hierarchical Planning and CAP-CoT

When Does LLM Self-Correction Actually Help? Papers on Iterative Refinement, Evaluation, and Reliability

AI Agents in Practice: Workflow Integration and Real-World Use Cases

How LLM Agents Combine Decision-Making and Skill Use in Long-Horizon Tasks

Tool Choice and Interpretability in LLM Agents: Key Ideas from Three Recent Papers

Why LLM Agents Still Struggle With Scientific Reasoning: Limits and Responses From Recent Papers

Is LLM Reasoning Really a Chain of Thought? What a New Paper Questions

Rethinking LLM Reasoning as Internal State Change, Not Visible Chain-of-Thought

Why LLM Agents Stay Unstable: Three Recent arXiv Papers on Reliability, Web Skill Learning, and Reasoning Limits

Archive

June 2026 (9)
May 2026 (24)
April 2026 (15)
June 2023 (2)
May 2023 (19)

Labels

AGEL-Comp (1)
agent (1)
agent architecture (2)
agent evaluation (2)
Agent Evaluation (1)
agent memory (6)
agent orchestration (2)
agent reasoning (1)
agent reliability (2)
agent safety (2)
