Posts

Showing posts from June, 2026

Why Don’t LLM Agents Act as They Explain? The Faithfulness Gap in 3 Recent Papers

What Changed in Physics-Aware Diagram Generation and Physical Reasoning Benchmarks?