Posts
Showing posts from June, 2026
Why Don’t LLM Agents Act as They Explain? The Faithfulness Gap in 3 Recent Papers
What Changed in Physics-Aware Diagram Generation and Physical Reasoning Benchmarks?