Posts

Showing posts with the label LLM

What Data Shapes LLM Performance? Why This Paper Proposes Data Probes

Three Recent AI Papers on Agents, Documents, and Data: What Has Changed for Real-World LLM Systems?

Why LLMs Lose Context in Multi-Turn Interaction: What Three New Papers Suggest About Causes and Responses

Can LLMs Reuse Tools Creatively? What CreativityBench Tries to Measure

Why Does LLM Diversity Shrink? Reconsidering Generative Diversity After Supervised Fine-Tuning

When Does LLM Self-Correction Actually Help? Papers on Iterative Refinement, Evaluation, and Reliability

Is LLM Reasoning Really a Chain of Thought? What a New Paper Questions

Rethinking LLM Reasoning as Internal State Change, Not Visible Chain-of-Thought