Reasoning Categories
Not all reasoning is the same. LLMs struggle differently with each type: Mathematical reasoning — arithmetic, algebra, word problems. LLMs fail on novel computations but can solve problems similar to training data. Chain-of-thought helps significantly. Logical reasoning — deduction, induction, abduction. LLMs struggle with formal logic, especially negation and quantifiers (“all”, “some”, “none”). Commonsense reasoning — understanding physical world, social norms, temporal relationships. LLMs are surprisingly good at this because it’s heavily represented in training data. Causal reasoning — understanding cause and effect, counterfactuals. LLMs learn correlations, not causation. Planning — multi-step goal-directed behavior with state tracking. One of the weakest areas for LLMs.
Reasoning Difficulty
// LLM reasoning capabilities
Commonsense: ████████░░ Good
"Is ice cream usually cold?" → Yes
Well-represented in training data
Mathematical: █████░░░░░ Mixed
Simple: "5 × 3 = ?" → 15 ✓
Complex: "∫x²dx from 0 to 3" → ?
CoT helps dramatically
Logical: ████░░░░░░ Weak
"All A are B. All B are C.
Are all A also C?" → Usually ✓
"Not all A are B..." → Often ✗
Causal: ███░░░░░░░ Weak
Correlation ≠ causation
Counterfactuals are hard
Planning: ██░░░░░░░░ Very Weak
State tracking fails
Can't search solution spaces
// Worst reasoning category
Key insight: LLMs are best at commonsense reasoning (pattern-heavy) and worst at planning (requires search and state tracking). The techniques in this course — CoT, ToT, tool use — target the weak areas by adding structure and computation to the reasoning process.