Evolution of Puzzle Complexity in AI Reasoning

What is the Tower of Hanoi puzzle primarily designed to evaluate? 🧩
Difficulty: Easy
How does the complexity of the Checker Jumping puzzle scale with the number of checkers? 📈
Difficulty: Medium
What fundamental limitation do reasoning models show when asked to perform exact computation on the Tower of Hanoi puzzle? 🔍
Difficulty: Hard
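
As background for the Tower of Hanoi questions above, here is a minimal sketch of the classic recursive solution together with a checker that replays a move list. The function names (`hanoi_moves`, `is_valid_solution`) and the peg labels "A", "B", "C" are assumptions made for illustration and do not come from the quiz source. The sketch relies on the well-known result that the shortest solution for n disks takes 2^n - 1 moves, which is why exact step-by-step execution gets long quickly as n grows.

```python
def hanoi_moves(n, source="A", target="C", spare="B"):
    """Return an optimal move list for n disks as (disk, from_peg, to_peg) tuples.

    Disk 1 is the smallest, disk n the largest. Peg labels are illustrative.
    """
    if n == 0:
        return []
    # Move the n-1 smaller disks out of the way, move the largest disk,
    # then move the n-1 smaller disks on top of it.
    return (
        hanoi_moves(n - 1, source, spare, target)
        + [(n, source, target)]
        + hanoi_moves(n - 1, spare, target, source)
    )


def is_valid_solution(n, moves):
    """Replay a move list and check every move is legal and the puzzle ends solved."""
    pegs = {"A": list(range(n, 0, -1)), "B": [], "C": []}
    for disk, src, dst in moves:
        # The moved disk must be the top disk of the source peg.
        if not pegs[src] or pegs[src][-1] != disk:
            return False
        # A larger disk may never be placed on a smaller one.
        if pegs[dst] and pegs[dst][-1] < disk:
            return False
        pegs[dst].append(pegs[src].pop())
    return pegs["C"] == list(range(n, 0, -1))


if __name__ == "__main__":
    for n in range(1, 6):
        moves = hanoi_moves(n)
        print(n, len(moves), is_valid_solution(n, moves))
```

A checker of this kind is one way a model's proposed move sequence could be verified mechanically: a single illegal or missing move makes the whole sequence fail, so partial correctness does not count.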