Moreover, they exhibit a counter-intuitive scaling limit: their reasoning effort increases with problem complexity up to a point, then declines despite having an adequate token budget. By comparing LRMs with their standard LLM counterparts under equivalent inference compute, we identify a few performance regimes: (1) low-complexity tasks https://www.youtube.com/watch?v=snr3is5MTiU