In addition, they show a counter-intuitive scaling limit: their reasoning work increases with challenge complexity around a point, then declines Even with having an enough token funds. By evaluating LRMs with their normal LLM counterparts less than equivalent inference compute, we discover a few functionality regimes: (1) lower-complexity duties https://www.youtube.com/watch?v=snr3is5MTiU