Revision as of 19:24, 8 June 2025 edit Maxeto0910 (talk \| contribs) Extended confirmed users 116,705 edits →2025 Tag: Visual edit ← Previous edit		Revision as of 07:34, 9 June 2025 edit undo Quensen (talk \| contribs) 43 edits No edit summary Tags: Mobile edit Mobile app edit iOS app edit App section source Next edit →
Line 4: }} {{Merge to\|Reflection (artificial intelligence)\|date=April 2025}} '''Reasoning language models''' ('''RLMs''') are [[large language model]]s that have been further trained to solve multi-step [[reasoning]] tasks.<ref>{{cite arXiv \|title=Reasoning Language Models: A Blueprint \|last=Besta \|first=Maciej \|date=2025-01-23 \|eprint=2501.11223 \|class=cs.CL}}</ref> These models perform better on logical, mathematical or programmatic tasks than traditional autoregressive LLMs, have the ability to [[Backtracking\|backtrack]], and employ test-time compute as an additional [[Neural scaling law\|scaling axis]] beyond [[Training, validation, and test data sets\|training examples]], parameter count, and train-time compute. == History ==

Reasoning language model: Difference between revisions