{{Short description|Language models designed for reasoning tasks}}
{{unreliable sources|date=January 2025}}
{{Copy edit|for=jargon|date=May 2025}}
{{Merge to|Reflection (artificial intelligence)|date=April 2025}}
'''Reasoning language models''' ('''RLMs''') are [[large language model]]s that have been further trained to solve multi-step reasoning tasks.<ref>{{cite arXiv |title=Reasoning Language Models: A Blueprint |last=Besta |first=Maciej |date=2025-01-23 |eprint=2501.11223 |class=cs.CL}}</ref> These models perform better than traditional autoregressive LLMs on logical, mathematical, or programmatic tasks, can [[Backtracking|backtrack]] during generation, and use test-time compute as an additional [[Neural scaling law|scaling axis]] beyond [[Training, validation, and test data sets|training examples]], parameter count, and train-time compute.
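One common way to spend extra test-time compute is self-consistency: sample several independent answers and take a majority vote. The sketch below illustrates the idea only; the `sample_answer` function is a hypothetical stub standing in for a real model, which returns the correct answer with some fixed probability rather than actually reasoning.

```python
import random
from collections import Counter

def sample_answer(correct=42, p=0.4, rng=random):
    """Toy stand-in for one sampled model response (hypothetical stub):
    returns the correct answer with probability p, else a wrong one."""
    if rng.random() < p:
        return correct
    return rng.choice([i for i in range(100) if i != correct])

def majority_vote(n_samples, rng=random):
    """Spend more test-time compute: sample n answers, return the mode."""
    votes = Counter(sample_answer(rng=rng) for _ in range(n_samples))
    return votes.most_common(1)[0][0]

def accuracy(n_samples, trials=2000, seed=0):
    """Empirical accuracy of majority voting with n_samples per question."""
    rng = random.Random(seed)
    return sum(majority_vote(n_samples, rng) == 42 for _ in range(trials)) / trials
```

Under these toy assumptions, `accuracy(1)` sits near the single-sample rate of 0.4, while `accuracy(25)` approaches 1.0, showing how accuracy can scale with inference-time sampling even when the underlying model is unchanged.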