Reasoning language model: Difference between revisions

Content deleted Content added
m 2024: fixed link
Line 14:
[[Alibaba Group|Alibaba]] also released reasoning versions of its [[Qwen]] LLMs in November 2024.
 
In December 2024, Google introduced [[Gemini Deep researchResearch|Deep Research]] in [[Gemini (chatbot)|Gemini]],<ref>{{Cite web |date=2024-12-11 |title=Try Deep Research and our new experimental model in Gemini, your AI assistant |url=https://blog.google/products/gemini/google-gemini-deep-research/ |access-date=2025-02-05 |website=Google |language=en-us}}</ref> a feature in Gemini that conducts multi-step research tasks.
 
On December 16, 2024, an experiment using a [[Llama (language model)|Llama]] 3B model demonstrated that by scaling test-time compute, a relatively small model could outperform a much larger Llama 70B model on challenging reasoning tasks. This result highlighted that improved inference strategies can unlock latent reasoning capabilities even in compact models.<ref>{{Cite web |title=Scaling test-time compute - a Hugging Face Space by HuggingFaceH4 |url=https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute |access-date=2025-02-05 |website=huggingface.co}}</ref>