Content deleted Content added
Xose.vazquez (talk | contribs) |
Amberkitten (talk | contribs) m →2024: fixed link |
||
Line 14:
[[Alibaba Group|Alibaba]] also released reasoning versions of its [[Qwen]] LLMs in November 2024.
In December 2024, Google introduced [[Gemini Deep
On December 16, 2024, an experiment using a [[Llama (language model)|Llama]] 3B model demonstrated that by scaling test-time compute, a relatively small model could outperform a much larger Llama 70B model on challenging reasoning tasks. This result highlighted that improved inference strategies can unlock latent reasoning capabilities even in compact models.<ref>{{Cite web |title=Scaling test-time compute - a Hugging Face Space by HuggingFaceH4 |url=https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute |access-date=2025-02-05 |website=huggingface.co}}</ref>
|