Revision as of 17:58, 1 June 2025, Xose.vazquez (→Models)		Revision as of 19:30, 2 June 2025, Amberkitten (m →2024: fixed link)
Line 14: [[Alibaba Group|Alibaba]] also released reasoning versions of its [[Qwen]] LLMs in November 2024. In December 2024, Google introduced [[Gemini Deep ~~research~~Research|Deep Research]] in [[Gemini (chatbot)|Gemini]],<ref>{{Cite web |date=2024-12-11 |title=Try Deep Research and our new experimental model in Gemini, your AI assistant |url=https://blog.google/products/gemini/google-gemini-deep-research/ |access-date=2025-02-05 |website=Google |language=en-us}}</ref> a feature that conducts multi-step research tasks. On December 16, 2024, an experiment using a [[Llama (language model)|Llama]] 3B model demonstrated that by scaling test-time compute, a relatively small model could outperform a much larger Llama 70B model on challenging reasoning tasks. This result highlighted that improved inference strategies can unlock latent reasoning capabilities even in compact models.<ref>{{Cite web |title=Scaling test-time compute - a Hugging Face Space by HuggingFaceH4 |url=https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute |access-date=2025-02-05 |website=huggingface.co}}</ref>

Reasoning language model: Difference between revisions