Revision as of 19:53, 21 April 2025 edit Dimawik (talk \| contribs) Extended confirmed users 2,445 edits →top: Expanding article ← Previous edit		Revision as of 19:59, 21 April 2025 edit undo Dimawik (talk \| contribs) Extended confirmed users 2,445 edits m →top: Copyedit (minor) Next edit →
Line 1: {{in use}} A '''1.58-bit Large Language Model''' ('''1.58-bit LLM''') is a version of a [[large language model]] with weights using only three values: -1, 0, and +1. This restriction allows the model to replace costly multiplications with additions and reduce the storage memory. Since the end-task performance and [[perplexity]] of 1.58-bit LLMs are close to their "full precision" (16-bit [[FP16]] or [[BF16]]) counterparts, this design allows reaching the same [[artificial intelligence]] goals with much lower hardware requirements, latency, and training effort.{{sfn\|Ma\|Wang\|Ma\|Wang\|2024\|p=1}} ==References==

1.58-bit large language model: Difference between revisions