Text-to-video model: Difference between revisions

Content deleted Content added
A serene animated depiction of Lord Krishna standing in a mystical Vrindavan at sunset, radiating a divine glow. With a calm and compassionate expression, he shares profound wisdom on life, death, and rebirth. Dressed in royal blue and gold, he gracefully moves his hands while speaking in a soothing voice, reassuring souls of their immortality and the importance of detachment, accompanied by gentle flute music in the background.
Tags: Reverted Visual edit Mobile edit Mobile web edit
Reverted 1 edit by Nishanttyagi1111 (talk): Rv giant short desc
Line 1:
{{short description|Machine learning model}}
{{short description|A serene animated depiction of Lord Krishna standing in a mystical Vrindavan at sunset, radiating a divine glow. With a calm and compassionate expression, he shares profound wisdom on life, death, and rebirth. Dressed in royal blue and gold, he gracefully moves his hands while speaking in a soothing voice, reassuring souls of their immortality and the importance of detachment, accompanied by gentle flute music in the background.}}
{{Use dmy dates|date=November 2024}}
[[File:OpenAI Sora in Action- Tokyo Walk.webm|thumb|upright=1.35|Generate aA video thatgenerated isusing AOpenAI's serene[[Sora animated(text-to-video depictionmodel)|Sora]] oftext-to-video Lordmodel, Krishnausing standingthe inprompt: a<code>A mysticalstylish Vrindavanwoman atwalks sunset, radiatingdown a divineTokyo glow.street Withfilled awith calmwarm glowing neon and compassionateanimated expression,city hesignage. sharesShe profoundwears wisdoma onblack life,leather deathjacket, anda rebirth.long Dressedred indress, royaland blueblack boots, and gold,carries hea gracefullyblack movespurse. hisShe handswears whilesunglasses speakingand inred alipstick. soothingShe voice,walks reassuringconfidently soulsand ofcasually. theirThe immortalitystreet is damp and thereflective, importancecreating ofa detachment,mirror accompaniedeffect byof gentlethe flutecolorful musiclights. inMany thepedestrians walk backgroundabout.</code>]]
A '''text-to-video model''' is a [[machine learning model]] that uses a [[natural language]] description as input to produce a [[video]] relevant to the input text.<ref name="AIIR">{{cite report|url=https://aiindex.stanford.edu/wp-content/uploads/2023/04/HAI_AI-Index-Report_2023.pdf|title=Artificial Intelligence Index Report 2023|publisher=Stanford Institute for Human-Centered Artificial Intelligence|page=98|quote=Multiple high quality text-to-video models, AI systems that can generate video clips from prompted text, were released in 2022.}}</ref> Advancements during the 2020s in the generation of high-quality, text-conditioned videos have largely been driven by the development of video [[diffusion model]]s.<ref>{{cite arXiv |last1=Melnik |first1=Andrew |title=Video Diffusion Models: A Survey |date=2024-05-06 |eprint =2405.03150 |last2=Ljubljanac |first2=Michal |last3=Lu |first3=Cong |last4=Yan |first4=Qi |last5=Ren |first5=Weiming |last6=Ritter |first6=Helge|class=cs.CV }}</ref>