Text-to-video model

This is an old revision of this page, as edited by The Original Benny C at 22:52, 27 December 2022.

A text-to-video model is a machine learning model that takes only a text description as input and produces a video as output. The idea was inspired by text-to-image models, which generate an image from a text prompt; CogVideo is one example of a text-to-video model.[1]

Work on video prediction aims to render realistic objects against a stable background. One approach uses a recurrent neural network as a sequence-to-sequence model, connected to a convolutional neural network that encodes and decodes each frame pixel by pixel,[2] so that the video is generated with deep learning.[3]
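The data flow described above can be sketched in a few lines of plain Python. This is a toy illustration under stated assumptions, not the method of any cited work: the weights are random and untrained, the function names (`conv_encode`, `rnn_step`, `decode`, `predict_frames`) are hypothetical, the recurrent part is a simple Elman-style update, and the decoder is a single linear layer with a sigmoid rather than a full convolutional decoder. It conditions on earlier frames rather than on text, matching the video-prediction setting.

```python
import math
import random


def conv_encode(frame, kernel):
    """Slide a small kernel over the frame (valid 2D convolution), then flatten."""
    kh, kw = len(kernel), len(kernel[0])
    h, w = len(frame), len(frame[0])
    features = []
    for i in range(h - kh + 1):
        for j in range(w - kw + 1):
            s = sum(frame[i + a][j + b] * kernel[a][b]
                    for a in range(kh) for b in range(kw))
            features.append(math.tanh(s))
    return features


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(x * y for x, y in zip(row, vector)) for row in matrix]


def rand_matrix(rows, cols, rng, scale=0.1):
    """Random weights; a real model would learn these from data."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def rnn_step(state, features, w_hh, w_xh):
    """Elman-style recurrent update: h' = tanh(W_hh h + W_xh x)."""
    return [math.tanh(a + b)
            for a, b in zip(matvec(w_hh, state), matvec(w_xh, features))]


def decode(state, w_out, height, width):
    """Map the hidden state to one value in (0, 1) per pixel, reshaped to H x W."""
    flat = [1.0 / (1.0 + math.exp(-z)) for z in matvec(w_out, state)]
    return [flat[r * width:(r + 1) * width] for r in range(height)]


def predict_frames(context, n_future, hidden=8, seed=0):
    """Read the context frames, then emit n_future frames one at a time."""
    rng = random.Random(seed)
    height, width = len(context[0]), len(context[0][0])
    kernel = rand_matrix(3, 3, rng, scale=1.0)
    n_features = (height - 2) * (width - 2)
    w_xh = rand_matrix(hidden, n_features, rng)
    w_hh = rand_matrix(hidden, hidden, rng)
    w_out = rand_matrix(height * width, hidden, rng)

    state = [0.0] * hidden
    # Encoder half of the sequence-to-sequence model: summarize the input frames.
    for frame in context:
        state = rnn_step(state, conv_encode(frame, kernel), w_hh, w_xh)

    # Decoder half: generate a frame, then feed it back in as the next input.
    outputs = []
    for _ in range(n_future):
        frame = decode(state, w_out, height, width)
        outputs.append(frame)
        state = rnn_step(state, conv_encode(frame, kernel), w_hh, w_xh)
    return outputs
```

Calling `predict_frames(context, 2)` on a list of three 5×5 grayscale frames returns two 5×5 frames. Because the weights are random, the output is meaningless noise; the sketch only shows how frames pass through the encoder, the recurrent state, and the decoder.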

Methodology

Models

Several text-to-video models exist. CogVideo is an open-source model whose code is published on GitHub.[4] Meta Platforms offers Make-A-Video, presented at makeavideo.studio.[5][6][7] Google developed Imagen Video for text-to-video generation.[8][9][10][11][12]

Antonia Antonova presented another model.[13]

References

  1. ^ CogVideo, THUDM, 2022-10-12, retrieved 2022-10-12
  2. ^ "Leading India" (PDF).
  3. ^ Narain, Rohit (2021-12-29). "Smart Video Generation from Text Using Deep Neural Networks". Retrieved 2022-10-12.
  4. ^ CogVideo, THUDM, 2022-10-12, retrieved 2022-10-12
  5. ^ Davies, Teli (2022-09-29). "Make-A-Video: Meta AI's New Model For Text-To-Video Generation". W&B. Retrieved 2022-10-12.
  6. ^ Monge, Jim Clyde (2022-08-03). "This AI Can Create Video From Text Prompt". Medium. Retrieved 2022-10-12.
  7. ^ "Meta's Make-A-Video AI creates videos from text". www.fonearena.com. Retrieved 2022-10-12.
  8. ^ "google: Google takes on Meta, introduces own video-generating AI - The Economic Times". m.economictimes.com. Retrieved 2022-10-12.
  9. ^ Monge, Jim Clyde (2022-08-03). "This AI Can Create Video From Text Prompt". Medium. Retrieved 2022-10-12.
  10. ^ "Nuh-uh, Meta, we can do text-to-video AI, too, says Google". www.theregister.com. Retrieved 2022-10-12.
  11. ^ "Papers with Code - See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction". paperswithcode.com. Retrieved 2022-10-12.
  12. ^ "Papers with Code - Text-driven Video Prediction". paperswithcode.com. Retrieved 2022-10-12.
  13. ^ "Text to Video Generation". Antonia Antonova. Retrieved 2022-10-12.