VideoPoet

Text-to-video model by Google

"A dog eating popcorn at the cinema"

"A teddy bear with a cap, sunglasses, and leather jacket playing drums"

Example videos generated by the model from texts

Developer(s)GoogleInitial releaseFebruary 8, 2024; 3 months ago (2024-02-08)TypeLarge language model

VideoPoet is a large language model developed by Google Research in 2023 for video making.^[1]^[2]^[3]^[4] It can be asked to animate still images.^[5] The model accepts text, images, and videos as inputs, with a program to add feature for any input to any format generated content.^[4] VideoPoet was publicly announced on December 19, 2023.^[1] It uses an autoregressive language model.

References

^ ^a ^b Krithika, K. L. (December 20, 2023). "Google Unveils VideoPoet, a New LLM for Video Generation". Analytics India Magazine. Retrieved April 29, 2024.
^ Kondratyuk, Dan; Yu, Lijun; Gu, Xiuye; Lezama, José; Huang, Jonathan; Hornung, Rachel; Adam, Hartwig; Akbari, Hassan; Alon, Yair; Birodkar, Vighnesh; Cheng, Yong; Chiu, Ming-Chang; Dillon, Josh; Essa, Irfan; Gupta, Agrim; Hahn, Meera; Hauth, Anja; Hendon, David; Martinez, Alonso; Minnen, David; Ross, David; Schindler, Grant; Sirotenko, Mikhail; Sohn, Kihyuk; Somandepalli, Krishna; Wang, Huisheng; Yan, Jimmy; Yang, Ming-Hsuan; Yang, Xuan; Seybold, Bryan; Jiang, Lu (December 21, 2023). "VideoPoet: A Large Language Model for Zero-Shot Video Generation". arXiv:2312.14125 [cs.CV].
^ "Google has introduced VideoPOET breaking new ground in coherent video generation". Gizmochina. December 21, 2023.
^ ^a ^b "VideoPoet". Google Research. Retrieved April 29, 2024.
^ Franzen, Carl (December 20, 2023). "Google's new multimodal AI video generator VideoPoet looks incredible". VentureBeat.

External links

Media related to VideoPoet at Wikimedia Commons

Google AI

Google
Google Brain
Google DeepMind

Computer programs

AlphaGo

Versions	AlphaGo (2015) Master (2016) AlphaGo Zero (2017) AlphaZero (2017) MuZero (2019)
Competitions	Fan Hui (2015) Lee Sedol (2016) Ke Jie (2017)
In popular culture	AlphaGo (2017) The MANIAC (2023)

Other

AlphaFold (2018)
AlphaStar (2019)
AlphaDev (2023)
AlphaGeometry (2024)

Machine learning

Neural networks	WaveNet (2016) Transformer (2017) Gato (2022)
Other	Quantum Artificial Intelligence Lab TensorFlow Tensor Processing Unit

Generative AI

Chatbots	Assistant (2016) Sparrow (2022) Gemini (2023)
Language models	BERT (2018) LaMDA (2021) Chinchilla (2022) PaLM (2022) Gemini (2023) VideoPoet (2024)
Other	Vids (2024)