How to create story-like videos with transformers, with no diffusion models involved! In this video, we explain the Phenaki paper from Google Brain.
Sponsor: Tasq.ai https://www.tasq.ai/
Imagen Video and Make-A-Video explained: https://youtu.be/AcvmyqGgMh8
Stable Diffusion explained: https://youtu.be/J87hffSMB60
Diffusion models explained: https://youtu.be/344w5h24-h8
Check out our daily #MachineLearning Quiz Questions: https://www.youtube.com/c/AICoffeeBreak/community
Villegas, Ruben, Mohammad Babaeizadeh, Pieter-Jan Kindermans, Hernan Moraldo, Han Zhang, Mohammad Taghi Saffar, Santiago Castro, Julius Kunze, and Dumitru Erhan. "Phenaki: Variable Length Video Generation From Open Domain Textual Description." arXiv preprint arXiv:2210.02399 (2022). https://arxiv.org/abs/2210.02399
Phenaki website: https://phenaki.github.io/
Thanks to our Patrons who support us in Tier 2, 3, 4:
Dres. Trost GbR, Edvard Grødem, Vignesh Valliappan, Mutual Information, Mike Ton
Outline:
00:00 Phenaki
00:56 Tasq.ai [Sponsor]
02:18 The problem of long video training data
04:02 Phenaki in a nutshell
06:22 First component: The C-ViViT architecture explained
09:25 Second component: MaskGIT in Phenaki
--------------------------
Optionally, buy us a coffee to help with our Coffee Bean production!
Patreon: https://www.patreon.com/AICoffeeBreak
Ko-fi: https://ko-fi.com/aicoffeebreak
--------------------------
Links:
AICoffeeBreakQuiz: https://www.youtube.com/c/AICoffeeBreak/community
Twitter: https://twitter.com/AICoffeeBreak
Reddit: https://www.reddit.com/r/AICoffeeBreak/
YouTube: https://www.youtube.com/AICoffeeBreak
#AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #research