Phenaki is an AI model that generates videos, potentially several minutes long, directly from text. It can also generate video from a still image and a prompt. The proposed video encoder-decoder outperforms all per-frame baselines currently used in the literature in terms of spatio-temporal quality and number of tokens per video. To generate video tokens from text, the model uses a bidirectional masked transformer conditioned on pre-computed text tokens. The generated video tokens are then de-tokenized to create the actual video.
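As a rough illustration of how a bidirectional masked transformer can turn text tokens into video tokens, here is a minimal toy sketch in Python. It is not Phenaki's implementation: `toy_predict` is a hypothetical stand-in for the real transformer, and the cosine unmasking schedule, vocabulary size, and token count are all assumptions chosen for the example. The final de-tokenization step (tokens to pixels) is left out.

```python
import math
import random

MASK = -1  # sentinel for a video token that has not been generated yet


def toy_predict(tokens, text_tokens, vocab_size, rng):
    """Hypothetical stand-in for the transformer.

    Returns {position: (predicted_token, confidence)} for every masked slot.
    A real model would attend to all positions and to the text tokens.
    """
    text_bias = sum(text_tokens)
    preds = {}
    for i, tok in enumerate(tokens):
        if tok == MASK:
            preds[i] = ((text_bias + i) % vocab_size, rng.random())
    return preds


def masked_decode(text_tokens, num_video_tokens=16, vocab_size=8, steps=4, seed=0):
    """Fill a fully masked token sequence over a fixed number of steps."""
    rng = random.Random(seed)
    tokens = [MASK] * num_video_tokens
    for step in range(1, steps + 1):
        preds = toy_predict(tokens, text_tokens, vocab_size, rng)
        if not preds:
            break
        # Assumed cosine schedule: how many tokens stay masked after this step.
        keep_masked = int(num_video_tokens * math.cos(math.pi / 2 * step / steps))
        num_to_fill = max(1, len(preds) - keep_masked)
        # Commit the most confident predictions; the rest stay masked.
        ranked = sorted(preds, key=lambda i: preds[i][1], reverse=True)
        for i in ranked[:num_to_fill]:
            tokens[i] = preds[i][0]
    return tokens


if __name__ == "__main__":
    video_tokens = masked_decode(text_tokens=[3, 1, 4])
    print(video_tokens)  # these tokens would then go to a de-tokenizer
```

The point of the sketch is the control flow: all positions are predicted in parallel at each step, only the most confident predictions are kept, and the remainder are re-predicted on the next pass until no masked tokens are left.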
Discover similar tools to enhance your workflow
Flawless is a company that is using advanced technology and artificial intelligence to revolution...
Genmo offers fantastical video generation with AI. You can also see videos generated by the commu...
Create AI videos by simply typing in text. Easy to use, cheap and scalable. Make engaging videos ...