In a few short months, the world of generative AI has evolved from chatbots like ChatGPT to photorealistic images created by Midjourney. In case you thought things were slowing down any time soon, AI-generated videos might be about to have their big moment in the spotlight.
As highlighted by a tweet from Philipp Tsipman, the founder of a marketing platform for such videos, there have been five new AI video generators launched in just the last seven days. The quality varies, but it’s clear that the technology is moving at a relentless pace. So, let’s break down the five AI video generators, which range from janky to seriously impressive.
Runway has been leading the pack in publicly available AI editing. Masking, motion-tracking, super slow-motion, and color grading all benefit from AI assistance. This is a subscription-based service, so it has extra incentive to push the limits.
In February, Runway started testing generative video with its Gen-1 update, which lets you restyle a video with words. A video of a walk down the street becomes an animated short that appears to be made with hand-sculpted and posed Claymation characters.
Runway released Gen-1 publicly today and is already testing Gen-2, eliminating the need for a reference video. Soon, Runway will let you create a video with a simple text prompt.
You might recognize Picsart as a free photo editor, but it also offers video editing. Text2Video-Zero is a Picsart research tool that uses the Stable Diffusion AI image generator to create multiple frames with enough in common to look cohesive.
This is significant progress. as early experiments at assembling AI images as video frames produced chaotic scenes where every detail constantly shifted. The effect was eye-catching, but distracted from the content.
Video-P2P is an open-source project similar to Runway Gen-1. It takes video input and uses Stable Diffusion to style it with text prompts. The examples shown look good, with steady backgrounds and consistency in the altered elements.
TemporalNet is based on ControlNet, an advanced AI image generator that allows more control of the poses of figures that appear in pictures.
The videos have some flicker, but the results are promising. ControlNet took a big step toward producing predictable scenes, and TemporalNet might provide a nice alternative method of creating videos.
Text-to-video is an AI model created by Alibaba. It’s clear that this model needs a bit more work, but having more options to explore is always a good thing.
The rapid pace of AI innovation is phenomenal. Just months ago, we were teased with glimpses of the potential of similar videos created in AI labs. In 2022, one of the first public AI video generators simply superimposed tiny AI images over stock footage.
Those “old school” methods, from a few months ago, are laughable when you see how far AI has progressed in such a short amount of time.