AnimateDiff turns a text prompt into a video using a Stable Diffusion model. You can think of it as a slight generalisation of text-to-image: Instead of generating an image, it generates a video.