Upload an image or a short video to condition on, write an English prompt and press Generate. GPU highly recommended.