Analyze images to generate detailed prompts
Animate a portrait from audio speech
Clone voices from audio files or microphone input