Transcribe and summarize YouTube videos or audio files
Generate voice covers from audio or text input