Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
Discover similar tools to enhance your workflow
Open Source GPT -3 Powered CLI The current prompt length is ~840 tokens and the pricing for text-...
Jusi is an AI-powered tool that enables businesses to bring their ideas to life faster and more a...
Create audio files for commercial use. Offers features such as voice effects, pauses, speed, pitc...