I think Whisper is one of the best uses of machine learning.
Recognition in general is the main thing it’s powerful for. Speech to text, OCR, etc.
Whisper struggles with non-English languages and background noise. You might get better results using the larger models (medium/large) with a lower temperature setting to reduce the hallucinations and repetitions you're experiencing.
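For what it's worth, here is a rough sketch of that suggestion using the `openai-whisper` Python package. The helper names (`build_transcribe_options`, `transcribe_file`) are made up for illustration, and passing `language` and turning off `condition_on_previous_text` are my own assumptions about what may help with repetition loops, not something I have measured on your audio.

```python
from typing import Any, Dict, Optional


def build_transcribe_options(language: Optional[str] = None,
                             temperature: float = 0.0) -> Dict[str, Any]:
    """Collect keyword arguments for whisper's transcribe() call."""
    options: Dict[str, Any] = {
        # A single float replaces the default schedule of increasing
        # temperatures, so decoding never falls back to a higher value.
        "temperature": temperature,
        # Assumption: not feeding previous output back in as a prompt
        # may reduce runs of repeated text.
        "condition_on_previous_text": False,
    }
    if language is not None:
        # Assumption: naming the language skips auto-detection.
        options["language"] = language
    return options


def transcribe_file(path: str, model_name: str = "medium",
                    language: Optional[str] = None) -> str:
    """Transcribe one audio file with a larger Whisper model."""
    import whisper  # pip install openai-whisper (imported lazily)

    model = whisper.load_model(model_name)
    result = model.transcribe(path, **build_transcribe_options(language))
    return result["text"]
```

Something like `transcribe_file("recording.mp3", language="fr")` would be the call, where the file name is just a placeholder. Swap `"medium"` for `"large"` if you have the VRAM for it.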
Check out Voxtral Mini or Small if you have the GPU for it. It works really well on English, and since it comes from a French company I would be surprised if it didn't handle French well too.