TheTwelveYearOld@lemmy.world to Linux@lemmy.worldEnglish · 18 days agoFFmpeg 8.0 merges OpenAI "Whisper Filter" for automatic speech recognition, Vulkan AV1 encoding, & VP9 decodingwww.phoronix.comexternal-linkmessage-square6linkfedilinkarrow-up11arrow-down10file-textcross-posted to: linux@programming.devlinux@lemmy.mlopensource@programming.dev
arrow-up11arrow-down1external-linkFFmpeg 8.0 merges OpenAI "Whisper Filter" for automatic speech recognition, Vulkan AV1 encoding, & VP9 decodingwww.phoronix.comTheTwelveYearOld@lemmy.world to Linux@lemmy.worldEnglish · 18 days agomessage-square6linkfedilinkfile-textcross-posted to: linux@programming.devlinux@lemmy.mlopensource@programming.dev
https://www.phoronix.com/news/FFmpeg-Vulkan-AV1-Encoding https://www.phoronix.com/news/FFmpeg-Lands-Whisper
minus-squareMysteriousSophon21@lemmy.worldlinkfedilinkEnglisharrow-up0·17 days agoWhisper struggles with non-english languages and background noise - you might get better results using the larger models (medium/large) with a lower temperature setting to reduce the hallucinations and repetitions your experiencing.
Whisper struggles with non-english languages and background noise - you might get better results using the larger models (medium/large) with a lower temperature setting to reduce the hallucinations and repetitions your experiencing.