I use it for generating subtitles. It figures out context, it ignores stuttering, it does punctuation etc. It’s really is just better. With clean audio it transcribes like a human does.
It does better than other techniques with dirty audio, but when it fails it fails weird, which is the big issue here.
You need an editor for traditional transcription tools too :) and it’s A LOT more work. They don’t even do punctuation or names.