Last time I tried whisper, it hallucinated an elaborate conversation from sounds of slapping and moaning and it took minutes to spit every single line of it.
I would love your feedback and suggestions for new improvements or features you wanna have, either in the source available version, the desktop app or blog post itself?
ref: https://www.cpubenchmark.net/compare/4585vs4245/Apple-M1-Max...
You might want to add something like yolo finetune to detect scenes + face recognition too.