Complete silence is always hallucinated as "ترجمة نانسي قنقر" in Arabic

github.com

510 points by edent 15 hours ago


cyp0633 - 14 hours ago

The same happens with whisper-large-v3 on Chinese transcription: silence is transcribed to something like "please upvote, share and favourite this video". I suspect they trained the model on some random YouTube video without carefully picking really useful data.