Replies: 1 comment 1 reply
-
AFAIK the original training material is kept under wraps, but it might be that the word "compass" just isn't sufficiently presented there. EDIT: Just ran some tests, "compass" was transcribed just fine when it was a part of a speech. And come to think of it, there's the word "compassion" which is more common and probably uses the same tokens. So the word's in the model alright, but isn't always detected when it's standing on its own (bigger models fare better in this regard, of course). |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I'm building my own "error dictionary", that converts single words returned by Whisper, that are not English. (I'm using ggml-base.en.bin) I get words that make no sense. This seems to occur most frequently when I am using Whisper as a command processor, where I speak a single word. I get these words in a "clean" environment, that is, there is no background noise. Here's an example:
This is my error dictionary entry for "compass"
Some of these words I cannot even find in a Google search. Is this because the huge but weaky trained Hugging Face models contains these tokens?
Beta Was this translation helpful? Give feedback.
All reactions