I tried to use DeepSpeech to transcribe last Friday's stream for editing purposes. It came out reading as though it were from a Markov bot.
"it is again stir so one of their god lichonin ... and had you know six in the morning the spies they came to me that quickly but he realized it was it was this and the box tom that is a piece of part ... god blue desperate busy sea libraries the niobrara with lucid programming hospitally not cluttered the bissextile"
@jakob I was surprised to find that DeepSpeech doesn't seem to be using of a good language model, as it frequently produces obscure words and even weird combination of letters. At last, I found vosk from alphacep. It's quite accurate even when used by a non-native speaker
@jakob I find the test_microphone.py example a good place to start, sample rate and format are handled, so I don't need to worry about converting codec or getting timing wrong. I think the first time I talked into the microphone, words got recognized one by one with little latency, also the partial result sometimes changes to make it more likely to be a English sentence
The social network of the future: No ads, no corporate surveillance, ethical design, and decentralization! Own your data with Mastodon!