HACKER Q&A
📣 lovelearning

Any technical reasons Google Docs can't do voice typing in Firefox?


Google Docs enables voice typing when opened in Chrome. But not in Firefox.

Does somebody here know the technical reasons?

I find speech-to-text dictation a productive way to crank out a draft. So far, voice typing is the easiest approach on Linux that I've found. But I'd prefer Firefox. Any other suggestions for easy STT on Linux? Punctuation identification is preferred but optional. It's a Kubuntu system with only CPU, no GPU.


  👤 SahAssar Accepted Answer ✓
IIRC every browser that supports the Web Speech API does so via cloud services. Mozilla being the only major browser engine maker without it's own "cloud" and having slightly fewer phone-home features than many of the others didn't want to do that. Mozilla has been doing quite a bit of work in the area though (for example https://github.com/mozilla/DeepSpeech), hopefully to enable these features locally in the future.

IMO it's good if we don't make web platform features unfeasible to implement locally, so I think mozillas stance makes sense. A new browser should not have to have the backing of a massive cloud provider willing to give away compute for free.


👤 fxtentacle
It's purely business reasons. Also in Chrome, the STT runs in the Google cloud anyway.

And just FYI I'm working on turning my state of the art speech recognition paper (TEVR, token entropy variance reduction) into a developer-friendly binary which will run offline on your GPU and, hence, be fully private, and offer a scripting API on localhost. Testing WER on LibriSpeech clean is 2.3, so slightly worse than an offline wav2vec2 1B, but years ahead of Kaldi, Vosk, coqui and the usual streaming cloud services.

I estimate it'll be 2 more months until I can post Linux binaries. Won't be open source, though.


👤 sillystuff
Nerd dictation is a purely on-device speech to text program that works pretty well if your computer is fast enough.

https://github.com/ideasman42/nerd-dictation

get speech models here:

https://github.com/alphacep/vosk-api

HN discussion:

https://news.ycombinator.com/item?id=29972579


👤 FrenchDevRemote
AFAIK there is no technical reasons, just business reasons.

Google systematically abuse their position to make Firefox less appealing to people


👤 b20000
you can’t even do basic shit in a productive manner in google docs so what are you worrying about?

👤 midislack
Technically Google wants you on Chrome.