It's not even close to a fair fight: with the same implementation and settings, a GTX 1070 is roughly 90x (nearly two orders of magnitude) faster than a Raspberry Pi. Slight tangent, but there are users in the space who seem to be under the impression that they can use their Raspberry Pi for voice assistant/speech recognition.

Our goal is to provide an open-source voice assistant with a user experience equivalent to commercial offerings, and that is, and will remain for the foreseeable future, fundamentally impossible on CPU. Seriously: a GTX 1070 is at least 5x faster than a Threadripper 5955WX. With Willow Inference Server I'm constantly telling people that a six-year-old $100 Tesla P4/GTX 1070 walks all over even the best CPUs in the world for our primary task of speech to text/ASR, at dramatically lower cost and power usage.

I find this interesting because everyone seems to take it as obvious that integrated graphics and discrete graphics aren't even close for gaming, yet many find it difficult to accept a fundamental reality: GPUs are so physically different, and so much better suited to many/most ML tasks, that all the CPU tricks in the world cannot bring CPU performance anywhere near GPU performance (while maintaining quality/functionality). There seems to be a large gap in understanding with many users.

We run into this constantly with Willow and the Willow Inference Server.
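Speedup claims like these come down to measuring wall-clock inference latency of the same workload on each device. A minimal sketch of such a comparison harness (the callables and parameter names here are illustrative, not Willow's actual benchmark code):

```python
import time
import statistics

def bench(fn, warmup=2, runs=10):
    """Median wall-clock latency of fn() in seconds, after warmup calls.

    Warmup matters for GPU benchmarks: the first call often includes
    model transfer to device memory and kernel compilation.
    """
    for _ in range(warmup):
        fn()
    samples = []
    for _ in range(runs):
        t0 = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - t0)
    return statistics.median(samples)

def speedup(baseline_fn, candidate_fn, **kw):
    """How many times faster candidate_fn is than baseline_fn (>1 = faster)."""
    return bench(baseline_fn, **kw) / bench(candidate_fn, **kw)
```

In practice `baseline_fn` and `candidate_fn` would wrap the same ASR model (e.g., Whisper) loaded on CPU and GPU respectively, transcribing the same audio clip, so the only variable is the device.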