Google Significantly Improves its Mobile Speech Recognition

Google has made some significant improvements to the speech recognizer on its mobile phones. The new software outputs every single character in real time and is entirely contained on the mobile device, which means that the dictation system will work offline with zero latency.

Johan Schalkwyk, a Google Fellow with the company’s Speech Team, explained the new system in a recent AI Blog post. According to Schalkwyk, more conventional speech recognition systems convert speech to text using a sequence that involves three separate steps, beginning with an analysis of an audio sample to identify specific sounds. The software then uses those sounds to form words and a language model to complete the sentence.

The drawback is that those traditional systems require a complete input sequence in order to generate a transcription. Google’s team used Recurrent Neural Network transducer (RNN-T) technology to convert audio input to text output on a character-by-character basis, improving speed by outputting each individual letter instead of a longer word or phrase.

The new platform is also smaller than its predecessors, reducing the speech recognizer footprint from 2 GB to 80 MB. At the former size, speech recognizers are too unwieldly to store on a mobile device and therefore require a network connection in order to function. The new dictation system is small enough to embed on a standard smartphone and will be available to customers on or offline.

For now, the new speech recognizer will only be available in American English on Pixel phones, though Google hopes to launch the service for more languages and devices soon. The announcement is the latest RNN breakthrough for the company’s speech recognition team, which achieved human parity back in 2017.

Source: Google AI Blog

(Originally posted on Mobile ID World)

Related News

Partners

FaceTec’s patented, industry-leading 3D Face Verification and Reverification software anchors digital identity, creating a chain of trust from user onboarding to ongoing authentication on all modern smart devices and webcams. FaceTec’s 3D FaceMaps™ finally make trusted, remote identity verification possible. As the only technology backed by a persistent spoof bounty program and NIST/iBeta Certified Liveness Detection, FaceTec is the global standard for 3D Liveness and Face Matching with millions of users on six continents in financial services, border security, transportation, blockchain, e-voting, social networks, online dating and more. www.facetec.com

Oz Forensics is the independent private vendor of robust, technology-based, and AI-powered liveness detection and face-matching solutions founded in 2017 and headquartered in Dubai, UAE. We confirm the security level of our solution by certifying the ISO-30107 Level 1 and 2 standards. https://ozforensics.com/

AuthenticID provides 100% automated identity verification and fraud detection solutions that are leveraged by companies worldwide, including 2 of the top 3 U.S. Banks, 8 out the top 10 wireless providers in North America, and 2 of the 3 credit bureaus. Using proprietary computer vision and machine learning technology, these solutions help companies accurately verify the identity of their users across retail, digital and call center environments for onboarding and ongoing re-authentication events; KYC, IAM, and more. The solutions are easy to integrate and provide customers a large ROI by stopping fraud losses, increasing customer conversion at onboarding, reducing operational costs and allowing quick and cost-effective operational scalability, all while ensuring global privacy regulations are complied with. https://www.authenticid.com/

Founded in 2007, Lakota Software Solutions is an American company with a world-renowned reputation for developing robust biometric software and systems. Our vendor-agnostic products are tailored to ensure compliance with ANSI/NIST-ITL standards and EBTS specifications, facilitate seamless integration with other biometric systems, and optimize accuracy, cost-effectiveness, and scalability. https://lakotasoftware.com/

Identity Week aims to be a significant identity industry catalyst. It’s our mission is to help accelerate the move towards a world where trusted identity solutions enable governments and commercial organisations to provide citizens, employees, customers and consumers with a multitude of opportunities to transact in a seamless, yet secure manner. All the while preventing the efforts of those intent on doing harm. https://identityweek.net/

The Biometric Digital Identity Prism is a market landscape framework designed to help influencers and decision makers understand, innovate, and implement digital identity technologies and solutions. This innovative framework for understanding and evaluating the rapidly evolving biometric digital identity marketplace is the only market model that is truly biometric-centric based on the foundational conviction that in the age of digital transformation the only true, reliable link between humans and their digital data is biometrics. https://www.the-prism-project.com

Related News

Footer

Follow Us