Friday, August 10, 2018

Google Speech Recognition


  • Speech recognition is the ability of a machine or program to identify words and phrases in spoken language and convert them to a machine-readable format.
  • Put simply, it means talking to a computer and having it recognize what we are saying.
  • At its core, the process is a pipeline that converts PCM (Pulse Code Modulation) digital audio from a sound card into recognized speech.
  • Speech recognition technology has evolved for more than 40 years, spurred on by advances in signal processing, algorithms, architectures, and hardware. During that time it has gone from a laboratory curiosity to an art, and eventually to a full-fledged technology that is practiced and understood by a wide range of engineers, scientists, linguists, psychologists, and systems designers.
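
  To make the PCM step of that pipeline a little more concrete, here is a minimal Python sketch of what "PCM digital audio" looks like to a program. It is only an illustration under stated assumptions: the audio is 16-bit, mono, little-endian PCM at 16 kHz, and the helper names (synth_pcm16, pcm16_to_floats, frame_energies) are made up for this post. A real recognizer does far more than this; the sketch only shows the first stage, turning raw bytes into samples and short frames.

```python
import math
import struct


def synth_pcm16(freq_hz=440.0, sample_rate=16000, duration_s=0.01):
    """Generate a mono sine tone as little-endian 16-bit PCM bytes.

    Stands in for audio that would normally come from a sound card.
    """
    n = int(sample_rate * duration_s)
    samples = [
        int(32767 * math.sin(2 * math.pi * freq_hz * i / sample_rate))
        for i in range(n)
    ]
    return struct.pack("<%dh" % n, *samples)


def pcm16_to_floats(pcm_bytes):
    """Decode little-endian 16-bit PCM bytes into floats in [-1.0, 1.0)."""
    n = len(pcm_bytes) // 2
    ints = struct.unpack("<%dh" % n, pcm_bytes[: n * 2])
    return [s / 32768.0 for s in ints]


def frame_energies(samples, frame_len=80):
    """Split samples into fixed-length frames and return each frame's mean-square energy."""
    return [
        sum(x * x for x in samples[i : i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


pcm = synth_pcm16()                 # 10 ms of audio at 16 kHz = 160 samples
samples = pcm16_to_floats(pcm)      # raw bytes -> normalized samples
energies = frame_energies(samples)  # 5 ms frames (80 samples each)
```

  Frame-level values like these energies are the kind of input that later stages (feature extraction and the recognition model itself) build on.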



  • Google Speech Recognition enables developers to convert audio to text by applying powerful neural network models in an easy-to-use API. The API recognizes 120 languages and variants to support your global user base. You can enable voice command-and-control, transcribe audio from call centers, and more. It can process real-time streaming or prerecorded audio, using Google’s machine learning technology.



  • Under the hood, it applies deep-learning neural network algorithms to the audio to achieve high recognition accuracy.
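
  As a rough idea of what "converting audio to text through an API" involves, here is a small Python sketch that builds the JSON body for a speech recognition request. The field names (config, encoding, sampleRateHertz, languageCode, audio.content) follow the Cloud Speech-to-Text v1 REST "recognize" request as I understand it, so please check them against the current documentation before relying on them. The function name build_recognize_request is my own, the audio here is just silence, and nothing is actually sent: authentication and the HTTP call are deliberately left out.

```python
import base64
import json


def build_recognize_request(pcm_bytes, sample_rate_hz=16000, language_code="en-US"):
    """Build a JSON-serializable request body for a synchronous recognize call.

    Assumes raw 16-bit linear PCM audio; the audio bytes are base64-encoded
    because JSON cannot carry binary data directly.
    """
    return {
        "config": {
            "encoding": "LINEAR16",
            "sampleRateHertz": sample_rate_hz,
            "languageCode": language_code,
        },
        "audio": {
            "content": base64.b64encode(pcm_bytes).decode("ascii"),
        },
    }


# 0.1 s of silence as placeholder audio (1600 samples of 16-bit zeros).
silence = b"\x00\x00" * 1600
body = build_recognize_request(silence)
payload = json.dumps(body)  # this string would be the POST body
```

  Changing language_code is how a request would select one of the supported languages and variants.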


    APPLICATIONS:

  • In-Car Systems.
  • Medical Care.
  • Military.
  • Smartphones.
  • People with Disabilities.

         

  Many software companies offer their own virtual assistants:
  • Alexa (Amazon)
  • Cortana (Microsoft)
  • Google Assistant (Google)
  • Siri (Apple Inc.)



  • From the technology perspective, speech recognition has a long history with several waves of major innovations. Most recently, the field has benefited from advances in deep learning and big data. The advances are evidenced not only by the surge of academic papers published in the field, but more importantly by the worldwide industry adoption of a variety of deep learning methods in designing and deploying speech recognition.
  • Industry players include Google, IBM, Baidu, Apple, Amazon, Nuance, and SoundHound, many of which have publicized the core technology in their speech recognition systems as being based on deep learning.


         
         If you enjoyed this blog post, share it with a friend!
         Next week I will post more about AI...so stay tuned!  






