Microsoft’s speech recognition system is now as good as a human

Microsoft's speech recognition system is now as good as a human  #ai

  • The company’s conversational speech recognition system has finally reached an error rate of only 5.1 percent, putting it on par with the accuracy of professional human transcribers for the first time ever.
  • A year ago, the Microsoft’s speech and dialog research group refined its system to reach a 5.9 percent word error rate.
  • This was generally considered to be the average human error rate, but further work by other researchers suggested that 5.1 percent was closer to the mark for humans professionally transcribing speech heard in a conversation.
  • For over 20 years, a collection of recorded phone conversations known as Switchboard has been used to test speech recognition system for accuracy.
  • To reduce the system’s error rate by about 12 percent from last year’s benchmark results, the team incorporated a series of improvements into its neural net-based acoustic and language models.

Microsoft researchers have hit a milestone 25 years in the making. The company’s conversational speech recognition system has finally reached an error rate of only 5.1 percent, putting it on par with the accuracy of professional human transcribers for the first time ever.
Continue reading “Microsoft’s speech recognition system is now as good as a human”