FlexSR

Flexible speech recognition system.

High accuracy regardless of dialect, accent or non-ideal speech Faster, more robust and tolerant of background noise Ideal for multi-user environments Computationally lightweight

 

Company Name

Oxford University Innovation

Reference Number 1958

Market Sectors:

Devices, Education, Electronics, IT & Telecoms

Operational Areas

IT & Telecoms
 

Commercial Solutions

Increase Efficiency
 

Speed to Market

Immediate
 

Say It Again?

Is the ‘a’ in bath like bar or like bat? A small difference, but in reality every person pronounces every word differently, even when they repeat themselves.

Statistically Speaking

As a result, most ASR systems, which are generally based on statistical-modelling techniques, require extensive training from thousands of recorded speakers just to master the variation within one dialect. Oxford’s FlexSR system outperforms many existing ASR systems at individual word recognition, and its lightweight nature is ideally suited to integration into existing technologies or for mobile deployment.

Key Benefits of Oxford’s FlexSR:

  • High accuracy regardless of dialect, accent or non-ideal speech
  • Faster, more robust and tolerant of background noise
  • Ideal for multi-user environments
  • Computationally lightweight
  • Potential for mobile deployment
  • Easily adaptable to any spoken language (currently it is implemented for English and German), including tonal languages
  • No system training required

Linguistic Model

For standard speech recognition software, high degrees of accuracy are only achieved with multi-layered and computationally-intensive models, requiring either state-of-the-art hardware, or in the case of mobile applications, a network connection to offload the analysis. In addition many systems also need to be trained against a particular voice to attain accurate recognition (although some might suggest that it is the speaker that is trained how to speak, not the software how to recognise!)

FlexSR is different. Rather than rely on statistical analysis alone, leading linguists at the University of Oxford developed a “sparse” linguistic model of the human cognitive representation of words. This theory suggests that humans store a very basic acoustic representation of each word, accepting wide variation in the sounds themselves and recognising words by their general pattern. Adopting this approach allows FlexSR to identify words across a wide range of speakers and dialects by extracting approximate sounds and matching these patterns with its internal word list or lexicon.

Easy Integration

Given the potential impact of this new approach and the broad range of applications, Isis Innovation welcomes discussions with potential development or integration partners.

Please confirm you wish to exchange contact information with the innovator of FlexSR.

You currently have 0 company connections available this month.

Confirm