This page contains a Flash digital edition of a book.
XXXX


DIGITAL DICTATION, DOCUMENTS & DATA


Big data helping speech recognition become mainstream


Steve Young, Professor of Information Engineering at the University of Cambridge, and a global expert in speech recognition technologies, gives his thoughts on the advances and challenges facing this ‘growing’ research area. David Stevenson reports.


of the Institute of Electrical and Electronics Engineers (IEEE) James L Flanagan Speech and Audio Processing Award.


U


The annual prize is given to an individual or small team for “an outstanding contribution to the advancement of speech and/or audio signal processing”. For the last 35 years, Young, who is Professor of Information Engineering at the University, has focused his attention on developing systems that allow humans to interact with machines using voice.


He told NHE that research in this area made steady but not spectacular progress from the mid-1980s to the mid-2000s. “But over the last five to 10 years we’ve seen really quite significant acceleration in progress,” he said. “And that is why we are now seeing speech recognition coming into the mainstream with services like Apple Siri and Google Now, and the new smart watches that do speech recognition.”


Prof Young, who is the senior pro-vice- chancellor responsible for planning and resources at Cambridge, added that modern systems are built on the notion of building statistical models that represent the data.


72 | national health executive Sep/Oct 14


niversity of Cambridge’s Professor Steve Young will be the 2015 recipient


“So the way you build a speech recogniser, essentially, is that you get some data, which is people speaking, you transcribe the data and then you try to model the data and find a way to automatically generate the transcriptions yourself – and then you have a speech recogniser,” he said. “The key to all of that is some quite sophisticated statistical modelling algorithms and the availability of the data.”


Big data


The expert told us that it is the nature of data, and its wide availability nowadays, that has changed the speech recognition landscape. “When you speak into your phone, the signal is being routed to a server farm somewhere in North Virginia if you’re Apple or the Arizona desert if you’re Google, and it is being processed there and the result is being fed-back to your phone,” said Prof Young.


This allows two things to happen. Firstly, it unleashes the possibility of using some very powerful computing to recognise people’s voices. Then secondly, and more importantly, the companies are capturing the data.


“When Siri was first launched, for example, it wasn’t that great,” said Prof Young, “but as


more people started using it the company was capturing huge amounts of data. And then by using and collecting the data and upgrading the models, people found the recognition improved so they used the system more, so they gave more data. That has happened over a wide range of fields, and it is the ‘big data’ paradigm that we are hearing a lot about.”


He added that the internet is also allowing organisations, be it research or commercial, to collect huge datasets and do things that they could never do before, and that is what has led to a rapid improvement in performance and an “explosion of interest” in the field.


“It is nice, looking back, that we’ve had these bursts of speech recognition research being a very hot topic, then people being disillusioned with it and it going out of fashion, and then coming back,” added Prof Young. “And we are in one of those phases where it is back and people are very interested, especially with the big players investing huge amounts in improving services.”


Dictation and voice recognition in healthcare


Dictation in the medical area has been one


Page 1  |  Page 2  |  Page 3  |  Page 4  |  Page 5  |  Page 6  |  Page 7  |  Page 8  |  Page 9  |  Page 10  |  Page 11  |  Page 12  |  Page 13  |  Page 14  |  Page 15  |  Page 16  |  Page 17  |  Page 18  |  Page 19  |  Page 20  |  Page 21  |  Page 22  |  Page 23  |  Page 24  |  Page 25  |  Page 26  |  Page 27  |  Page 28  |  Page 29  |  Page 30  |  Page 31  |  Page 32  |  Page 33  |  Page 34  |  Page 35  |  Page 36  |  Page 37  |  Page 38  |  Page 39  |  Page 40  |  Page 41  |  Page 42  |  Page 43  |  Page 44  |  Page 45  |  Page 46  |  Page 47  |  Page 48  |  Page 49  |  Page 50  |  Page 51  |  Page 52  |  Page 53  |  Page 54  |  Page 55  |  Page 56  |  Page 57  |  Page 58  |  Page 59  |  Page 60  |  Page 61  |  Page 62  |  Page 63  |  Page 64  |  Page 65  |  Page 66  |  Page 67  |  Page 68  |  Page 69  |  Page 70  |  Page 71  |  Page 72  |  Page 73  |  Page 74  |  Page 75  |  Page 76  |  Page 77  |  Page 78  |  Page 79  |  Page 80  |  Page 81  |  Page 82  |  Page 83  |  Page 84  |  Page 85  |  Page 86  |  Page 87  |  Page 88  |  Page 89  |  Page 90  |  Page 91  |  Page 92  |  Page 93  |  Page 94  |  Page 95  |  Page 96  |  Page 97  |  Page 98  |  Page 99  |  Page 100  |  Page 101  |  Page 102  |  Page 103  |  Page 104