Crime fighting potential for computerised lip-reading

The three-year project, which starts next month, will collect data for lip-reading and use it to create machines that automatically convert videos of lip-motions into text.

It builds on work already carried out at UEA to develop state-of-the-art speech reading systems.

The university is teaming up with the Centre for Vision, Speech & Signal Processing at Surrey University, who have built accurate and reliable face and lip trackers, and the Home Office Scientific Development Branch, who want to investigate the feasibility of using the technology for crime fighting.

The team also hope to carry out computerised lip-reading of other languages.

While it is known that humans can and do lip-read, not much is known about exactly what visual information is needed for effective lip-reading. Human lip-reading can be unreliable, even using trained lip-readers.

Dr Richard Harvey, senior lecturer at UEA’s School of Computing Sciences, is leading the project, which has been awarded £391,814 by the Engineering and Physical Sciences Research Council.

“We all lip read, for example in noisy situations like a bar or party, but even the performance of expert lip readers can be very poor,” he said.

“It appears that the best lip-readers are the ones who learned to speak a language before they lost their hearing and who have been taught lip-reading intensively. It is a very desirable skill.”

Dr Harvey added: “The Home Office Scientific Development Branch is interested in anything that helps the police gather information about criminals or gather evidence.”

As well as crime fighting there could be other potential uses for the technology, such as installing a camera in a mobile phone, or on the dash board for in-car speech recognition systems.

Another reason for developing computerised lip-reading is that the number of trained lip-readers is falling, mainly because people tend to be taught to sign instead.

Dr Harvey said: “To be effective the systems must accurately track the head over a variety of poses, extract numbers, or features, that describe the lips and then learn what features correspond to what text.

“To tackle the problem we will need to use information collected from audio speech. So this project will also investigate how to use the extensive information known about audio speech to recognise visual speech.

“The work will be highly experimental. We hope to have produced a system that will demonstrate the ability to lip-read in more general situations than we have done so far.”

Media Contact

Press Office alfa

More Information:

http://www.uea.ac.uk

All latest news from the category: Information Technology

Here you can find a summary of innovations in the fields of information and data processing and up-to-date developments on IT equipment and hardware.

This area covers topics such as IT services, IT architectures, IT management and telecommunications.

Back to home

Comments (0)

Write a comment

Newest articles

Long-sought structure of powerful anticancer natural product

…solved by integrated approach. A collaborative effort by the research groups of Professor Haruhiko Fuwa from Chuo University and Professor Masashi Tsuda from Kochi University has culminated in the structure…

Making a difference: Efficient water harvesting from air possible

Copolymer solution uses water-loving differential to induce desorption at lower temperatures. Harvesting water from the air and decreasing humidity are crucial to realizing a more comfortable life for humanity. Water-adsorption…

In major materials breakthrough

UVA team solves a nearly 200-year-old challenge in polymers. UVA researchers defy materials science rules with molecules that release stored length to decouple stiffness and stretchability. Researchers at the University…