NEWS Intel has released software that lets computers read lips, a step forward that could lead to better voice recognition applications. The Audio Visual Speech Recognition (AVSR) software tracks a speaker's face and mouth movements. By matching these movements with speech, the application can provide a computer with enough data to respond to voice recognition commands, even when these are given in noisy environments. The AVSR program is part of the OpenCV computer vision library, a collection of open-source applications and tools that help computers interpret visual data. Computer companies have tried to popularise voice recognition applications for years, but have been stymied by a shortfall in processing power in most computers and by the restricted performance of their software. Both of these factors are changing rapidly. Average processors now run at over 1.5GHz, while top-of-the-line chips run at 3GHz. Additionally, researchers are getting a better handle on how to write applications that will work with voice commands. One way to improve such applications is to put a visual signal into the voice recognition scheme as Intel is doing. Microsoft Research, for example, has developed a prototype application called GWindows, which a person can use to scroll through files or move windows though a combination of voice commands and hand gestures, said Andy Wilson, the project's designer. With GWindows, a video camera mounted on a television monitor follows moving objects, such as a hand or pointer, that come within 20 inches of the screen. The application interprets any hand movements (or pointer gestures) as computer commands: Placing a finger over a window and then moving a finger left will move the window left, for example. If a voice command such as "scroll" is given, the computer will combine the finger and voice commands and scroll down. No special gloves are needed. Microsoft's prototype application works better than a simple voice recognition system because the gestures improve accuracy, according to Wilson, who has demonstrated that the computer can follow voice commands in a crowded room filled with multiple conversations and lots of interference. Such visual signal software relies in part on Bayesian mathematics, which is influencing other interface and artificial intelligence projects at Microsoft. In Bayesian maths, computers essentially rely on statistics. If a computer "sees" a sweeping hand gesture toward the left a number of times, it will consistently interpret that gesture as a command to move a file toward the left. Intel has other visual applications to AVSR in the works. The chip giant is looking into an application that uses cameras to monitor hospital patients for risk of strokes and into software that uses a security camera feed to detect potential criminals in a parking lot. The underlying principles of these programs are the same: The computer sends an alert when it sees something unusual - a slowing in a patient's gait or a person going from car to car instead of into the mall - in its video stream. The work on these applications and the development of AVSR is taking place at Intel's China Research Center in Beijing. In other Intel software research news, the company has released a test version of a technical library for building Bayesian networks, said Gary Bradski, a senior researcher in Intel's Microprocessor Research Labs who helped create the OpenCV library. A final version of the technical library, called the Probability Network Library, will come out by the end of the year, he said. Michael Kanellos writes for News.com
Intel sets sights on lip-reading software
Could improve voice recognition technology
Post your comment
In order to post a comment you need to be registered and logged in.
You can also log in with Facebook. Log in or create your silicon.com account below
Latest Software stories
Get silicon.com's daily newsletter
-

Enter your email to register
Featured white papers
-
Systems engineering: Best practice for development success
Systems engineering isn't just a technical activity in the product lifecycle—it determines the commercial viability of...
-
Use product development for competitive advantage
Remember when MP3 players just played music? Today, consumers want players that can host music, stream video, support...
-
How to Communicate More Effectively at Work
We're constantly being held back by the tools and processes that were supposed to revolutionise our workday. Email...
Popular Software stories
Keep in touch with silicon.com
-
Connect with silicon.com on Facebook
Discuss the news of the day with the silicon.com team
-
Follow silicon.com on Twitter
Get regular updates from the silicon.com editors
-
Join the silicon.com LinkedIn networking group
Network with your peers and share expertise
Latest jobs
-
Project Manager
Black Rock Studio [A division of Disney Interactive Media Group] is currently recruiting for a Project Manager to...
-
Business Analyst ( ISEB, CBAP, BA, Analyst)
Business Analyst ( ISEB, CBAP, BA, Analyst) £31,000-£42,000 + excellent benefits We take the best Business...
-
Head of Financial Accounts
A large and forward thinking NHS organisation at the forefront of the NHS change agenda currently seeks an Interim...
silicon.com newsletters
-
Stay up to date with silicon.com newsletters
Keep up with the latest news and analysis from silicon.com with our free email newsletters





