Doing to images what it does to words
By Tom Krazit
Published: 22 June 2009 16:14 GMT
Google thinks it has made a breakthrough in "computer vision".
Imagine that, while searching online for holiday destinations, you stumble upon a picture of a beautiful landscape filled with ancient ruins that you don't recognise at first glance. Google has developed a way for a person to give it the URL of that image and have it checked against a database of more than 40 million geotagged photos, matching the image to verified landmarks and giving you a destination for that next trip.
The project is still very much in the research stage, said Jay Yagnik, Google's head of computer vision research. The company plans to present a paper today at the Computer Vision and Pattern Recognition Conference in Miami detailing its work in proving that large, scattered sets of data can be used to make accurate assessments of individual images.
Yagnik said: "This is a fundamental advancement in how we look at computer vision."
To create the "landmark recognition engine", Google took advantage of the 40 million or so images in Picasa and Panoramio that were geotagged with the locations of famous landmarks, like the Eiffel Tower. It also assembled images from travel guide sites such as Wikitravel as a base of landmark photos that had been verified by experts.
With all that data as a backdrop, researchers figured out a way to find the most representative pictures of a landmark using a clustering technique to group images taken from similar perspectives.
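The article does not say which clustering technique Google used, so the following is only a minimal sketch of the general idea. It assumes each photo has already been reduced to a short numeric feature vector (a hypothetical stand-in for whatever image features the researchers extracted), groups vectors that lie close together with a simple greedy threshold rule, and picks the member nearest each cluster's centre as the "most representative" picture. The names `cluster_by_similarity`, `representative` and the `threshold` parameter are illustrative, not Google's.

```python
from math import dist


def _centroid(points):
    """Return the mean point of a non-empty list of equal-length tuples."""
    n = len(points)
    return tuple(sum(p[k] for p in points) / n for k in range(len(points[0])))


def cluster_by_similarity(vectors, threshold):
    """Greedy clustering (an assumed stand-in technique).

    Each vector joins the first cluster whose centroid lies within
    `threshold`; otherwise it starts a new cluster. Returns a list of
    clusters, each a list of indices into `vectors`.
    """
    clusters = []
    for i, v in enumerate(vectors):
        for members in clusters:
            centre = _centroid([vectors[j] for j in members])
            if dist(v, centre) <= threshold:
                members.append(i)
                break
        else:
            # No existing cluster was close enough.
            clusters.append([i])
    return clusters


def representative(vectors, members):
    """Return the index of the cluster member closest to the centroid."""
    centre = _centroid([vectors[j] for j in members])
    return min(members, key=lambda j: dist(vectors[j], centre))


# Toy data: three photos from one viewpoint, two from another.
photos = [(0.0, 0.0), (0.1, 0.0), (0.2, 0.1), (5.0, 5.0), (5.1, 5.0)]
groups = cluster_by_similarity(photos, threshold=1.0)
best = [representative(photos, g) for g in groups]
```

A real system would work with far richer features and millions of photos; the point here is only the shape of the step: group similar views, then keep one typical image per group.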
Then, when given a fresh image to analyse, the system uses pixel-matching techniques to find small patterns within that image and look for similar patterns within verified photos of landmarks. Google said the system returns an accurate result 80 per cent of the time, not only naming the landmark but also supplying additional information about the place.
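As a rough illustration of that matching step, the sketch below assumes each image has been turned into a handful of small numeric "descriptors", one per local pattern. How Google actually extracts and compares patterns is not described in the article; the descriptor format, the `max_distance` and `min_matches` thresholds, and the function names are all hypothetical. The query image is assigned to whichever landmark's verified photos share the most patterns with it, or to none if too few patterns match.

```python
from math import dist


def count_matches(query_descriptors, reference_descriptors, max_distance):
    """Count query descriptors with a reference descriptor within max_distance."""
    if not reference_descriptors:
        return 0
    matches = 0
    for q in query_descriptors:
        nearest = min(dist(q, r) for r in reference_descriptors)
        if nearest <= max_distance:
            matches += 1
    return matches


def recognise_landmark(query_descriptors, landmark_index,
                       max_distance=0.5, min_matches=2):
    """Return the name of the best-matching landmark, or None.

    `landmark_index` maps a landmark name to the descriptors taken
    from its verified photos (an assumed data layout).
    """
    best_name, best_score = None, 0
    for name, refs in landmark_index.items():
        score = count_matches(query_descriptors, refs, max_distance)
        if score > best_score:
            best_name, best_score = name, score
    # Refuse to answer when too few local patterns agree.
    return best_name if best_score >= min_matches else None


# Toy index with made-up two-dimensional descriptors.
index = {
    "Eiffel Tower": [(1.0, 1.0), (2.0, 2.0), (3.0, 3.0)],
    "Colosseum": [(10.0, 10.0), (11.0, 11.0)],
}
query = [(1.1, 1.0), (2.0, 2.1), (9.0, 0.0)]
result = recognise_landmark(query, index)
unknown = recognise_landmark([(50.0, 50.0)], index)
```

Returning `None` below the `min_matches` threshold reflects the fact that such a system will not always find a confident answer; the 80 per cent accuracy figure quoted by Google is not something this toy example measures or reproduces.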
Google is by no means certain when, or if, this research will turn into a product. It is excited, however, that it has found a way to use computers to process large sets of data available on the internet and return accurate information about images; doing this with text, of course, is what has made Google, Google.
Original article: Google's vision improving for image search from CNET News.com