New artificial intelligence software boosts web searches

Washington, Nov 19:

Scientists have created artificial intelligence software that uses photos to locate documents on the Internet with far greater accuracy than ever before.

The new system, which was tested on photos and is now being applied to videos, shows for the first time that a machine learning algorithm for image recognition and retrieval is accurate and efficient enough to improve large-scale document searches online.

The system developed by researchers at Dartmouth College, Tecnalia Research and Innovation and Microsoft Research Cambridge, uses pixel data in images and potentially video rather than just text to locate documents.

It learns to recognise the pixels associated with a search phrase by studying the results from text-based image search engines.

The knowledge gleaned from those results can then be applied to other photos without tags or captions, making for more accurate document search results.

Images abound on the Internet and our approach means theyll no longer be ignored during document retrieval, said Associate Professor Lorenzo Torresani, co-author of the study.

Over the last 30 years, the Web has evolved from a small collection of mostly text documents to a modern, gigantic, fast-growing multimedia dataset, where nearly every page includes multiple pictures or videos. When a person looks at a Web page, she immediately gets the gist of it by looking at the pictures in it, said Torresani.

Yet, surprisingly, all existing popular search engines, such as Google or Bing, strip away the information contained in the photos and use exclusively the text of Web pages to perform the document retrieval.

Our study is the first to show that modern machine vision systems are accurate and efficient enough to make effective use of the information contained in image pixels to improve document search, Torresani added.

Read the original post:

New artificial intelligence software boosts web searches

Related Posts

Comments are closed.