What’s in This Picture? AI Becomes as Smart as a Toddler

Developments in machine learning allow computers to answer more complex questions about the contents of images

An employee walks past dome structures used as nap rooms in the Baidu headquarters in Beijing.
Photographer: Tomohiro Ohsumi/Bloomberg

Provide news feedback or report an error

Confidential tip?

Send a tip to our reporters

Site feedback:

Take our Survey

By Jack Clark

May 22, 2015 at 1:16 AM UTC

This article is for subscribers only.

Artificial intelligence has graduated past the infancy stage of figuring out what's in an image. Computers have previously been capable of little more than a simple game of I Spy: Name a specific object or person, and they'll show you an image containing it. But thanks to new developments in AI research, machines can now answer more complex questions, like, “What is there on the grass, except the person?” (For the answer to that awkwardly worded enigma, take a look at the last image.)

A research paper published on Thursday in Cornell University's Arxiv outlines a system that learns to identify fine-grained visual features of images, and the words associated with them. Then it combines the two into a dictionary in its digital brain. It then references this to answer new questions about never-before-seen images.