Google Tech Describes Photos Using Natural Language

Google reveals auto-captioning technology that describes the content of photos

Putting pictures on the Web may have many potential uses, be it for personal purposes or for the greater good of the online masses. But tagging said images with useful information for searching purposes can be a lengthy process, especially if you have thousands or millions of photos.

With that in mind, Google has revealed a new captioning system that recognizes the content of photos and automatically tags them with descriptions using natural language.

Though there are many examples of intelligent computer vision software that can auto-tag images, this takes things a step further by enabling full descriptions. This could be ‘two dogs play in the grass’, or ‘a little girl in a pink hat is blowing bubbles’.

As you can see from these snapshots, it’s still not entirely accurate all the time, but the fact that this is even close to being realized with even a degree of accuracy, is pretty exciting.

TNW City Coworking space - Where your best work happens

A workspace designed for growth, collaboration, and endless networking opportunities in the heart of tech.

Book a tour now

While it’s still an early-stage research project, this holds significant promise for the future of artificial intelligence and machine-learning.

“This kind of system could eventually help visually impaired people understand pictures, provide alternate text for images in parts of the world where mobile connections are slow, and make it easier for everyone to search on Google for images,” the company says in a blog post.

You can read a more detailed summary of the technology here, or click on the link below to peruse the full paper.

➤ Show and Tell: A Neural Image Caption Generator [Cornell University Library]

Story by Paul Sawers

Paul Sawers was a reporter with The Next Web in various roles from May 2011 to November 2014. Follow Paul on Twitter: @psawers or check h (show all) Paul Sawers was a reporter with The Next Web in various roles from May 2011 to November 2014. Follow Paul on Twitter: @psawers or check him out on Google+.

Get the TNW newsletter

Get the most important tech news in your inbox each week.

Also tagged with

Google

Google reveals auto-captioning technology that describes the content of photos

Get the TNW newsletter

Also tagged with

A German court says Google’s AI Overviews are Google’s own words, and it’s liable when they’re false

Google DeepMind’s TacticAI can predict football plays 8 seconds before they happen. Palmeiras is the first to use it.

Discover TNW All Access

Google is funding 300,000 electricians and welders, because the AI boom is running out of them

Your face is the ticket: Google’s Gemini and biometric gates are the World Cup’s quieter tech story