Google's new AI knows which images you'll like - before you've even seen them
A Google algorithm has learned which pictures we'll find 'aesthetically near-optimal'. Image: REUTERS/Juan Medina
Machines can now rank photos according to their aesthetic appeal, in much the same way humans do, thanks to a new artificial intelligence (AI) model created by researchers at Google.
Neural Image Assessment, or NIMA, was designed using a machine learning algorithm known as a convolutional neural network (CNN), which uses data that’s been rated and labeled by humans.
Unlike existing models which typically only categorize images based on their quality, NIMA can be trained to predict which images a human would rate as technically good and aesthetically attractive.
NIMA achieves this by recognizing and classifying images based on a variety of characteristics humans associate with emotions and beauty, scoring each on a scale of one to 10, the researchers said in a blog post announcing the technology.
“This is more directly in line with how training data is typically captured, and it turns out to be a better predictor of human preferences when measured against other approaches,” the researchers claim.
To test the model’s capabilities, NIMA was pitted against photos from the large-scale database for Aesthetic Visual Analysis (AVA). Each AVA photo is scored by an average of 200 people in response to photography contests.
After training, the aesthetic ranking of many of these photos by NIMA closely matched the mean scores given by humans, as illustrated below.
According to Google, NIMA scores can be used to compare the quality of different versions of the same image, which may have been distorted in various ways, while also being used to enhance the perceptual quality of an image, finding “aesthetically near-optimal” settings for brightness, highlights and shadows.
NIMA could also be used to improve photography in real-time and assist in the editing process, the researchers say.
Speaking with authority
Interestingly, NIMA is just one of a number of machine learning tools with near human-capabilities that Google has developed.
For example, a research paper published by the tech giant in January details a text-to-speech system called Tacotron 2, which claims near-human accuracy at imitating audio of a person speaking from text.
The system can also handle hard-to-pronounce words and names, as well as alter the way it enunciates based on punctuation. For instance, capitalized words are stressed, as someone would do when indicating that specific word is an important part of a sentence.
Chess champion
Meanwhile, researchers from Google’s DeepMind AI lab recently developed AlphaZero, which absorbed all of humanity’s chess knowledge in around four hours.
After being programmed with only the rules of chess, AlphaZero had mastered the game to the extent it was able to beat the highest-rated chess-playing program, Stockfish.
David Kramaley, CEO of chess science website Chessable, said: “It will no doubt revolutionize the game, but think about how this could be applied outside chess. This algorithm could run cities, continents, universes.”
Don't miss any update on this topic
Create a free account and access your personalized content collection with our latest publications and analyses.
License and Republishing
World Economic Forum articles may be republished in accordance with the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Public License, and in accordance with our Terms of Use.
The views expressed in this article are those of the author alone and not the World Economic Forum.
Stay up to date:
Digital Communications
Related topics:
The Agenda Weekly
A weekly update of the most important issues driving the global agenda
You can unsubscribe at any time using the link in our emails. For more details, review our privacy policy.
More on Emerging TechnologiesSee all
Hope French and Michael Atkinson
November 7, 2024