Microsoft developed an AI that creates amazing caricatures

Stanford graduate student Kaidi Cao will join fellow AI researchers Jing Liao, of City University of Hong Kong, and Lu Yuan of Microsoft at SIGGRAPH Asia in Tokyo this December to present their incredible caricature-drawing neural network.

That’s not bad, considering Cao was only an intern at the Visual Computing Group at the Microsoft Research Lab in Beijing when he worked on the project.

The AI, actually a pair of generative adversarial networks (GAN), is called CariGANs. The first of its neural networks, CariGeoGAN, determines the geometry of a face in a photograph and maps it to a caricature model. CariStyGAN, the other half of CariGANs, does the “style transfer,” or applies the artistic look to the geometry map.

The <3 of EU tech

The latest rumblings from the EU tech scene, a story from our wise ol' founder Boris, and some questionable AI art. It's free, every week, in your inbox. Sign up now!

In order to imbue CariGANs with the ability to turn a relatively boring photograph into a delightful feast for your eyes (tourists on the boardwalk, I’m talking to you) the system was trained on thousands of hand-drawn images.

To determine the efficacy of the machine, the researchers conducted two studies. The first was to ensure the AI’s caricatures retained the identity of the portrait subject. The assertion here is that a good caricature has to capture a person’s essence in exaggerated form. According to the researchers, respondents indicated the CariGANs caricatures compared favorably to hand-drawn artists’.

The researchers conducted the second study to determine if the overall effectiveness of the “drawing” compared with human-drawn pieces. This too appears to indicate success:

Note that ours is ranked better than the hand-drawn one 22.95% of the times, which means our results sometime can fool users into thinking it is the real hand-drawn caricature. Although it is still far from an ideal fooling rate (i.e., 50%), our work has made a big step approaching caricatures drawn by artists, compared to other methods.

The CariGANs AI can also parse frames from video and create caricatures from it. Basically, it can generate a drawing from a single frame that is consistent with ones generated from other frames. The source images in the following picture are taken from the individual frames of an public-domain video of the President speaking.

This could be incredibly useful for animators. It’s also a hilariously spot-on way to look at the president, and proof that “art” created in tandem with an AI can stir something in the human spirit.

The CariGANs AI can also reverse-engineer a caricature and determine what the person in the cartoon really looks like. The researchers say “We believe it might be useful for face recognition in caricatures.”

That sounds a bit terrifying. But if it means I can use an ink drawing of me with a giant head and a sombrero as ID in the future, count me in.

Story by Tristan Greene

Editor, Neural by TNW

Tristan is a futurist covering human-centric artificial intelligence advances, quantum computing, STEM, physics, and space stuff. Pronouns: (show all) Tristan is a futurist covering human-centric artificial intelligence advances, quantum computing, STEM, physics, and space stuff. Pronouns: He/him

Get the TNW newsletter

Get the most important tech news in your inbox each week.

Also tagged with

Microsoft

Microsoft developed an AI that creates amazing caricatures

Get the TNW newsletter

Also tagged with

New technique makes AI hallucinations wake up and face reality

Microsoft to pump €3.2B into German AI technologies

Join TNW All Access

New hope for Microsoft-Activision deal after UK regulator reopens consultation

Quantinuum, Microsoft claim to have quieted quantum computing ‘noise’