Google Hired Photographers To Help Train the AI-Powered ‘Clips’ Camera

Every camera in your life thus far has been entirely under your control: you decide what it shoots and for how long. Google's upcoming Clips camera is different. It uses machine learning to decide what's important, then saves or discards footage accordingly. Google unveiled the device in October 2017, but it still isn't available. It's expected to go on sale soon, and Google has now detailed how it tuned the camera's neural network to recognize what counts as a significant moment.

The idea is that Clips will run continuously, but it will only save the video clips it thinks you will want. There’s also a shutter button for manual captures. It has three hours of battery life and 16GB of local storage. It doesn’t upload any video to the cloud for processing, so the on-device neural network (called Moment IQ) needed to be extremely efficient.
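To make that behavior concrete, here is a minimal sketch of a buffer-and-keep loop of the kind the article describes. Everything in it, from the score_frame() stand-in to the threshold and buffer length, is an illustrative assumption, not Google's actual Moment IQ implementation, which hasn't been published.

```python
# Hypothetical sketch: continuously buffer frames, persist the buffered
# moment whenever a frame scores high enough. All names and constants
# here are invented for illustration.

import random
from collections import deque

BUFFER_SECONDS = 7     # assumed rolling-buffer length
FPS = 15               # assumed capture rate
KEEP_THRESHOLD = 0.95  # assumed "worth keeping" score

def score_frame(frame) -> float:
    """Stand-in for an efficient on-device model that rates a frame 0..1.
    A real scorer would weigh focus, framing, familiar faces, and so on;
    this dummy just returns a random value so the sketch runs."""
    return random.random()

def capture_loop(frames):
    """Buffer frames continuously; yield a clip whenever a moment scores high."""
    buffer = deque(maxlen=BUFFER_SECONDS * FPS)
    for frame in frames:
        buffer.append(frame)
        if score_frame(frame) >= KEEP_THRESHOLD:
            yield list(buffer)   # keep the whole buffered moment
            buffer.clear()       # avoid saving overlapping clips
        # frames that never clear the threshold simply age out of the buffer

if __name__ == "__main__":
    fake_camera = range(10 * FPS)   # 10 seconds of dummy "frames"
    clips = list(capture_loop(fake_camera))
    print(f"kept {len(clips)} clip(s) out of 10 seconds of footage")
```

The key property this structure captures is that nothing leaves the device: low-scoring footage is never written anywhere, it just ages out of the in-memory buffer.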

In many ways, Clips is the embodiment of what Google calls "human-centered design": the idea that AI products should do something useful for the user without adding stress or complication. Clips watches the world and hands you the videos it thinks will make you smile. Training the network to grasp an abstract concept like a memorable video was no easy feat, though.

Like any neural network, the model in Clips needed to see heaps of "good" and "bad" videos to tune its behavior. Some of the markers were obvious: video that's blurry or partially obstructed can be disregarded easily. However, Google also needed to teach the AI about the concept of time. You don't want too much filler in your clips, even if something interesting does happen at some point.
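One toy way to picture the time problem: given per-frame "interestingness" scores, a clip selector could keep only the short contiguous window with the highest total score, trimming filler on both sides. The scores, window length, and function below are invented for illustration and have nothing to do with Google's training data.

```python
# Hypothetical illustration of trimming filler: pick the contiguous
# `window`-frame span whose summed score is highest.

def best_window(scores: list[float], window: int) -> tuple[int, int]:
    """Return (start, end) of the best-scoring contiguous span,
    so dull footage on either side of a moment gets trimmed."""
    if window >= len(scores):
        return 0, len(scores)
    total = sum(scores[:window])
    best_total, best_start = total, 0
    for i in range(window, len(scores)):
        total += scores[i] - scores[i - window]   # slide the window by one frame
        if total > best_total:
            best_total, best_start = total, i - window + 1
    return best_start, best_start + window

# Dull footage, a brief interesting moment, then dull footage again:
scores = [0.1, 0.1, 0.2, 0.9, 0.95, 0.85, 0.2, 0.1]
print(best_window(scores, 3))  # -> (3, 6): just the interesting stretch
```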

The team behind Clips initially just showed the model images they considered beautiful or interesting, reasoning that this, along with an understanding of depth of field, lighting, and the rule of thirds, would let the network learn what makes a good clip. However, that wasn't the case. To help instill the nuances of memorable video in Clips, Google hired a documentary filmmaker, a photojournalist, and a fine arts photographer to select the images used to train it.

Google admits the AI won't be perfect, but the team seems confident it will be capable of capturing "good" videos. We won't know how much of the footage from Clips is actually good until it launches. There's still no release date, but the camera will cost $250 when it arrives.
