Google Cloud Platform Blog
Product updates, customer stories, and tips and tricks on Google Cloud Platform
Google Cloud Vision API changes the way applications understand images
Wednesday, December 2, 2015
Have you ever wondered how Google Photos helps you find all your favorite dog photos? With today’s release of Google Cloud Vision API, developers can now build powerful applications that can see and, more importantly, understand the content of images. The uses of Cloud Vision API are game-changing for developers of all types of applications, and we are very excited to see what happens next!
Advances in machine learning, powered by platforms like TensorFlow, have enabled models that can learn and predict the content of an image. Our limited preview of Cloud Vision API encapsulates these sophisticated models as an easy-to-use REST API. Cloud Vision API quickly classifies images into thousands of categories (e.g., "boat", "lion", "Eiffel Tower"), detects faces with associated emotions, and recognizes printed words in many languages. With Cloud Vision API, you can build metadata on your image catalog, moderate offensive content, or enable new marketing scenarios through image sentiment analysis.
The following set of Google Cloud Vision API features can be applied in any combination on an image:
Label/Entity Detection picks out the dominant entity (e.g., a car, a cat) within an image, from a broad set of object categories. You can use the API to easily build metadata on your image catalog, enabling new scenarios like image-based searches or recommendations.
Optical Character Recognition retrieves text from an image. Cloud Vision API provides automatic language identification and supports a wide variety of languages.
Safe Search Detection detects inappropriate content within your image. Powered by Google SafeSearch, the feature enables you to easily moderate crowd-sourced content.
Facial Detection detects when a face appears in photos, along with associated facial features such as eye, nose and mouth placement, and the likelihood of more than eight attributes such as joy and sorrow. We don't support facial recognition and we don’t store facial detection information on any Google server.
Landmark Detection identifies popular natural and manmade structures, along with the associated latitude and longitude of the landmark.
Logo Detection identifies product logos within an image. Cloud Vision API returns the identified product brand logo, with the associated bounding polygon.
You can currently call the API by embedding an image as part of the request. In future phases, we will add support for integrating with Google Cloud Storage. The Vision API enables you to request one or more annotation types per image.
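As a rough illustration of embedding an image in a request and asking for several annotation types at once, here is a minimal Python sketch that only builds the JSON body and makes no network call. The field names (`requests`, `image.content`, `features`, `maxResults`) and the feature type strings are assumptions taken from the later public v1 REST API and may not match the limited preview exactly. `build_annotate_request` and the short keys in `FEATURE_TYPES` are hypothetical names chosen for this example.

```python
import base64
import json

# Short, hypothetical aliases mapped to feature type strings.
# The strings follow the public v1 REST API (an assumption for the preview).
FEATURE_TYPES = {
    "labels": "LABEL_DETECTION",
    "text": "TEXT_DETECTION",
    "safe_search": "SAFE_SEARCH_DETECTION",
    "faces": "FACE_DETECTION",
    "landmarks": "LANDMARK_DETECTION",
    "logos": "LOGO_DETECTION",
}


def build_annotate_request(image_bytes, features, max_results=5):
    """Return a JSON-serializable body with the image embedded as base64."""
    unknown = [name for name in features if name not in FEATURE_TYPES]
    if unknown:
        raise ValueError("unknown feature(s): " + ", ".join(unknown))
    return {
        "requests": [
            {
                # The image travels inside the request as base64 text.
                "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
                # One entry per requested annotation type.
                "features": [
                    {"type": FEATURE_TYPES[name], "maxResults": max_results}
                    for name in features
                ],
            }
        ]
    }


if __name__ == "__main__":
    # Placeholder bytes stand in for a real image file's contents.
    body = build_annotate_request(b"placeholder image bytes", ["labels", "faces"])
    print(json.dumps(body, indent=2))
```

The resulting body would be sent as an authenticated HTTP POST to the API endpoint. That step is left out here because the preview's endpoint and credentials are not described in this post.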
To show a simple example of the Vision API, we have built a fun Raspberry Pi-based platform with just a few hundred lines of Python code that call the Vision API. Our demo robot can roam and identify objects, including smiling faces. This is just one simple example of what can be done with Cloud Vision API.
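The demo's source is not reproduced in this post, so the following is not the robot's code. It is only a small sketch of how a client might decide that a face is smiling, assuming the response carries a `faceAnnotations` list whose entries have a `joyLikelihood` bucket, as in the later public v1 API. The function name `count_smiling_faces` and the sample response are hypothetical.

```python
# Likelihood buckets in ascending order, as named in the public v1 API
# (assumed here; the limited preview may differ).
LIKELIHOOD_ORDER = [
    "UNKNOWN",
    "VERY_UNLIKELY",
    "UNLIKELY",
    "POSSIBLE",
    "LIKELY",
    "VERY_LIKELY",
]


def count_smiling_faces(response, threshold="LIKELY"):
    """Count faces whose joy likelihood is at or above the threshold bucket."""
    minimum = LIKELIHOOD_ORDER.index(threshold)
    faces = response.get("faceAnnotations", [])
    return sum(
        1
        for face in faces
        if LIKELIHOOD_ORDER.index(face.get("joyLikelihood", "UNKNOWN")) >= minimum
    )


if __name__ == "__main__":
    # Hypothetical response fragment, made up for illustration only.
    sample = {
        "faceAnnotations": [
            {"joyLikelihood": "VERY_LIKELY"},
            {"joyLikelihood": "POSSIBLE"},
        ]
    }
    print(count_smiling_faces(sample))
```

Comparing bucket positions rather than strings keeps the threshold adjustable: passing `threshold="POSSIBLE"` would also count less certain smiles.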
Aerosense, a subsidiary of Sony Mobile Communications Inc., was among the first testers of Cloud Vision API and had some initial feedback to share:
To join the Limited Preview, please sign up here. We cannot wait to see what amazing applications you build with Vision API, and we look forward to hearing from you!
- Posted by Ram Ramanathan, Product Manager, Google Cloud Platform