Recently, Google added Try the API boxes on the product pages of each of its Cloud Machine Learning APIs: Cloud Vision API, Speech API and Natural Language API. Now anyone can instantly experience the power of Google's machine intelligence on their own images, voice and text. Let's see how it works.
Try Cloud Vision APICloud Vision API enables developers to understand the content of an image by encapsulating powerful machine learning models in an easy-to-use REST API. To try it now, go to the Cloud Vision API product page and drop or open any image file onto the Try the API box. Click on the Captcha dialog box to prove you're not an automated script, and drop in your image. Here’s what the Vision API had to say about a picture I took of a Jack O’Lantern that my son and I carved at a Halloween party:
Using the label detection method of the API, Cloud Vision executes image content analysis on the uploaded image. Looks like Cloud Vision’s machine intelligence is smart enough to understand not just the object, but also the context ("halloween," “holiday,” “carving”). Awesome, isn't it? You can also see the API’s response in the raw JSON format by clicking the JSON Response tab.
Optical Character Recognition (OCR)
When you drop this image to the box and open the Text tab, you can see the OCR result.
Even though the words in this image were slanted and unclear, the OCR extracts the words and their positions correctly. It even picks up the word "beacon" on the presenter's t-shirt.
Detection of explicit images, landmarks and logos
Try Cloud Speech API
Have you noticed teenagers control their smartphones using their voice? The same voice recognition engine that powers Google Search and Google Now in modern smartphones is behind Cloud Speech API. You can now take advantage of this disruptive technology for your own applications. For example, a call center provider can use Cloud Speech API to convert audio data to text (and later, you can analyze it with Natural Language API — we will discuss that next).
Cloud Speech API also has a Try the API box. Go to the product page, click on the microphone icon, and make a recording up to 30 seconds long. When you finish recording, it uploads the audio data to the API and displays the result.
Convert your voice to text right now
Click on the microphone icon to start recording
You can also try Cloud Speech API with many languages besides English. Pick from 80 supported languages and their variants from the drop-down menu. Personally, I found the technology works impressively with Japanese too.
Try Natural Language API
Many developers use simple keyword or regular expression matches to process natural language text. In other words, they process text as unstructured data without any clue about what it means.
With Cloud Natural Language API, powerful machine learning models reveal the structure and meaning with an easy-to-use REST API. Now that you can handle text as structured data with various attributes and metadata, it’s possible to add intelligence to your application by processing, analyzing or querying on the text generated by end customers.
Let's look at Natural Language API’s Try the API box. Clicking the Analyze button to explore the default sample text.
Sentiment and syntactic analysis
For sentiment analysis of the text, click on the Sentiment tab.
According to the Cloud Natural Language API, the sentence "Sundar Pichai said in his keynote that users love their new Android phones" has a positive sentiment.
On the Syntax tab, you can see the sentence’s syntactic analysis.
The JSON response from the syntactic analysis method provides the data to build a dependency parse tree of the text, like the one pictured above. With this feature, you can split the whole sentence into several tokens, as well as the parts of speech (POS) of each token such as noun and verb, and dependencies between them. Now the unstructured data becomes structured data with insights about it.
Develop amazing apps with Cloud Machine Learning APIs
As we have seen in this article, it’s easy to experience the power of Google's latest machine learning technologies with their respective Try the API boxes.
Cloud Vision API is now generally available and ready for production use. Speech API and Natural Language API are in beta and anyone can start evaluating them. The time is now for developers to start playing with this game-changing technology.