Video Intelligence release notes

This page documents production updates to Video Intelligence. We recommend that Video Intelligence developers periodically check this list for any new announcements.

You can see the latest product updates for all of Google Cloud on the Google Cloud page, browse and filter all release notes in the Google Cloud console, or programmatically access release notes in BigQuery.

To get the latest product updates delivered to you, add the URL of this page to your feed reader, or add the feed URL directly.

November 01, 2021

AutoML Action Recognition: The Streaming API is a Beta feature of Video Intelligence API for real-time versions of several capabilities such as object tracking and label detection. This current launch adds streaming support for AutoML Action Recognition models. Customers can now specify their own custom AutoML model when performing action recognition on a stream.

October 08, 2021

The SHOT_CHANGE_DETECTION model will undergo an upgrade over the next 90 days to a newer version. The API interface and client library will remain the same as the previous version.

Note that you have 30 days from today to test the new model by specifying "builtin/latest" in the model field of the config object for shot change detection. At the end of 30 days, the new model will be promoted to the default model accessible as "builtin/stable". After that event, the original model, currently accessible by default or using "builtin/stable" will still be available for another 60 days using "builtin/legacy".

Until this 30 day period ends, the model formerly accessible as "builtin/latest" will be available as "builtin/legacy". Thank you for your feedback on that model, now labeled "builtin/legacy" version. The new model launched today as "builtin/latest" has been improved over this model as well as the current default "builtin/stable" model.

If you encounter problems with this upgrade, contact the Video Intelligence API engineering team by submitting a ticket in the private issue tracker.

September 20, 2021

The CELEBRITY_RECOGNITION model will undergo an upgrade to a newer version over the next 90 days. The API interface and client library will remain same as the previous version. The API follows the same Service Level Agreement (SLA). You have 30 days from this release date to test the new model. To do so, specify "builtin/latest" in the model field of the Feature object while requesting image annotation. After the end of this 30-day period, the new version will be promoted to the default model and accessible as "builtin/stable". Going forward, the original model will still be available for another 60 days using "builtin/legacy". If you encounter problems with this upgrade, contact the Video Intelligence API engineering team by submitting a ticket in the private issue tracker.

May 04, 2021

The following features are available in the Video Intelligence API version v1:

Face detection: Locate faces within a video, and identify attributes such as glasses being worn. Learn more

Person detection: Locate people in a video, and identify attributes and 2D landmarks. Learn more

This GA launch brings significant quality improvement to both features.

September 14, 2020

Bug fix for shot change detection API: Tuned internal model parameters to reduce false positives under certain scenarios.

May 21, 2020

The following features are available in the Video Intelligence API version v1p3beta1:

Face detection: Locate faces within a video, and identify attributes such as glasses being worn. Learn more

Person detection: Locate people in a video, and identify attributes and 2D landmarks. Learn more

March 31, 2020

The following GA feature is available in the Video Intelligence API version v1:

Logo recognition: Detect, track, and recognize the presence of over 100,000 brands and logos in video content. Learn more

October 30, 2019

Cloud Video Intelligence now offers celebrity recognition to select media & entertainment companies and their designated partners. With celebrity recognition, you can inspect your video content to detect and track human faces that appear in the input video or video segment. The Video Intelligence API then compares the faces against a database of celebrities. This feature is in beta; access to the feature is restricted.

October 04, 2019

Logo recognition is now available as a beta feature. Learn more.

September 17, 2019

Cloud Video Intelligence rolled out improved models for video annotation using the OBJECT_TRACKING and TEXT_DETECTION with the v1 version of the service.

You can now specify which model you want to use with LABEL_DETECTION and SHOT_CHANGE. To specify a model using the v1 version of the service, set the model field of the LabelDetectionConfig or ShotChangeDetectionConfig to either builtin/stable or builtin/latest.

You can now specify your own custom AutoML model when performing label detection or object tracking on a stream. This feature is in beta.

June 26, 2019

Results returned from asynchronous annotation now provide resource names in the following format project/PROJECT_NAME/location/us-west1/operation/OPERATION_ID.

April 10, 2019

Live streaming video annotation is available as a beta feature. Learn more.

Object tracking is generally available for use. Learn more.

Streaming from a file is available as a beta feature. Learn more.

Text detection (OCR) is generally available for use. Learn more.

December 07, 2018

The Video Intelligence API enables you to transcribe text from speech in the audio of a video. Speech transcription can recognize multiple speakers, filter out profanity, add punctuation to the transcribed text, and more. For more information, see Speech Transcription. This feature is generally available.

October 26, 2018

Cloud Video Intelligence allows you to track an object from one moment to the next in a video. This feature is in beta.

You can use the Video Intelligence API to detect text (OCR) in a video. This feature is in beta.

August 08, 2018

Speech transcription is available as a beta feature. Learn more.

June 27, 2018

An improved label detection model is now available. The new model:

  • Leverages audio content in videos to improve label detection.
  • Is trained using more features and better calibration ground truth.

To instruct the Cloud Video Intelligence API to use the new label detection model when servicing your annotation request, set the model field of your LabelDetectionConfig to builtin/latest.

We'll support both the current model and the new model the next 90 days. After 90 days the current label detection model will be deprecated and only the new label detection model will be used for all label detection requests.

November 30, 2017

Video Intelligence API is GA: The Video Intelligence API has graduated out of beta and has reached v1. All API endpoints are updated to use https://videointelligence.googleapis.com/v1/.

September 21, 2017

Explicit content detection: SafeSearch has been renamed to Explicit Content Detection. Explicit Content Detection inspects an input video for frame-level imagery that could be considered adult content.

Explicit Content Detection is performed by using the annotate method and specifying an EXPLICIT_CONTENT_DETECTION request.

The ExplicitContentAnnotation and ExplicitContentFrame types replace the SafeSearchAnnotation type. The adult field has been renamed to pornographyLikelihood and the time field has been renamed to timeOffset. The spoof, medical, violent, and racy fields have also been removed. The timeOffset field returns a value of type Duration instead of type int64.

The labelAnnotations field returned in the response for a video annotation request has been replaced with the segmentLabelAnnotations, shotLabelAnnotations, and frameLabelAnnotations fields. This provides specific label annotations for each level of the video. The LabelLevel enum has been removed.

All annotations have been updated to return an array of LabelFrame types, an array of LabelSegment types, a list of entities, and a list of entity categories. Each LabelFrame and LabelSegment includes a confidence value.

The field description has been replaced with an Entity type, which includes both the description and an entity_id field. You can use the entity id to find more information for some entities in the Google Knowledge Graph Search API.

The LabelDetectionMode enum remains unchanged and can be set as the label_detection_mode field of the LabelDetectionConfig for your request, along with the stationary_camera field. This configurations applies only to labels. Other video intelligence features do not yet have yet any feature-specific configuration options.

The VideoSegment type is now used only in the context configuration (VideoContext) of the AnnotateVideoRequest type to allow you to pass multiple video segments in a request.

June 26, 2017

SafeSearch detection: SafeSearch inspects an input video for frame-level imagery that could be considered adult content.

SafeSearch is performed through the annotate method using a SAFE_SEARCH_DETECTION request.

May 18, 2017

Cloud Video Intelligence API is available for beta.

Shot change detection: Shot detection inspects an input video, and detects changes in shots (scenes) that occur throughout the length of the video or video segment. Shot detection is performed through the annotate method using a SHOT_CHANGE_DETECTION request.

Label detection: Label detection inspects an input video and detects entities that occur throughout the length of the video or video segment. Label detection is performed through the annotate method using a LABEL_DETECTION request.