The sections below highlight the features and capabilities of the Google Cloud Video Intelligence API.
Label detection
A LABEL_DETECTION request annotates a video with labels (tags) for entities detected in the video or in the video segments provided. A LABEL_DETECTION request is performed through the annotate method. The response returns the following:
- A list of video segment annotations where an entity is detected.
- A list of frame annotations where an entity is detected.
- A list of shot annotations where an entity is detected, if shot annotations are requested in the video context configuration for the request.
For example, a video of a train at a crossing may produce labels such as "train", "transportation", and "railroad crossing". Each label also includes a time segment whose time offsets (timestamps) are measured from the beginning of the video. The annotation also contains additional entity information, including an entity ID that you can use to find more information about the entity in the Google Knowledge Graph Search API.
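The label request and response described above can be sketched in Python using only plain dictionaries. This is a minimal illustration, not the official client: the JSON field names (`inputUri`, `videoContext`, `labelDetectionConfig`, `segmentLabelAnnotations`, and so on) are assumptions based on the v1 REST representation and should be checked against the API reference. The bucket path, the entity ID, and the whole `sample_result` payload are hypothetical and hand-written for this example.

```python
def build_label_request(input_uri, mode="SHOT_AND_FRAME_MODE"):
    """Return a JSON-serializable body for a LABEL_DETECTION annotate call."""
    return {
        "inputUri": input_uri,
        "features": ["LABEL_DETECTION"],
        # Assumed field names; this mode also asks for shot- and frame-level labels.
        "videoContext": {"labelDetectionConfig": {"labelDetectionMode": mode}},
    }


def to_seconds(offset):
    """Convert a duration string such as '14.8s' to float seconds."""
    return float(offset.rstrip("s"))


def segment_labels(annotation_result):
    """Flatten segment-level labels into (description, entity_id, start, end) rows."""
    rows = []
    for label in annotation_result.get("segmentLabelAnnotations", []):
        entity = label["entity"]
        for item in label.get("segments", []):
            span = item["segment"]
            rows.append((
                entity["description"],
                entity.get("entityId"),
                to_seconds(span.get("startTimeOffset", "0s")),
                to_seconds(span.get("endTimeOffset", "0s")),
            ))
    return rows


# Hand-written, hypothetical response fragment (not real API output).
sample_result = {
    "segmentLabelAnnotations": [
        {
            "entity": {"entityId": "/m/placeholder", "description": "train"},
            "segments": [
                {"segment": {"startTimeOffset": "0s", "endTimeOffset": "14.8s"},
                 "confidence": 0.9}
            ],
        }
    ]
}

request_body = build_label_request("gs://example-bucket/train.mp4")
rows = segment_labels(sample_result)
print(rows)
```

Each row pairs a label with the entity ID and the start and end offsets in seconds, which is the information a caller would typically need to jump to the labeled part of the video.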
Explicit content detection
Explicit content detection detects adult content within a video. Adult content is content generally appropriate only for viewers 18 years of age and older, including but not limited to nudity, sexual activities, and pornography (including cartoons or anime). Explicit content detection is performed through the annotate method using an EXPLICIT_CONTENT_DETECTION request.
An EXPLICIT_CONTENT_DETECTION request annotates a video with explicit content annotations (tags) for the video or video segments provided. The response returns frame-level information indicating where explicit content is detected.
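As a rough sketch of how the frame information might be consumed, the Python snippet below filters frames by a likelihood threshold. The field names (`explicitAnnotation`, `frames`, `timeOffset`, `pornographyLikelihood`) and the likelihood values are assumptions based on the v1 REST representation, and `sample_result` is a hypothetical, hand-written payload rather than real API output.

```python
# Assumed ordering of likelihood values, from least to most likely.
LIKELIHOOD_ORDER = [
    "LIKELIHOOD_UNSPECIFIED",
    "VERY_UNLIKELY",
    "UNLIKELY",
    "POSSIBLE",
    "LIKELY",
    "VERY_LIKELY",
]


def flagged_offsets(annotation_result, threshold="LIKELY"):
    """Return frame offsets (seconds) whose likelihood is at or above threshold."""
    min_rank = LIKELIHOOD_ORDER.index(threshold)
    frames = annotation_result.get("explicitAnnotation", {}).get("frames", [])
    flagged = []
    for frame in frames:
        likelihood = frame.get("pornographyLikelihood", "LIKELIHOOD_UNSPECIFIED")
        if LIKELIHOOD_ORDER.index(likelihood) >= min_rank:
            # Offsets are assumed to arrive as duration strings such as "3.2s".
            flagged.append(float(frame.get("timeOffset", "0s").rstrip("s")))
    return flagged


# Hand-written, hypothetical response fragment (not real API output).
sample_result = {
    "explicitAnnotation": {
        "frames": [
            {"timeOffset": "0.5s", "pornographyLikelihood": "VERY_UNLIKELY"},
            {"timeOffset": "3.2s", "pornographyLikelihood": "LIKELY"},
            {"timeOffset": "7s", "pornographyLikelihood": "VERY_LIKELY"},
        ]
    }
}

offsets = flagged_offsets(sample_result)
print(offsets)
```

The threshold is a design choice left to the caller: a stricter application could pass `threshold="POSSIBLE"` to flag more frames for review.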
Shot change detection
Shot change detection inspects an input video and detects changes in shots (scenes) that occur throughout the video or video segment. Shot change detection is performed through the annotate method using a SHOT_CHANGE_DETECTION request.
A SHOT_CHANGE_DETECTION request annotates a video with video segments that are selected based on content transition. For example, a golf video following two players across the golf course with some panning to the woods for background may produce two shots: "players" and "woods," giving the developer access to the most relevant video segments showing the players for highlights.
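The snippet below is a small Python sketch of turning shot annotations into time ranges. It assumes the response lists shots under `shotAnnotations`, each with `startTimeOffset` and `endTimeOffset` duration strings, as in the v1 REST representation; `sample_result` and its timings are hypothetical values written for this example.

```python
def to_seconds(offset):
    """Convert a duration string such as '12.5s' to float seconds."""
    return float(offset.rstrip("s"))


def shot_ranges(annotation_result):
    """Return a list of (start, end) tuples in seconds, one per detected shot."""
    return [
        (
            to_seconds(shot.get("startTimeOffset", "0s")),
            to_seconds(shot.get("endTimeOffset", "0s")),
        )
        for shot in annotation_result.get("shotAnnotations", [])
    ]


def longest_shot(ranges):
    """Return the (start, end) tuple with the greatest duration, or None."""
    return max(ranges, key=lambda r: r[1] - r[0], default=None)


# Hand-written, hypothetical response fragment (not real API output).
sample_result = {
    "shotAnnotations": [
        {"startTimeOffset": "0s", "endTimeOffset": "12.5s"},
        {"startTimeOffset": "12.5s", "endTimeOffset": "20s"},
    ]
}

ranges = shot_ranges(sample_result)
print(ranges, longest_shot(ranges))
```

Under these assumptions each shot is described only by its time range, so naming a shot (for example "players" or "woods") would require combining the ranges with label annotations or the caller's own logic.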
By setting location_id in the annotate request, users can specify the Google Cloud Platform region where annotation will be performed. Setting this region can be important for regulatory purposes. The following regions are currently supported:
If no region is specified, the region is determined based on the video file location.
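A minimal Python sketch of the optional region setting follows. It assumes the REST JSON body spells the field `locationId` (the camel-case form of `location_id`); the helper name `build_regional_request`, the bucket path, and the region value `us-east1` are illustrative assumptions, so confirm the region against the supported list before using it.

```python
def build_regional_request(input_uri, features, location_id=None):
    """Build an annotate request body, pinning the region only when one is given."""
    body = {"inputUri": input_uri, "features": list(features)}
    if location_id:
        # Assumed JSON name for location_id; omitted when no region is requested,
        # in which case the region is determined from the video file location.
        body["locationId"] = location_id
    return body


default_body = build_regional_request(
    "gs://example-bucket/video.mp4", ["SHOT_CHANGE_DETECTION"]
)
pinned_body = build_regional_request(
    "gs://example-bucket/video.mp4",
    ["SHOT_CHANGE_DETECTION"],
    location_id="us-east1",  # illustrative value only
)
print(default_body)
print(pinned_body)
```

Leaving the field out entirely, rather than sending an empty string, keeps the default behavior explicit in the request body.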