Stay organized with collections
Save and categorize content based on your preferences.
Text Detection performs Optical Character Recognition (OCR) to detect
visible text from frames in a video, or video segments, and returns the
detected text along with information about the frame-level location and
timestamp in the video for that text.
Text Detection is particularly useful for media & entertainment use cases,
including, detecting and extracting cast lists at the end of shows and movies,
or detecting the presence of burnt-in subtitles.
Text detection is available for the languages
supported by the Cloud Vision API.
To detect visible text from a video or video segments, call the
annotate
method and specify
TEXT_DETECTION
in the features field.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-02-14 UTC."],[],[]]