Extract video chapters

Extract the chapters in a video as structured JSON

You can query a model directly and test the results returned when using different parameter values with the Cloud console, or by calling the Vertex AI API directly.

Prompt Freeform

Chapterize the video content by grouping the video content into chapters and providing a summary for each chapter. Please only capture key events and highlights. If you are not sure about any info, please do not make it up. Return the result in the JSON format with keys as follows : "timecode", "chapterSummary"

Response

[
  {
    "timecode": "00:00-00:04",
    "chapterSummary": "The video starts with an aerial view of Rio de Janeiro, showcasing the Christ the Redeemer statue and the coastline. The title card for \"Google presents Beyond the Map\" is displayed."
  },
  {
    "timecode": "00:04-00:10",
    "chapterSummary": "The video shows various scenes of Rio's famous beaches, including people playing soccer, a vendor selling hats, and a man relaxing on the sand. An aerial view of the city's landscape is also shown."
  },
  {
    "timecode": "00:10-00:18",
    "chapterSummary": "The video continues to showcase the well-known aspects of Rio, such as a skateboarder, a man sunbathing, and people walking along the beach. It also features a street musician playing a guitar."
  },
  {
    "timecode": "00:18-00:35",
    "chapterSummary": "The video shifts focus to the favelas, highlighting the dense housing and daily life within these communities. It also touches upon the negative perceptions often associated with favelas, such as crime and poverty."
  },
  {
    "timecode": "00:35-00:48",
    "chapterSummary": "The video explains Google's initiative to map the favelas, emphasizing the importance of providing addresses to residents and fostering a better understanding of these communities. It shows scenes of daily life in the favelas, including children playing soccer and people going about their daily routines."
  },
  {
    "timecode": "00:48-00:57",
    "chapterSummary": "The video concludes with text stating that over 1.4 million people in Rio live in favelas, inviting viewers to explore their world in 360° via a provided link. The Google logo and website address are displayed."
  }
]
Model: gemini-1.5-flash-002
Temperature: 0.2
Max output tokens: 8192
TopK: 40.0
TopP: 0.95