Google Cloud and YouTube-8M Challenge: Predict YouTube video tags for a chance to win up to $30K
Philippe Poutonnet
Product Marketing Lead
Mike Styer
Strategic Partner Development Manager
In partnership with YouTube, Google Research and Kaggle, Google Cloud Platform (GCP) invites you to participate in a large-scale video classification and representation learning task:
Using Google Cloud Machine Learning, TensorFlow, or your favorite machine learning framework, the competition challenges you to develop classification algorithms that accurately assign video-level labels using the YouTube-8M dataset. The dataset was created from over 7 million YouTube videos (450,000 hours of video) and includes video labels from a vocabulary of 4,716 classes (3.4 labels/video on average). It also comes with pre-extracted audio and visual features from every second of video (3.2B feature vectors in total). By taking part, Kagglers can not only play a pivotal role in setting state-of-the-art benchmarks, but can also improve search and organization of video archives.
Are you up for the challenge?
Some of the biggest breakthroughs in machine learning and machine perception have come thanks to large labeled datasets such as ImageNet, which includes millions of images labeled with thousands of classes, and has significantly accelerated research in image understanding. Google has released many such datasets for Cloud Machine Learning, from Word Vector Models to Deep Learning for Robots, and more recently a few vision-related datasets, including Open Images, YouTube-8M and YouTube-BoundingBoxes.Video represents another great opportunity to detect and recognize objects and understand human actions and interactions with the world. Improving our understanding of video imagery can lead to better video search, organization and discovery —
for personal memories, enterprise video archives or public video collections.
Getting started
- Review the data page for special instructions on how to access the competition's data. It will be hosted on Google Cloud. Participants have the option to download the data to work locally or work within the Google Cloud Machine Learning beta Platform.
- Review the tutorial on Getting Started with Google Cloud, and try the starter code.
- Sign up for a Google Cloud Machine Learning Platform free trial account. The free trial account includes $300 in credits!
- Participants who expend their $300 free trial credits may be eligible to earn additional Google Cloud credits. Apply here, only after you have exhausted the free trial credits. Review the Prizes section for full details on how to earn additional credits. It may take up to one week to issue your coupon.
- We've also provided a subsample of the data to explore on Kernels. Take a look at this Python notebook and create your own.
- Don't forget to review the prize eligibility details, which includes requirements for code open-sourcing and a paper submission.