Enhance your analytics and AI initiatives with pre-built data solutions and valuable datasets powered by BigQuery, Cloud Storage, Earth Engine, and other Google Cloud services.
Category | Featured datasets | Sample use cases and insights |
---|---|---|
Google datasets | View the Top 25 and Top 25 rising queries from Google Trends from the past 30-days with this dataset. Each term includes 5 years of historical data across the 210 Designated Market Areas (DMAs) in the US and now over 50 countries across the globe. |
|
The dataset provides 12 months (August 2016 to August 2017) of obfuscated Google Analytics 360 data from the Google Merchandise Store to show what an ecommerce website would see, including traffic source, content, and transactional data. |
| |
Google Patents Research Data contains the output of much of the data analysis work used in Google Patents (patents.google.com), including machine translations of titles and abstracts from Google Translate, embedding vectors, extracted top terms, similar documents, and forward references. |
| |
Commercial datasets | Crux Deliver is a managed service for data engineering and operations. Crux wires up all of the traditional and alternative data providers on behalf of its clients and manages all aspects of onboarding, data engineering, and operations. Every dataset is validated so that we only deliver clean and actionable data. |
|
Exchange Data International (EDI) helps the global financial and investment community make informed decisions. EDI’s extensive content database includes worldwide equity and fixed income corporate actions, dividends, static reference data, closing prices, and shares outstanding. |
| |
FactSet is a global provider of integrated financial information, analytical applications, and industry-leading service that delivers superior content, analytics, and flexible technology. |
| |
Instant access to reliable property, loan and valuation information for 100M homes. ML algorithms process hundreds of data sources to provide Home Price Indices for 381 Metros, 18,300 ZIP codes and 4M blocks covering >95% of the US residential market. |
| |
LinkUp, the global leader in accurate, real-time, and predictive job market data and analytics offers proprietary data solutions that give customers the ability to derive valuable insights into the global labor market and help investors generate alpha at the macro, sector, geographic, and individual company level. |
| |
LEBA’s solution gives customers the ability to access a unique, consolidated view of the Energy markets from across the main energy brokers. Energy, Oil and Gas producers, wholesale users, utilities, and financial traders benefit from independent market information based on traded activity rather than price assessments. |
| |
Neustar, Inc., a TransUnion company, is a leader in identity resolution providing the data and technology that enable trusted connections between companies and people at the moments that matter most. Neustar offers industry-leading solutions in marketing, risk and communications. |
| |
RS Metrics, the leading company for asset-level, real-time, objective and verifiable ESG data, gives customers the ability to access accurate insights into EV manufacturers’ factory inventory levels. |
| |
Ursa Space Systems, a global satellite intelligence infrastructure provider, gives customers the ability to monitor global economic trends with data derived from satellite imagery, updated on a weekly basis. |
| |
Public datasets | The Storm Events Database is an integrated database of severe weather events across the United States from 1950 to this year, with information about a storm event's location, azimuth, distance, impact, and severity, including the cost of damages to property and crops. |
|
These are full-resolution boundary files, derived from TIGER/Line Shapefiles, the fully supported, core geographic products from the US Census Bureau.These include information for the 50 states, the District of Columbia, Puerto Rico, and the outlying island areas. |
| |
The American Community Survey (ACS) is an ongoing survey that provides vital information on a yearly basis about our nation and its people by contacting over 3.5 million households across the country. The resulting data provides incredibly detailed demographic information across the US aggregated at various geographic levels. |
| |
Search for and access over 200 datasets listed in Google Cloud Marketplace. |
| |
Earth Engine datasets | Earth Engine's public data archive includes more than forty years of historical imagery and scientific datasets, updated daily and available for online analysis. |
|
Kaggle datasets | Inside Kaggle you’ll find all the code and data you need to do your data science work. Use over 80,000 public datasets and 400,000 public notebooks to conquer any analysis in no time. |
|
Synthetic datasets | The synthetic data represents transactions from automated trading bots operated by the fictional Cymbal Investments group, each using a single algorithm to guide its trading decisions. The records are derived from FIX protocol (version 4.4) Trade Capture Reports loaded into BigQuery. |
|
Research datasets | Google's Dataset Search program has indexed almost 25 million datasets from across the web, giving you a single place to search for datasets and find links to where the data is. Filter by recency, format, topic, and more. |
|
Featured datasets
View the Top 25 and Top 25 rising queries from Google Trends from the past 30-days with this dataset. Each term includes 5 years of historical data across the 210 Designated Market Areas (DMAs) in the US and now over 50 countries across the globe.
What are the most popular retail items people have searched for across the area?
Crux Deliver is a managed service for data engineering and operations. Crux wires up all of the traditional and alternative data providers on behalf of its clients and manages all aspects of onboarding, data engineering, and operations. Every dataset is validated so that we only deliver clean and actionable data.
What are the datasets Crux can help me onboard into my data ecosystem?
The Storm Events Database is an integrated database of severe weather events across the United States from 1950 to this year, with information about a storm event's location, azimuth, distance, impact, and severity, including the cost of damages to property and crops.
Use case: home improvement retailer understanding impact of storms on inventory
Technical Reference Pattern: Dynamic insurance pricing model using this dataset
Earth Engine's public data archive includes more than forty years of historical imagery and scientific datasets, updated daily and available for online analysis.
How has surface temperature changed over the past 30 years?
What did this area look like before year 2000?
Inside Kaggle you’ll find all the code and data you need to do your data science work. Use over 80,000 public datasets and 400,000 public notebooks to conquer any analysis in no time.
Can you tackle some of the most vexing and provocative problems in data science?
The synthetic data represents transactions from automated trading bots operated by the fictional Cymbal Investments group, each using a single algorithm to guide its trading decisions. The records are derived from FIX protocol (version 4.4) Trade Capture Reports loaded into BigQuery.
How much did traders make from each individual trade?
Google's Dataset Search program has indexed almost 25 million datasets from across the web, giving you a single place to search for datasets and find links to where the data is. Filter by recency, format, topic, and more.
What datasets exist for < keyword you're interested in >?
Which sustainability datasets exist from last year are free for commercial use?
With BigQuery sandbox, you can try the full BigQuery experience without a billing account or credit card.
Learn more from both sides of the dataset ecosystem: data providers and data consumers.
Start building on Google Cloud with $300 in free credits and 20+ always free products.