Hacker News Data

This dataset contains all stories and comments from Hacker News since its launch in 2006. Each story record contains a story ID, the author who made the post, the time it was written, and the number of points the story received.

You can start exploring this data in the BigQuery console:

Go to the Hacker News Dataset

Sample queries

Here are some examples of SQL queries you can run on this data in BigQuery.

These samples use BigQuery’s legacy SQL by setting the #legacySQL prefix. For more information, see Setting a query prefix.

How are Hacker News story points distributed?

If you use the score as a dimension (GROUP BY score, in SQL) and count the number of posts with each score, you can get an idea of how likely a story is to receive a given score.
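The grouping described above can be tried locally before running anything in BigQuery. The following is a minimal sketch that uses Python's built-in sqlite3 module rather than BigQuery itself; the table name `stories`, its columns, and the sample rows are all hypothetical and only stand in for the real dataset.

```python
import sqlite3

# Hypothetical sample rows: (id, author, score). Not real Hacker News data.
rows = [
    (1, "alice", 1),
    (2, "bob", 1),
    (3, "carol", 2),
    (4, "dave", 1),
    (5, "erin", 3),
    (6, "frank", 2),
]


def score_distribution(rows):
    """Return (score, num_stories) pairs, most common score first."""
    conn = sqlite3.connect(":memory:")
    # Assumed schema for illustration only.
    conn.execute("CREATE TABLE stories (id INTEGER, author TEXT, score INTEGER)")
    conn.executemany("INSERT INTO stories VALUES (?, ?, ?)", rows)
    query = """
        SELECT score, COUNT(*) AS num_stories
        FROM stories
        GROUP BY score
        ORDER BY num_stories DESC, score ASC
        LIMIT 10
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


distribution = score_distribution(rows)
print(distribution)
```

The same SELECT, GROUP BY, and ORDER BY shape should carry over to BigQuery, with the FROM clause pointing at the public dataset's story table instead of the in-memory one.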

Where do the stories live?

By parsing out the host from each story's URL, you can see where Hacker News stories originate.
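As a rough illustration of the host-parsing idea, the sketch below again uses Python's sqlite3 module with hypothetical data. SQLite has no built-in URL functions, so a stand-in `HOST` function is registered from Python using `urllib.parse.urlsplit`; the table name `stories`, the `url` column, and the sample URLs are assumptions made for this example.

```python
import sqlite3
from urllib.parse import urlsplit

# Hypothetical sample URLs, including rows without a usable URL.
sample_urls = [
    "https://github.com/HackerNews/API",
    "https://github.com/torvalds/linux",
    "https://example.com/post/1",
    "http://example.com/post/2",
    "https://github.com/python/cpython",
    "",
    None,
]


def host(url):
    """Return the lowercase host of a URL, or None if it has none."""
    if not url:
        return None
    return urlsplit(url).hostname


def top_hosts(urls, limit=10):
    """Return (host, num_stories) pairs, most frequent host first."""
    conn = sqlite3.connect(":memory:")
    # Register the Python helper as a one-argument SQL function.
    conn.create_function("HOST", 1, host)
    conn.execute("CREATE TABLE stories (url TEXT)")
    conn.executemany("INSERT INTO stories VALUES (?)", [(u,) for u in urls])
    query = """
        SELECT HOST(url) AS story_host, COUNT(*) AS num_stories
        FROM stories
        WHERE url > ''
        GROUP BY story_host
        ORDER BY num_stories DESC, story_host ASC
        LIMIT ?
    """
    result = conn.execute(query, (limit,)).fetchall()
    conn.close()
    return result


hosts = top_hosts(sample_urls)
print(hosts)
```

The `WHERE url > ''` filter drops stories with an empty or missing URL (for example, text-only posts) so that they do not form a group of their own.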

About the data

Dataset Source: Hacker News

Category: Media, Social

Use: This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source — https://github.com/HackerNews/API — and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.

Update Frequency: Daily

View in BigQuery: Go to the Hacker News dataset
