TL;DR
A public Hacker News archive is available on the Hacker Book site, published by DOSAYGO. The page reports 46,399,072 items and a total size of 8.5GB, covering Oct 9, 2006 to Dec 28, 2025.
What happened
On a Hacker Book page published December 30, 2025, DOSAYGO presents a mirror or export of Hacker News for browsing and download. The page displays a snapshot of the HN front page for Sunday, December 28, 2025, with the usual front-page entries, scores, and comment counts. A line of metadata reports 46,399,072 items across 1,637 shards, a total size of 8.5GB, and coverage from Oct 9, 2006 through Dec 28, 2025. A "GET THIS" link appears intended to let users obtain the dataset. The page also notes that, when viewing the archive, times are shown relative to 11:59 PM.
Why it matters
- A locally queryable archive could aid researchers, journalists, and developers working on historical analysis of Hacker News.
- A consolidated SQLite export, if that is the format of the download (not confirmed in the source), would simplify offline analysis and tooling compared with scraping live web pages.
- At 46,399,072 items across 1,637 shards and 8.5GB, covering 2006 to 2025, the dataset is large enough that anyone planning large-scale analysis will need to budget for storage and indexing.
- Making an archive available in a standard format may improve reproducibility for studies that use HN data.
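To make the offline-analysis point concrete, here is a minimal sketch of the kind of query a local SQLite copy would allow. The source does not document the dataset's schema, so the `items` table, its column names, and the sample rows below are all hypothetical and exist only for illustration.

```python
import sqlite3
from datetime import datetime, timezone

# Hypothetical schema: the real layout of the Hacker Book export
# is not confirmed in the source.
SCHEMA = """
CREATE TABLE items (
    id        INTEGER PRIMARY KEY,
    item_type TEXT,
    author    TEXT,
    time      INTEGER,
    title     TEXT,
    score     INTEGER
)
"""


def ts(year: int, month: int, day: int) -> int:
    """Return the Unix timestamp (seconds) for midnight UTC on a date."""
    return int(datetime(year, month, day, tzinfo=timezone.utc).timestamp())


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a few made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    rows = [
        (1, "story", "alice", ts(2006, 10, 9), "Example story A", 57),
        (2, "story", "bob", ts(2025, 12, 28), "Example story B", 719),
        (3, "comment", "carol", ts(2025, 12, 28), None, None),
        (4, "story", "dave", ts(2025, 12, 27), "Example story C", 12),
    ]
    conn.executemany("INSERT INTO items VALUES (?, ?, ?, ?, ?, ?)", rows)
    return conn


def top_stories(conn: sqlite3.Connection, year: int, limit: int = 10):
    """Return (title, score) for the highest-scoring stories in a year."""
    cur = conn.execute(
        "SELECT title, score FROM items "
        "WHERE item_type = 'story' AND time >= ? AND time < ? "
        "ORDER BY score DESC LIMIT ?",
        (ts(year, 1, 1), ts(year + 1, 1, 1), limit),
    )
    return cur.fetchall()
```

Calling `top_stories(build_demo_db(), 2025)` returns the two made-up 2025 stories ordered by score. Against a real export, the table and column names would need to be adapted to whatever schema the file actually contains.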
Key facts
- Site: Hacker Book (hackerbook.dosaygo.com) — page published 2025-12-30T17:01:59+00:00.
- Creator listed as DOSAYGO; page includes a "GET THIS" link for obtaining the dataset.
- Metadata on the page reports 46,399,072 items and 1,637 shards.
- Reported total size on the page is 8.5GB.
- Coverage shown on the page spans Oct 9, 2006 to Dec 28, 2025.
- The page displays Hacker News front-page items for Sunday, December 28, 2025; times are shown relative to 11:59 PM.
- The page is presented as an archive view of Hacker News with navigation and an archive index.
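The reported figures allow a rough back-of-the-envelope estimate of shard and item sizes. The sketch below assumes "8.5GB" means decimal gigabytes (8.5 × 10^9 bytes) and that items are spread evenly across shards; neither assumption is confirmed in the source.

```python
# Figures as reported on the page.
ITEMS = 46_399_072
SHARDS = 1_637

# Assumption: "8.5GB" is decimal (10**9 bytes per GB), not binary GiB.
SIZE_BYTES = 8.5e9


def per_shard_averages(items: int, shards: int, size_bytes: float) -> dict:
    """Return simple averages, assuming an even spread across shards."""
    return {
        "items_per_shard": items / shards,
        "bytes_per_shard": size_bytes / shards,
        "bytes_per_item": size_bytes / items,
    }


if __name__ == "__main__":
    for name, value in per_shard_averages(ITEMS, SHARDS, SIZE_BYTES).items():
        print(f"{name}: {value:,.1f}")
```

Under those assumptions the averages come to roughly 28,300 items and a little over 5 MB per shard, or on the order of 180 bytes per item. Actual shards may differ considerably in size.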
What to watch next
- Whether the file available via "GET THIS" is a SQLite database, and what its internal schema contains (not confirmed in the source).
- Any licensing, terms of use, or rate limits governing reuse of the dataset (not confirmed in the source).
- Whether DOSAYGO posts updates or expanded exports with different sizes or coverage in the future (not confirmed in the source).
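The first open question can be checked locally once a file has been downloaded. Every SQLite 3 database file begins with the 16-byte header string `SQLite format 3` followed by a NUL byte, so a short script can test for it and then list the tables. This is a sketch only: the function names are illustrative, and nothing here is specific to the Hacker Book download, whose format and packaging are not confirmed in the source.

```python
import sqlite3
from pathlib import Path

# The fixed 16-byte header at the start of every SQLite 3 database file.
SQLITE_MAGIC = b"SQLite format 3\x00"


def looks_like_sqlite(path: str) -> bool:
    """Return True if the file starts with the SQLite 3 header string."""
    with open(path, "rb") as f:
        return f.read(16) == SQLITE_MAGIC


def list_tables(path: str) -> list:
    """Open the database read-only and return its table names, sorted."""
    uri = Path(path).resolve().as_uri() + "?mode=ro"
    conn = sqlite3.connect(uri, uri=True)
    try:
        rows = conn.execute(
            "SELECT name FROM sqlite_master "
            "WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        conn.close()
    return [name for (name,) in rows]
```

If `looks_like_sqlite` returns `False`, the download may be compressed, split into shards in another format, or not SQLite at all, and would need to be inspected by other means.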
Quick glossary
- Hacker News: A social news website run by Y Combinator where users submit technology- and startup-related links and discussion.
- SQLite: A lightweight, file-based relational database engine commonly used to store structured data locally in a single file.
- Shard: A subdivision of a dataset used to split data into smaller parts for storage, processing, or distribution.
- Archive snapshot: A captured state of a dataset or website at a particular point in time for preservation or analysis.
Reader FAQ
Who published this Hacker News archive?
The page lists DOSAYGO as the creator of the Hacker Book archive.
How large is the dataset?
The page reports a total size of 8.5GB.
Does the dataset cover the entire Hacker News history?
The page shows coverage from Oct 9, 2006 to Dec 28, 2025. Whether every item in that range is included is not confirmed in the source.
Is the dataset the 22 GB export mentioned in the original headline?
Not confirmed in the source. The page itself reports a total size of 8.5GB.
Is the download available directly from the page?
The page includes a "GET THIS" link that appears intended for obtaining the dataset, but exact details of the file are not provided on the page.
Sources
- Show HN: 22 GB of Hacker News in SQLite
- Ask HN: Have you used SQLite as a primary database?
- 100 Best Data Startups of Hacker News Show HN – Dec 2025
- dogsheep/hacker-news-to-sqlite