Integrity
How to trust this archive.
Every origin record in this archive carries a verifiable trail of evidence: a SHA-256 of its canonical form, a Bitcoin-anchored OpenTimestamps proof of its publication date, and a mirror URL on the Internet Archive. The dataset's licence is CC BY-NC-SA 4.0; commercial use requires separate permission. This page is the audit dashboard. Verify everything yourself at /verify/.
Dataset manifest
Every source file under version control is listed here with its SHA-256 hash (truncated for display). The root hash binds them together: changing a single character in any file changes the root hash.
| File | SHA-256 |
|---|---|
| data/cacao-genetics.schema.json | 3bcfe2838d86c4b4… |
| data/cacao-origin.schema.json | 50a3b29639cad710… |
| data/cacao-origins.json | bbe1bfb4c5aa6bed… |
| data/countries-production.json | a1acd230c8678bd0… |
| data/countries.json | fa506104fe18966f… |
| data/glossary.json | 6e82f809026160ac… |
| data/i18n-ar.json | 737862de3d1bb023… |
| data/i18n-bn.json | a9f44f3ec82f691d… |
| data/i18n-de.json | 9dadc0b51b0c4105… |
| data/i18n-es.json | 4b2dd81a94a7095d… |
| data/i18n-fil.json | e162823675f4dd79… |
| data/i18n-fr.json | 71d5c86d6fbb579a… |
| data/i18n-hi.json | bf3b24a9590c48b9… |
| data/i18n-id.json | c574e4cf046dc05e… |
| data/i18n-it.json | f8ae8d135b813419… |
| data/i18n-ja.json | 358518db1841aad6… |
| data/i18n-ko.json | 16459c2515c099b7… |
| data/i18n-ms.json | 627f8632ce50020e… |
| data/i18n-nl.json | 44e2063190ed3a78… |
| data/i18n-pl.json | 8f18858a1365f1d1… |
| data/i18n-pt.json | 3f1a7152d95a1a3b… |
| data/i18n-ru.json | 0344c7902ebcd569… |
| data/i18n-sw.json | 2f86c5f0c1484b4f… |
| data/i18n-ta.json | 623b6dd08db5831a… |
| data/i18n-th.json | 9e84106cffb7a4a7… |
| data/i18n-tr.json | 2a9368684938f5c8… |
| data/i18n-uk.json | d41b81f4234adcb9… |
| data/i18n-vi.json | 28b47850e99d8909… |
| data/i18n-zh.json | a54f1890553fe79b… |
| data/mirrors.json | 07e899d37c044627… |
| data/ui-en.json | bb7cd8e7c17d2dec… |
| data/world-equirect.json | 6f5bc3dee526244b… |
| content/genetics.md | 72c287a5a28891be… |
| content/methodology.md | 73881dd712fe749b… |
| content/processing.md | ddbba36e29ea4225… |
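How the root hash is derived is not specified on this page. As a minimal sketch, assuming it is the SHA-256 of the manifest's `path<TAB>hash` lines sorted by path, the following shows why a change to any one file propagates to the root, and how a full digest can be checked against the truncated values in the table. The construction, the function names `root_hash` and `matches_prefix`, and the file contents are all assumptions made for illustration.

```python
import hashlib


def root_hash(manifest: dict[str, str]) -> str:
    """Combine per-file hashes into one digest (hypothetical construction).

    Assumes the root is the SHA-256 of 'path<TAB>hash' lines sorted by path;
    the real build may combine the entries differently.
    """
    lines = [f"{path}\t{digest}\n" for path, digest in sorted(manifest.items())]
    return hashlib.sha256("".join(lines).encode("utf-8")).hexdigest()


def matches_prefix(full_digest: str, displayed: str) -> bool:
    """Check a full hex digest against a truncated one such as '3bcfe2838d86c4b4…'."""
    prefix = displayed.rstrip("…").lower()
    return full_digest.lower().startswith(prefix)


# Placeholder contents, not the archive's real files.
files = {
    "data/glossary.json": b'{"term": "criollo"}',
    "content/methodology.md": b"# Methodology\n",
}
manifest = {path: hashlib.sha256(body).hexdigest() for path, body in files.items()}
before = root_hash(manifest)

# Change one character in one file: its hash changes, and so does the root.
files["data/glossary.json"] = b'{"term": "Criollo"}'
manifest = {path: hashlib.sha256(body).hexdigest() for path, body in files.items()}
after = root_hash(manifest)

print(before != after)  # True
```

Sorting by path before hashing makes the result independent of the order in which files are enumerated, which is one reason a construction like this is common.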
OpenTimestamps anchors
Each origin record's canonical SHA-256 is timestamped on the Bitcoin blockchain via OpenTimestamps. The proofs are independently verifiable — no key, no account, no central authority.
| Origin | SHA-256 | Proof | Stamped |
|---|---|---|---|
| au-far-north-queensland | 415d99eea9403720… | ots | 2026-05-23 |
| bz-stann-creek | ea4c71a433f5228f… | ots | 2026-05-23 |
| bz-toledo-maya-mountain | 7f6858d31f0a4b67… | ots | 2026-05-23 |
| bo-alto-beni | f7c2aa483d7b0493… | ots | 2026-05-23 |
| bo-baures-itenez | 353293cff5acc5ba… | ots | 2026-05-23 |
| bo-el-ceibo | 020325a8500abb40… | ots | 2026-05-23 |
| bo-wild-beni-amazon | a70c3995c15f15fa… | ots | 2026-05-23 |
| br-bahia-ilheus | f058703656bc9a6d… | ots | 2026-05-23 |
| br-linhares-espirito-santo | 8fe69f9af7b38fd7… | ots | 2026-05-23 |
| br-medicilandia-para | 74caa1dd60b4433d… | ots | 2026-05-23 |
| br-rondonia | 6b32256cfc9de123… | ots | 2026-05-23 |
| br-tome-acu-para | 72c288d546fecdbd… | ots | 2026-05-23 |
| cm-bulk | 63287e25345f78d1… | ots | 2026-05-23 |
| co-antioquia | 2cea74aa34989af2… | ots | 2026-05-23 |
| co-arauca | 1899b3a1eadc71ce… | ots | 2026-05-23 |
| co-huila | f60a52f0cb4b07ee… | ots | 2026-05-23 |
| co-santander | 05bc0bdf2ab58cbd… | ots | 2026-05-23 |
| co-sierra-nevada-arhuaco | a84b636474757aa0… | ots | 2026-05-23 |
| co-tolima | b9e41849c11d4070… | ots | 2026-05-23 |
| co-tumaco | 93d6d64bf319d900… | ots | 2026-05-23 |
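The page does not define the canonical form of a record. A common approach, assumed here purely for illustration, is to serialise the JSON with sorted keys and no insignificant whitespace before hashing, so that the digest does not depend on key order or formatting. The function name `canonical_sha256` and the sample record are hypothetical.

```python
import hashlib
import json


def canonical_sha256(record: dict) -> str:
    """Hash a record in an assumed canonical form: sorted keys, compact separators, UTF-8."""
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Hypothetical record; the archive's real schema lives in data/cacao-origin.schema.json.
a = {"id": "bo-alto-beni", "country": "BO"}
b = {"country": "BO", "id": "bo-alto-beni"}  # same data, different key order

print(canonical_sha256(a) == canonical_sha256(b))  # True
```

A digest computed this way is the kind of value an OpenTimestamps proof commits to. With the OpenTimestamps reference client installed, a proof file is typically checked with `ots verify`.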
Canary checks
Canaries are stylistic and structural fingerprints that survive cleanup commits and identify the dataset across derivative works. build/lint-canaries.mjs asserts them on every release.
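The canaries themselves are not listed here. As a rough sketch of what such a lint could look like, assuming each canary is a named regular expression that must match somewhere in the dataset text, the check reduces to reporting which patterns are missing. The names and patterns below are invented for illustration and are not the archive's real canaries.

```python
import re


def missing_canaries(text: str, canaries: dict[str, str]) -> list[str]:
    """Return the names of canary patterns that do not occur in the text."""
    return [name for name, pattern in canaries.items() if not re.search(pattern, text)]


# Hypothetical canaries, not taken from build/lint-canaries.mjs.
CANARIES = {
    "id-format": r'"id":\s*"[a-z]{2}-[a-z-]+"',  # ids shaped like "bo-alto-beni"
    "spelling": r"\blicence\b",                   # British spelling kept on purpose
}

sample = '{"id": "bo-alto-beni", "note": "licence: CC BY-NC-SA 4.0"}'
missing = missing_canaries(sample, CANARIES)
if missing:
    raise SystemExit(f"canary check failed: {missing}")
print("all canaries present")
```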
Language sync
The archive ships in 24 languages. Translations are filled in by a sync script, driven by a Stop hook, that calls the `claude` CLI in headless mode: there is no manual translation, and terminology stays consistent across the corpus.
Mirror network
Independent third-party archives hold copies of every release. Removal from kakao.io does not remove the record.
0 URLs on Internet Archive (Wayback Machine)
0 URLs on archive.today
124 dataset releases timestamped on Bitcoin (OpenTimestamps)
| URL | Archived copy |
|---|---|
| https://kakao.io/ | — |
| https://kakao.io/zh/ | — |
| https://kakao.io/hi/ | — |
| https://kakao.io/es/ | — |
| https://kakao.io/fr/ | — |
| https://kakao.io/ar/ | — |
| https://kakao.io/bn/ | — |
| https://kakao.io/pt/ | — |
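Mirror status can be checked independently. The sketch below uses the Internet Archive's public availability endpoint (`https://archive.org/wayback/available`), which returns the closest snapshot for a URL as JSON. The helper names `closest_snapshot` and `lookup` are hypothetical, and the sample responses are hand-written stand-ins, not real data for kakao.io.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional


def closest_snapshot(response: dict) -> Optional[str]:
    """Extract the closest available snapshot URL from an availability response."""
    closest = response.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


def lookup(url: str, timeout: float = 10.0) -> Optional[str]:
    """Query the Wayback availability API for one URL (performs a network request)."""
    query = urllib.parse.urlencode({"url": url})
    with urllib.request.urlopen(
        f"https://archive.org/wayback/available?{query}", timeout=timeout
    ) as resp:
        return closest_snapshot(json.load(resp))


# Offline demonstration with hand-written responses.
no_copy = {"url": "https://kakao.io/", "archived_snapshots": {}}
has_copy = {
    "archived_snapshots": {
        "closest": {
            "available": True,
            "url": "http://web.archive.org/web/20260523000000/https://example.org/",
            "timestamp": "20260523000000",
            "status": "200",
        }
    }
}
print(closest_snapshot(no_copy))   # None
print(closest_snapshot(has_copy))
```

Keeping the parsing step separate from the network call makes it possible to test the logic without contacting archive.org.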
Reproducibility
Given the source manifest above and build/build.mjs, the entire site is reproducible. Run the commands below to regenerate it locally; the resulting hash must match the `rootHash` recorded in dist-manifest.json.
```shell
git clone <source-bundle>
cd <cloned-directory>   # the directory created by git clone
node build/build.mjs
shasum -a 256 dist-manifest.json
# compare against the root hash above
```
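The comparison in the last step can also be scripted. This is a minimal sketch: `sha256_file` reproduces the digest that `shasum -a 256` prints for a file, and `same_digest` compares two hex digests. Both names are illustrative, and the demonstration hashes a temporary file rather than a real dist-manifest.json.

```python
import hashlib
import hmac
import os
import tempfile


def sha256_file(path: str, chunk_size: int = 1 << 16) -> str:
    """Stream a file through SHA-256 and return the hex digest."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_digest(expected: str, actual: str) -> bool:
    """Compare two hex digests, ignoring case and surrounding whitespace."""
    return hmac.compare_digest(expected.strip().lower(), actual.strip().lower())


# Stand-in file with known contents.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"abc")
    demo_path = tmp.name

actual = sha256_file(demo_path)
os.remove(demo_path)

# SHA-256 of b"abc" is a well-known test vector.
expected = "BA7816BF8F01CFEA414140DE5DAE2223B00361A396177A9CB410FF61F20015AD"
print(same_digest(expected, actual))  # True
```

Reading the file in chunks keeps memory use flat regardless of file size, and normalising case avoids false mismatches between tools that print upper- and lower-case hex.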