Fresh and New

Fresh and New

Share this post

Fresh and New
Fresh and New
53. Buckets of data

53. Buckets of data

Seb Chan
Sep 19, 2020
∙ Paid

Share this post

Fresh and New
Fresh and New
53. Buckets of data
Share

The recently released new Google Arts & Culture experiment, Gael Hughes’ An Ocean of Books, is a cute but telling example of the challenges of large heterogeneous datasets. Ostensibly a ‘discovery tool’ it uses basic book metadata from Google Books (itself built on standard library MARC records and classification practices), to group authors into island…

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2025 Seb Chan
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share