Fresh and New

53. Buckets of data

sebchan.substack.com

Seb Chan
Sep 19, 2020
∙ Paid

The recently released Google Arts & Culture experiment, Gael Hughes’ An Ocean of Books, is a cute but telling example of the challenges of large heterogeneous datasets. Ostensibly a ‘discovery tool’, it uses basic book metadata from Google Books (itself built on standard library MARC records and classification practices) to group authors into island…
