Fresh and New

Share this post
53. Buckets of data
sebchan.substack.com

53. Buckets of data

Seb Chan
Sep 19, 2020
Comment
Share

The recently released new Google Arts & Culture experiment, Gael Hughes’ An Ocean of Books, is a cute but telling example of the challenges of large heterogeneous datasets. Ostensibly a ‘discovery tool’ it uses basic book metadata from Google Books (itself built on standard library MARC records and classification practices), to group authors into island…

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2022 Seb Chan
Privacy ∙ Terms ∙ Collection notice
Publish on Substack Get the app
Substack is the home for great writing