53. Buckets of data

The recently released Google Arts & Culture experiment, Gael Hughes’ An Ocean of Books, is a cute but telling example of the challenges of large heterogeneous datasets. Ostensibly a ‘discovery tool’, it uses basic book metadata from Google Books (itself built on standard library MARC records and classification practices) to group authors into island…
