Visualizing All ISBNs

annas-archive.org

387 points by RyanShook 4 days ago


graypegg - 3 days ago

I see that bounty at the bottom, so I'm tossing away my chances here, but this visualization is just asking to be mapped onto a Hilbert curve. [0] When you "stripe" the data like this, distance in the image doesn't match distance in the sorted list: a step along the Y axis skips an entire row of data, while a step along the X axis is 1-to-1 with the source data.

If you map it onto a Hilbert curve, the X and Y axes mean nothing on their own, but points that are close together in the sorted list will also be close together in the output image.
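
A rough sketch of that mapping, in case it helps anyone: this is the standard index-to-coordinate conversion described on the Wikipedia page [0], not the code from any particular project. The function name `d2xy` and the `order` parameter are just labels chosen here; the grid is assumed to be a square with side 2**order.

```python
def d2xy(order: int, d: int) -> tuple[int, int]:
    """Map position d along a Hilbert curve to (x, y) on a 2**order square grid."""
    n = 1 << order          # side length of the grid
    x = y = 0
    t = d
    s = 1                   # size of the sub-square handled at this level
    while s < n:
        rx = 1 & (t // 2)   # which half in x at this level
        ry = 1 & (t ^ rx)   # which half in y at this level
        if ry == 0:
            # Rotate/flip the quadrant so the curve stays continuous.
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


# Consecutive positions in the sorted list always land on neighbouring pixels.
points = [d2xy(3, d) for d in range(64)]
```

With striped rows, the last pixel of one row and the first pixel of the next are adjacent in the data but a full image width apart; here every consecutive pair of indices is exactly one pixel apart.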

Since the first part of an ISBN is the country (strictly, the registration group, which follows the 978/979 prefix on an ISBN-13), the second part is the publisher, and the third part is the title, with a checksum at the end, I would remove the checksum and sort them each as a big number (no hyphens).
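
Something like this would do for the sort key. It is only a sketch and assumes 13-digit ISBNs (an ISBN-10 would need converting first); `isbn13_sort_key` is a made-up helper name, and it does not verify the check digit, it just drops it.

```python
def isbn13_sort_key(isbn: str) -> int:
    """Strip hyphens/spaces and the trailing check digit, return the rest as an int."""
    digits = [c for c in isbn if c.isdigit()]
    if len(digits) != 13:
        raise ValueError(f"expected 13 digits, got {len(digits)}: {isbn!r}")
    # The first 12 digits carry prefix, group, publisher and title;
    # the 13th is the check digit and adds no ordering information.
    return int("".join(digits[:12]))


isbns = ["978-3-16-148410-0", "978-0-306-40615-7", "979-10-90636-07-1"]
ordered = sorted(isbns, key=isbn13_sort_key)
```

Sorting on this key keeps every ISBN from the same group and publisher block next to each other, which is what makes the "islands" show up.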

You should end up with "islands", where you see big areas covered by big publishing countries, with these "islands" having bright spots for the publisher codes.

Bonus points for labeling these areas!

I set up something a while ago [1] for an interview that does this with weather data. It makes the seasons really obvious since they're all grouped together.

[0] https://en.wikipedia.org/wiki/Hilbert_curve

[1] https://graypegg.com/hilbert (code is at https://github.com/graypegg/hilbertcurveplayground if anyone wants to go for the prize using this! Please at least mention me if you decide to reuse this code, but I can't stop ya lol)

WillAdams - 3 days ago

The thing is, ISBNs aren't hierarchical --- they are bought in blocks (or even individually at an exorbitant markup, says the guy who bought one to reprint a single book), so this doesn't show anything really interesting/useful.

A visualization using LoC or even Dewey Decimal would be far more useful, esp. if it also linked to public domain and copyright-free repositories/lists, say an interactive and visual version of John Mark Ockerbloom's:

https://onlinebooks.library.upenn.edu/

skrebbel - 3 days ago

I thought it was my color blindness that made me unable to distinguish between the red and green pixels as described (I only see red and black ones), but even with a browser extension that compensates for color blindness I can't distinguish any more colors. Is this just me, or is the graph weird?

glimshe - 3 days ago

Anna's archive is one of the wonders of the world. If we almost destroyed our species but Anna's archive endured, there would be hope for a relatively expedient reconstruction.

jdblair - 3 days ago

It appears that the IP of the server is blocked in the EU. I get this from my ISP (Ziggo, in the Netherlands):

This website is blocked

European sanctions

The Council of Europe has decided that the websites of RT (formerly Russia Today) and Sputnik News may no longer be transmitted. The website you are trying to visit falls under this European sanction.

VodafoneZiggo is obliged to implement the sanction and has blocked the website.

billpg - 3 days ago

Anyone else seeing this?

"This server couldn't prove that it's annas-archive.org; its security certificate is from *.hs.llnwd.net. This may be caused by a misconfiguration or an attacker intercepting your connection."

quink - 3 days ago

It's kind of hard to tell what corresponds to what in these graphs. If someone could point out Bookland (i.e. 978), it would be a bit easier to orient oneself.

greenie_beans - 3 days ago

Is it illegal to download and use their ISBN file? Like, what is wrong with having that information?

whataguy - 3 days ago

> Each pixel represents 2,500 ISBNs. If we have a file for an ISBN, we make that pixel more green.

What do you mean by "more green"? I don't see any shaded green.

And I presume the black pixels are unregistered ISBNs?

usr1106 - 3 days ago

What is Anna's archive and why is it blocked by law enforcement in several European countries (EU + UK)?

eporomaa - 3 days ago

Hm, I got:

"...

European sanctions

The Council of Europe has decided that the websites of RT (formerly Russia Today) and Sputnik News may no longer be transmitted. The website you are trying to visit falls under this European sanction.

..."

ge96 - 3 days ago

Ooh, prize money. D3 visualizations are fun: you can map a million things and zoom into them.

friend_Fernando - 3 days ago

Isn't it interesting how certain online forces affiliated with the letter Z are against copyright for Western IP in general, but are pro copyright when it comes to hamstringing Western AI?

netman21 - 3 days ago

Hee, hee. "Imperial Library of Trantor."

qingcharles - 3 days ago

Now do ISSNs, please.
