How does Shazam work?

perthirtysix.com

158 points by datadrivenangel 3 days ago


swyx - 3 hours ago

related comments from Shazamers

- OG shazam paper https://www.ee.columbia.edu/~dpwe/papers/Wang03-shazam.pdf (he has a talk on youtube btw look it up if really care)

- https://news.ycombinator.com/item?id=18069968 shazam employee blogpost

- https://news.ycombinator.com/item?id=38538996 shazam cofounder endorsed explainer

- go algo repro https://news.ycombinator.com/item?id=41127726

as with all ML things... the code is much less % of the value than the data...

thakoppno - 3 hours ago

Perhaps obviously this is the same technique that enables ACR on TVs.

It occurs to me that Shazam has such a better reputation online because the intent and consent of the user is honored.

It makes me wonder if there couldn’t be an implementation on TVs that is similar and actually is a net positive for consumers. Basically would customers actually like TV ACR if the data wasn’t just going to sell more ads?

larodi - 2 hours ago

There's an algo called dynamic time warping (DTW) and is very often overlooked. My wild guess would be is at play @Shazam.

sandos - an hour ago

Well, my latest guess is: not at all.

It has been working "fine" for me generally for popular music. But then I was at a ice skating competition where there were some really nice synth:y music going on in the pauses, and I used Shazam on several of the songs, and I tried several times on each. It did not find a single one correctly.

Either this was unreleased music or very small niched music or something, or Shazam totally failed?

wrxd - an hour ago

Tangential. This is a cool website, so cool that I tried to subscribe to it in my RSS reader… and it didn’t work.

If any of the authors read this message, please consider adding a RSS feed and you’ve got a subscriber!

Animats - 3 hours ago

Recognizing a recording isn't hard to do, because, for the same recording, the chords follow each other with precisely repeatable timing. That's been around for well over a decade. Recognizing a different recording, say, a, cover version, of the same song, is much more work.

Audible Magic claims to be able to recognize multiple performances of the same songs, and even parodies.[1] Using, of course, "AI technology" and much more compute.

[1] https://www.audiblemagic.com/2024/02/07/identifying-cover-so...

rmnclmnt - an hour ago

Might be the best visual explainer of Shazam original audio fingerprinting algorithm from the 2003 paper (I guess they´ve switched to ML models at some point?)

gnabgib - 4 hours ago

Again? Oh I see.. SCP (this domain is sus)

From CameronMacLeod (2022) - and much more complete analysis (587 points, 2023, 155 comments) https://news.ycombinator.com/item?id=38531428

Or Slate (2009) (50 points, 16 comments) https://news.ycombinator.com/item?id=893353

dataviz1000 - 3 hours ago

Add to my list of projects. Dinosaur game but with audible clucks to jump.

cellular - 4 hours ago

I did this for a science project in 1986 on an Apple ][c computer !

G_o_D - 3 hours ago

Out of curiosity is it possible to prevent shazam like app from detecting maybe by adding noise or any technique ?

krishna_dam - 3 hours ago

Surprised to see how that got it worked with out all the "AI" bluff

flyuk - 3 hours ago

Nice article - enjoyed reading!

blackjackfoe - 3 hours ago

No "AI" required!

SilverElfin - 2 hours ago

I feel like it does not work well. Shazam struggles to recognize music in real life environments that have some background noise, even with a lot of time. It’s much worse than the built in music recognition Google’s phones have, for example.

wood_spirit - 3 hours ago

Reminds me of Roy Van Rijn’s prototype that got a cease and desist letter! Lots of community disappointment at the time!

https://hn.algolia.com/?q=royvanrijn

dackdel - 3 hours ago

voodoo

yawpitch - 3 hours ago

This has been explained so many times… a wizard imbued the kid with the powers of Solomon, Hercules, Atlas, Zeus, Achilles, and Mercury.