Show HN: I scraped 3B Goodreads reviews to train a better recommendation model

book.sv

599 points by costco 5 days ago


Hi everyone,

For the past couple months I've been working on a website with two main features:

- https://book.sv - put in a list of books and get recommendations on what to read next from a model trained on over a billion reviews

- https://book.sv/intersect - put in a list of books and find the users on Goodreads who have read them all (if you don't want to be included in these results, you can opt-out here: https://book.sv/remove-my-data)

Technical info available here: https://book.sv/how-it-works

Note 1: If you only provide one or two books, the model doesn't have a lot to work with and may include a handful of somewhat unrelated popular books in the results. If you want recommendations based on just one book, click the "Similar" button next to the book after adding it to the input book list on the recommendations page.

Note 2: This is uncommon, but if you get an unexpected non-English titled book in the results, it is probably not a mistake and it very likely has an English edition. The "canonical" edition of a book I use for display is whatever one is the most popular, which is usually the English version, but this is not the case for all books, especially those by famous French or Russian authors.

vessenes - 4 days ago

OK, I just added books until you told me I had too many. Fun idea! I have a couple of suggestions:

* UI - once someone clicks "Add" you really should remove that item from the suggested list - it's very confusing to still see it.

* Beam search / diversification -- Your system threw like 100 books at me of which I'd read 95 and heard of 2 of the other 3, so it worked for me as a predictor of what I'd read, but not so well for discovery.

I'd be interested in recommendations that pushed me into a new area, or gave me a surprising read. This is easier to do if you have a fairly complete list of what someone's read, I know. But off the top of my head, I'm imagining finding my eigenfriends, then finding books that are either controversial (very wide rating differences amongst my fellow readers) or possibly ghettoized, that is, some portion of similar readers also read this X or Y subject, but not all.

Anyway, thanks, this is fun! Hook up a VLM and let people take pictures of their bookshelf next.

voidUpdate - 4 days ago

Does this break part 4 of the Goodreads TOS?

"[...] you agree not to sell, license, rent, modify, distribute, copy, reproduce, transmit, publicly display, publicly perform, publish, adapt, edit or create derivative works from any materials or content accessible on the Service. Use of the Goodreads Content or materials on the Service for any purpose not expressly permitted by this Agreement is strictly prohibited."

Also did the reviewers give you permission to fed their content into an LLM?

dbl000 - 4 days ago

Echoing what everyone else has said here - awesome site, love how fast it was.

I did notice that when I put in a single book in a series (in my case Going Postal, Discworld #33) that tended to dominate the rest of the selection. That does make sense, but I don't want recommendations for a series I'm already well into.

Also noticed that a few books (Spycraft by Nadine Akkerman and Pete Langman, Tribalism is Dumb by Andrew Heaton) that I know are in goodreads and reviewed didn't show up in the search. I tried both author's name and the title of the book. Maybe they aren't in the dataset.

It did stumble with some books more niche books (The Complete Yes Minister). Trying the "Similar" button gave me more books that were _technically_ similar because they were novelizations of British comedy shows, but not what I was looking for.

For more common books though it lined up very well with books already on my wishlist!

mscbuck - 4 days ago

Awesome site and speed!

My advice from someone who has built recommendation systems: Now comes the hard part! It seems like a lot of the feedback here is that it's operating pretty heavily like a content based system system, which is fine. But this is where you can probably start evaluating on other metrics like serendipity, novelty, etc. One of the best things I did for recommender systems in production is having different ones for different purposes, then aggregating them together into a final. Have a heavy content-based one to keep people in the rabbit hole. Have a heavy graph based to try and traverse and find new stuff. Have one that is heavily tuned on a specific metric for a specific purpose. Hell, throw in a pure TF-IDF/BM25/Splade based one.

The real trick of rec systems is that people want to be recommnded things differently. Having multiple systems that you can weigh differently per user is one way to be able to achieve that, usually one algorithm can't quite do that effectively.

diffeomorphism - 4 days ago

The robots.txt is pretty explicit that this scraping is "disallowed"

https://www.goodreads.com/robots.txt

So legalities aside, this seems unethical.

blehn - 4 days ago

You should filter out authors from the input books in the output. If liked a book by an author, surely I'd read more of their work if I wanted to — recommending them isn't helpful. Along the same lines, I think interesting recommendations tend to be the ones that (1) I like and (2) I didn't expect. The more similar the recommendations are to the input, the more likely I already know them, and the more likely to create a recommendation echo chamber.

yoz-y - 4 days ago

It works pretty well in the sense that after inputting only a few quite diverse books it gave me recommendations for a lot of books that I’ve already also read and enjoyed.

I would also really like a possibility to add negative signal. It did also recommend books that seemed interesting to me but I ultimately didn’t like.

Overall quite impressive.

varenc - 4 days ago

I love this site, and the approach! Great seeing someone making good use of Goodreads data.

Sadly my experience with the book recommender isn't too great because of the 64 book limit. If I import either the most recent or least recent 64 book, 95% of the books it recommends to me are books I've read. Though it was helpful for spotting a few books I've read that I didn't log on Goodreads. Guess I'm pretty consistent.

walthamstow - 4 days ago

Works pretty well with cookbooks. Very cool work.

One suggestion would be to make the search less strict on diacritics. Searching for popular cook J. Kenji López Alt was only successful if I entered the correct O.

aj_hackman - 4 days ago

Thank you! Because of this, "The Making of Prince of Persia: Journals 1985–1993" by Jordan Mechner is on its way to my house.

mcbrit - 4 days ago

I don't know. I entered, trying to be popular but at least slightly? opiniated:

Tigana, Hyperion, A Fire Upon the Deep, Blindsight, Moby Dick

and I got a list. Sure, read all that or wasn't interested for reasons, I added (only Neuromancer on initial recommendations):

Neuromancer, VALIS, Quantum Thief, Towing Jehovah.

List did not get more interesting.

Book recommendations are still kind of difficult.

NitpickLawyer - 4 days ago

Interesting. I tested it with sci-fi, and it definitely recommends good books, but not sure how accurate it is at surfacing the sub genres / themes. For example for [aurora -ksr, seveneves, project hail mary, ender's game] it gave me dune. Which is a great book, but not in the "first-ish contact" style I hoped it would be.

Another thing I noticed is that it tends to recommend 2nd and 3rd books in a series, which is a bit so-so. If I add the first book in a series, I probably already read the whole series...

zeroq - 4 days ago

Great work!

Some five years ago I was day dreaming about recommendation engine for movies where you could say "hey Ciri, give me a good gangster flick", and it will come up with something that you haven't seen yet but you'd definitely love.

To my amazement almost everyone, even true AI believers, thought it was impossible to achieve. :d

But my question is - having such huge dataset, do we really need AI for it? SASRec/RAG is sexy, but could the same result be achieved with simple ranking and intersections like lastfm did in the past with music?

Some twenty years ago I came up with an idea of "brain" data structure for recommendations where you have all your items (books, movies or articles) modeled as a graph, and whenever you pick something it makes a ripple effect, effectively raising scores in cascade of every adjecent item.

Just like your brain works - when you stumble upon something new it immediately brings back memories of similar things from the past. I never had the opportunity to implement it and test in real life scenario, but I'd be surprised if a variant of this is not widely used across different recommendation systems, like Amazon.

MattGrommes - 4 days ago

This is cool but I'd love the option to filter out the author of the book you entered. I put in Shroud by Adrian Tchaikovsky and almost all the books are others by him, which is fine but doesn't really mix up the stuff I'm reading.

Hilift - 3 days ago

This seems like it should be an easy task for an AI to implement. For example, the question "what is the most helpful rated negative review of the book 'Original Sin' by Jake Tapper?" There are obvious and prominent "helpfulness" ratings of reviews, but they don't seem to be scraped, at least not by Gemini. Additionally, Gemini reports seemingly inaccurate or minimal effort information:

"It is difficult to pinpoint a single "most helpful" negative review of Original Sin by Jake Tapper, as helpfulness ratings on platforms like Amazon or Goodreads are dynamic and subjective, and the provided search results *do not include specific user reviews with their respective helpfulness votes*."

https://share.google/aimode/OnWrGe4j508c4u3gh

majormajor - 4 days ago

Neat! It's a validation of the model that 75%+ of the recommendations are things I've read and also enjoyed, with a few "read, didn't like" and some more "didn't read, don't really want to."

But I think to break the content-bubble effects to find the longer tail, some way to reject or blacklist things - and have that be taken into effect in the model - might help.

sosuke - 3 days ago

Feature request: Combine book series into a single entity. Bummer getting recommendations for another book in the same series as one I already liked and read.

Feature request: Exclude books already in shelf. This is harder I'm sure. I've got 1146 books in my Read shelf.

boplicity - 4 days ago

Please remove my reviews from your LLM model and training data. Thank you.

androng - 4 days ago

I tried to import my book list with "Import goodreads" button and inputting https://www.goodreads.com/user/show/68515148-andrew but it said "import failed, see console"

gen220 - 4 days ago

The best way I’ve found for finding predictably enjoyable fiction is to read interviews with the authors I like, and read about the works and authors they admire or are influenced by. Or who they exchanged letters or communications with, if they’re long dead and no interviews proper exist.

Strongly recommend giving that a try yourself. And trying to build an algorithm around it!

Here’s an example: Tolstoy really admired Turgenev, who was friends with Theodore Storm and Gustave Flaubert, and greatly admired Gogol.

If you like Anna Karenina you’ll probably find something of value in Torrents of Spring, Immensee, Madame Bovary or Dead Souls.

It spiders out pretty quickly!

wtf242 - 3 days ago

That's super cool. I launched a book recommendations feature this year, which works vastly different. I ask users what their favorite books are(which you can rank), and then allow them to import their goodreads data which includes star reviews, and books they have read, then I determine your favorite style of books based on genres and subjects, then use opensearch to find similar books. It's a lot more complicated than that, but seems to work well. I'm always looking for ideas on how to improve this feature. Interested in what you are actually doing on the backend on the how-it-works page. thanks!

here's my recommendations feature: https://thegreatestbooks.org/recommendations

It's much more powerful if you're a member. you can restrict the results to certain genres or book lengths, as we as published date ranges, etc. If someone wants to try out the more powerful feature, DM me and i'll mark your account as a member.

simlevesque - 4 days ago

The How it works it way too short :) I'd love to see some scripts, know the hardware you use, etc...

By the way you could use Summa FTS Wasm + Duckdb Wasm to have the same website without any backend except file hosting. Maybe even just Duckdb Wasm with it's FTS would be enough. Summa FTS is very similar to meilisearch in essence because they're both derived from Tantivy.

https://izihawa.github.io/summa/quick-start/

sexylibrarian - 4 days ago

We've been working on this data set since 2016 and have it covered! Our app is on test flight in private beta and will be sharing it very soon xo

zeroonetwothree - a day ago

It would be nice to have a way to filter to authors not included in the input. Generally it’s not hard for me to realize that if I like one book by an author I might like another. Finding different authors is where I need help.

foresterre - 4 days ago

It seems to work decently even with just one or two titles for popular titles, but less so for the niche.

For example, the title "Impro: Improvisation and the Theatre" by Keith Johnstone, linked by another article posted to HN today gives back the following suggestions:

- Truth in Comedy: The Manual of Improvisation by Charna Halpern - Steve Jobs by Walter Isaacson - 1984 by George Orwell - Harry Potter and the Sorcerer's Stone (Harry Potter, #1) by J.K. Rowling - Sapiens: A Brief History of Humankind by Yuval Noah Harari - The Alchemist by Paulo Coelho - The Tipping Point: How Little Things Can Make a Big Difference by Malcolm Gladwell - Dune (Dune, #1) by Frank Herbert

It's a bit unfortunate that all suggestions are fairly popular titles, which are fairly easy to find, while the unpopular or niche may be just as well written but a lot harder to find.

Within niche topics or books, it is also usually harder to provide multiple similar enough titles up front.

contravariant - 3 days ago

For the intersect page you probably want to order the users by the size of their shelf. For some more obscure combinations I'm mostly getting users who read 10,000s of books, which is less useful than the users with <1000 books.

jauntywundrkind - 4 days ago

Where do nice scrapes like this end up? Are there BitTorrents out there for scrapes like this?

Honestly this would finally be the web2.0 we all wanted & hoped for. It's against majesty that it's all captured owned user content that is legally captured by essentially public message boards/sites.

tgv - 4 days ago

It seems you cannot reject recommendations. Based on two books, the system showed reasonable recommendations, some of which I'd read, including one that I really didn't like. There must be useful information in that, too.

skayvr - 4 days ago

I've worked in recommender systems for a while, and it's great to see them publicized.

SASRec was released in 2018 just after transformer paper, and uses the same attention mechanism but different losses than LLMs. Any plans to upgrade to other item/user prediction models?

greenie_beans - 3 days ago

so sick!! always wanted to do something like this but would use it for my saas (https://bookhead.net), so i'm hesitant to scrape goodreads due to their terms of service.

in addition to the goodreads data, i think it would be interesting to add an author's favorite books by scraping their paris review interviews (and other interviews) and using that as a "review" because i've learned about so much good stuff through an artist mention.

and in a retail context, i've always wondered if a recommendation engine like this could have its own "flavor" based on a specific store's customer buying history. like if a bookstore's customers were weighted in the algorithm so that their similarities scores were given preference. much of what a bookstore carries is based on their customers' taste. you could use the goodreads etc as the base recommendations and then train it on a bookstore's sales history.

a project like this is a bit outside of my expertise but i have a tiny bit of knowledge about it, and now i have a lot more to learn after seeing this. if anybody has any good book recommendations (hehe) or papers i should read to learn about these sort of systems, please let me know!

thank you for sharing!!

fsmv - 3 days ago

Cool so you stole my data and now you're bragging about it?

sajb - 4 days ago

You could color code results from the same author or from the same series as an already added book, since the user most likely already knows about them. Perhaps a toggle to filter these out altogether.

sodality2 - 4 days ago

This is fantastic!!! I've added many results to my want-to-read list, they're very on-point from very few inputs. It would be really cool to import from a user ID, where you can choose some subset of your read list to inspire new suggestions, while excluding all books in your want-to-read and already-read lists. But that's an ongoing scrape to maintain, it's a cat and mouse game you probably don't want to start. I wonder what the legal status of scraped training data is... if you don't reproduce any of the review data I presume you're fine?

Invictus0 - 3 days ago

The problem with recommender engines is they're always recommending the most popular books that are in the same vein as what you've already read. So you're always getting pop-culture pap and not actually-interesting, somewhat more niche and unrelated books that are only tangentially related. The recommendations I got were all pop-psych stuff and other titles by the authors I've already read.

jamesponddotco - 4 days ago

The recommendations are pretty good; even though I only input six books, it was enough for it to recommend books I have on my wish list. Definitely going to play around some more. Plus, the website is super fast, very impressive.

Any chance we could get an API going at some point? Are you planning to open source the work?

I'm interested in the scrapping of Goodreads too. I'm building a book metadata aggregation API and plan on building a scrapper for Goodreads, but I imagine using a data center IP address will be a problem very fast. Were you scrapping from your home network?

illdave - 4 days ago

This is really great - refreshing to have something that's instantly useful, with no need to signup/login. Really fast, immediately helpful - this is wonderful.

- 3 days ago
[deleted]
xkbarkar - 4 days ago

Have nothing to add that hasn’t already been commented. Like the entries in the add list stay. Other than that, my recommendation list keeps coming up with books I have already read and loved and I am hitting the limit :(.

So filtering would be great,

I have seen a few versions of the same books listed more than once.

Loved this. Hope you get to tune it a little.

Also, thank you for not ruining the site with a single popup, email subscription list offer, chatbot, wheelspin from hell anywhere.

Blessings from the popup hating part of the interwebs.

rapatel0 - 4 days ago

So I tried a few disparate books independently:

- Guns Germs and steel - The Alchemist - The Ramayana (a few others)

Harry Potter and the sorcerers stone came up in all of them near the top. :D

MollyRealized - 2 days ago

So, I'm not anyone related to them, but considering you just kind of baldfaced admitted to it right there in your title, I'd get some lawyers on retainer.

loremm - 3 days ago

For intersect I also wonder if you add a filter that the books are within the top rated. Like if I give my favorite books and want to find someone who has my same taste, it doesn't help if they hated (all/most/some) of those books. Tricky in that not all users give star ratings

fridental - 4 days ago

I've entered books from The Expanse and Lockwood & Co series and its output was not really overwhelming: - other books from the series (duh, I don't need a recommender for that recommendation) - Hobbit, Harry Potter, Azimov etc (duh, I like scifi and surely I've already read all the classic works).

_virtu - 4 days ago

Hey OP I’m building a bookclub app. Do you happen to have an api I could plug into? I’d love to add this to our member suggestions section.

tristor - 4 days ago

Two bugs to know about. First, you are using a deprecated API call that fails in Firefox. Second, you are using an HTTP endpoint that fails to upgrade to HTTPS to call the GoodReads API, which also fails with HTTPS-Only enabled in both Chrome and Firefox.

The idea seems good, but since I can't import my GoodReads successfully, it's hard for me to try

nsypteras - 4 days ago

I'm impressed it recommended so many books i've already read and liked! I have a big reading backlog but once it's whittled down I will likely come back to this. One feature request would be to also show a "why this is recommended" for each recommendation so I can further narrow down the list for what I'm looking for

foota - 4 days ago

Many years ago I built a D3 graph based explorer for movies using the IMDB API. You would start by entering a movie title and it would pull up the similar movies from IMDB and you could click them and see more similar ones and they would all be connected based on similarity. It was very fun!

qingcharles - 4 days ago

I put in a bunch of books and hit recommendations and... I'd already read 95% of them, so at least we know it works well! (checking out the other 5% now)

p.s. one idea: when you click [Add] on the recommended books list, it should remove it from that list

p.p.s. if there is a way to filter out the spam "Summary of ____" books, that would be good too

easywood - 4 days ago

Thank you very much, I have always wished for something like this to be part of Goodreads itself. The intersect function especially will help me find hidden gems that other likeminded people have found. I'm looking forward to find out what books I have missed all my life.

spullara - 4 days ago

I would love to be able to filter the resulting list by removing certainly all books that in the same series but I think removing all books by authors that I have already listed would be great to get new things that I haven't already read. The resulting recommendations maybe included 1 new book for me.

noir_lord - 4 days ago

It has a tendency to recommend books in the same series as are input (putting aside that if I like a book in a series I've likely already read the series).

It did suggest Murderbot Diaries (not on the input but a series I have read and did like) and an Adrian Tchaikovsky I hadn't read :).

nickthesick - 4 days ago

I have a web app https://bookhive.buzz which is a GoodReads alternative based on BlueSky’s protocol. I scrape all of the book data from Goodreads too.

I would love to be able to add a recommendation system based on this.

laszlojamf - 4 days ago

I gave it a spin with Gravity's Rainbow. Most of the recommendations are what you'd expect, Pynchon himself, Don Delillo, David Foster Wallace... and then right at the end... The Hobbit, or There and Back Again >.<

maxglute - 4 days ago

Would be nice to not recommend books by author already added for novel discoverability.

comrade1234 - 4 days ago

I gave up on goodreads reviews. I've been burned too many times by highly rated books that weren't that good. If you're into (horny) ya romance fantasy then goodreads is great, but it's not for me. I haven't really found a substitute.

Jayakumark - 4 days ago

Cool work, How much did it cost to train ? Will the source for training be open source ?

cyrusradfar - 4 days ago

I think this is cool and super fast -- kudos on whatever tech you needed to tackle to make it so.

I don't see anyone saying safety or ethics, so I'll just put it out there that it has some safety and ethical considerations you should consider.

Consider "inflammatory" books and how they could be used to harm a group of people. Although I recognize folks post this "publicly", I think the intersection feature provides more than Goodreads.

Let's say, people who have read "Mein Kampf" & "The Anarchists Cookbook" or some other combination that say "Antifa" to the current regime.

I'd recommend you have a list that you consider private, always and allow Users to add to that list so it's more scalable. If folks try to intersect with anything in that list, you can warn that you don't allow intersection with private books.

Anyway, super fun demo!

giobox - 4 days ago

Considering how much treasure has been poured into building recommendation engines for just about everything online, books have always been very difficult for me to find recommendations that work. Interested to try it!

mring33621 - 3 days ago

FEEDBACK:

I should be able to mark recommended books as "Read, Liked"; "Read, Didn't Like"; "Remove, Other Reason"

and then allow a rerun of Recs, based on additional info

josvdwest - 3 days ago

Super cool tool!

Feature request: Be able to import all my goodreads books, unread as well. Not only 64. Most of the recommendations were already on my shelf.

nwhnwh - 4 days ago

I entered "Alone Together: Why We Expect More from Technology and Less from Each Other" and I received books about Steve Jobs, Harry Potter and "The Subtle Art of Not Giving a F*ck". Like how???

jimmoores - 4 days ago

I unexpectedly liked this. I thought the recommendations were actually useful.

cfraenkel - 4 days ago

FYI, on this android tablet (android v12 / FF 144.0.2), the 'start typing a book title...' field doesn't do anything. On the Mac, it brings up a list of matches to select from.

Svoka - 3 days ago

Honestly, with this I see same results as with any other recommendation system - I type some nice Sci-fi/Fantasy I read, it give me generic Sci-Fi fantasy I already read. Even those I really didn't like.

I add those I like to the list, ignoring those I didn't, and in the end I just end up with recommendations I already read and didn't like.

I feel like wasted my time yet with another smart recommendation system.

billfruit - 4 days ago

I feel that the last added book in one's list seem to have more influence on the recommendations, which results in a rather similar type of recommendations.

- 4 days ago
[deleted]
__alexander - 4 days ago

Care to share the scrapped data? I would love to play around with it.

bossyTeacher - 3 days ago

You scrapping Goodreads to make a replacement is like your company making you train the new hire so he can replace you eventually

freen - 3 days ago

The commons of the creative output of humanity is a resource, just like oil or lithium.

We are rapidly replaying the worst of the Resource Curse.

aaronax - 3 days ago

Can you create a list of the most common book names? I think it would be funny to have a shelf of books that all have the same name.

submeta - 4 days ago

Like the idea! Wondering: Weren’t the early LLMs trained on data in Goodreads as well? I can upload and ask ChatGPT as well, and it will give me similar recommendations, no?

thinkcontext - 5 days ago

I'm impressed! It didn't take many books for it to start suggesting other books that I liked and it showed me several solid choices I'm adding to my queue.

fennec-posix - 4 days ago

Very neat. Even found a couple Cold War-setting books to read and an entire series of 6 books on the same topic, All from searching up Team Yankee.

Thanks for the new reading list :D

johnsillings - 2 days ago

it would be a really nice product experience if you could click on a book (in the recommendations), then get recommendations for that book, and so on

esafak - 4 days ago

It is interesting that you chose a contextual recommender when you would think book affinity is not very susceptible to context. Did you try other models too?

smcleod - 4 days ago

Very neat! By chance have you open sourced the code and model anywhere? I'd be really keen to have a play with this.

aidenn0 - 3 days ago

I have "too many books" to add more to books I like and I've read almost all of the recommendations...

mhb - 3 days ago

Do you have any thoughts on how SASRec compares with the SVD-based Cinematch algorithm?

mayahisali - 3 days ago

Curious about how you trained the model on a billion reviews. What architecture did you use?

iamcreasy - 4 days ago

Very fast. Thanks for building it.

Besides title, I'd like to provide suggestion on what type of books I am looking for.

logicprog - 4 days ago

I tested the model out, and I can attest that it's very on point! It gets the assignment

caro_kann - 4 days ago

I tried three different genre books, for each I got 1984 - George Orwell recommendation.

mpern - 4 days ago

I read the title as "I scraped 38 book reviews". Time to get reading glasses...

djoldman - 4 days ago

Can you share the details about the Meilisearch instance? How big is the box and database size?

skerit - 4 days ago

Please make this for tv series too!

stevage - 4 days ago

This is great. would be really nice to be able to reject suggestions though.

djent - 3 days ago

Why not scrape the content of the books too?

Llamamoe - 4 days ago

My impressions are similar as those of others:

- It seems to mostly show me books I've already read and know of, including sequels of what I added, which isn't very useful.

- It ultimately seems to prioritize "highest rated in category" too much, rather than focusing more on what made my chosen books stand out over others.

- Needs a "disliked books" list, especially when the recommendations show me a lot of superficially similar books I hated. I'd like to blacklist them.

- Would be cool to have a discovery mechanism for less popular and even obscure titles. Again, the top picks of each category are very well-known.

- Might not be practical, but I'd like some way to filter by specific features of reviews. E.g. prioritize reviews that say "the MC is a psychopath/murderhobo/rapist" higher for anti-recommendation, ignore reviews that say "whiny character", etc.

the_coffee_bean - 4 days ago

Amazing work. Do you plan on publishing the training code?

- 4 days ago
[deleted]
calebt3141 - 3 days ago

This is fantastic. Thanks for sharing this.

hmokiguess - 3 days ago

People of HN: - Remove my data, unethical, my words are mine! Argh!

Also People of HN: - I built an HN aggregator that shows sentiment analysis of comments and . . .

SilverSlash - 4 days ago

Bless this man! I despise Goodreads but continue to use it, because there are no real alternatives. It feels like they outsourced the creation of that website to some cheap consulting agency in a low cost location and then left it at that. For example, Goodreads hasn't updated its outdated version of React in years.

For a while now I have really wanted good book recommendations matching my tastes. The LLMs suck at this (likely due to the mode collapse that Karpathy mentioned in his excellent podcast appearance on Dwarkesh) and Amazon is very good but only recommends based on the current book you're browsing.

I will try this out now! But could you increase the number of books fed to the recommender or maybe get the top-64 highest rated books instead of just the most or least recent 64?

mna_ - 4 days ago

I typed "Introduction to Real Analysis by Bartle" and I got:

Steve Jobs by Walter Isaacson

Harry Potter and the Sorcerer's Stone (Harry Potter, #1) by JK Rowling

Topology by James R Munkres

and so on.. Munkres' book is relevant and I want to read it, but what have Steve Jobs and Harry Potter got to do with with mathematics?

stichers - 4 days ago

I don't get what "Add" does

momocowcow - 4 days ago

Whatever I put in, it wants me to read Sapiens :_(

lifeisstillgood - 4 days ago

Goodreads - “hey those user written comments belong to us, you need to pay us”

HNUser - “OpenAI told you to go swivel until they made a billion and you accepted that. Samesies “

mhb - 3 days ago

Import doesn't work for me.

atomicnature - 4 days ago

Will this be open source?

dbingham - 4 days ago

See, now this is an excellent use of LLMs (if we're going to be using them at all). Low stakes if it gets shit wrong, but can provide some really useful and surprising answers!

One request, it would be nice to not have to add Goodreads, since I don't use it. I've love to be able to enter a couple of book titles or an author and just get recommendations!

tecleandor - 3 days ago

sigh No thanks. I rather not have my comments used by random LLMs.

deanc - 4 days ago

Looks cool, and no bullshit. Please let us filter recommendations. If I put in a non-fiction book I'm probably looking for recommendations of other non-fiction books :)

conartist6 - 4 days ago

SHAME. Gross. Morally bankrupt. Greedy.

piskov - 4 days ago

Tried and unfortunately it was meh.

brailsafe - 4 days ago

In some sense, it seems to work well, but the results are sort of nothing special and that's not what I'd personally hope for. I put in three books that are unrelated and got results that compare to a standard book store, either from the same series or other meme startup tech bro recommendations that I'd often literally see on the same shelf. I can't say it's not good, because obviously that's how people browse books and that's what you'd get from reviews, which is perhaps why I never consult reviews for anything.

I put in Thinking in Systems and got a bunch of engineering management stuff which I don't care about. Deep work of course gave me all the rich dad poor dad, steve jobs bio, tim ferriswheel crap which shouldn't surprise me at all. Girl with the dragon tattoo gave me the rest of the series.

Thematic similarity + popularity just seems boring, I'd like something that surfaces unusual deep cuts that I wouldn't necessarily find at the book store on the same shelf, but maybe that I could find if I went to a great library and might be out of print, or that I could find on libgen.

With these:

- Thinking In Systems: A Primer

- Paddle to the Amazon: The Ultimate 12,000-Mile Canoe Adventure

- The Elements of Typographic Style

I was kind of hoping to at least get "Grid Systems in Graphic Design" or something, but mostly got Alchemist, Zen', Into the Wild, almost comically mainstream cuts that of course in some cases I've already read or could find in a Cupertino trash can, not that any of them are not worth reading necessarily, but very typical.

An option to surface rarer choices that combine signals from all the books on the list would be neat, like in the above case, the least read real adventure book that somehow touches on the economics of places travelled through with musings about signage or that just happens to use a similar prose that Robert Bringhurst used to make print design theory not dull. Recommendations that only someone with a real sweaty and weird venn diagram of genuine personal deep interests might conjure up, and that a normal person might say "why the hell would I ever read that" but that otherwise amazing books that are just slept on and might never have found a market, or maybe thematically dissimilar+ conceptually similar in aggregate + unpopular. I'd like to be able to input a seed of inspiration that I haven't been able to find the next deeper step in, rather than all the books on how to start a startup in the garage I don't have. If it's James Hoffman's book on brewing coffee at a high level, I wouldn't want another YouTubers book on brewing coffee at a high level, I'd want the Physics of Filter Coffee, or something in an adjacent sphere grid / tree branch that gives me a way to pursue depth AND breadth but not necessarily the same book by someone else, or the same book with different characters. If I've found a seedling or a mushroom, I'd like to explore the root system of that fruiting body, and then at a certain point find a new seedling based on what I've learned so far, or the one video with 50 views that's somehow the best explanation of how to handle back-pressure in highly concurrent systems after I've realized that I don't know shit about concurrency, but not so deep in the stack that I can't bridge the gap; make the series for me.

Granted, my take here might just be an indictment of reviews in general, or at least those sourced from a generic site like goodreads/amazon which is all about popularity and armchair criticism.

- 5 days ago
[deleted]
maxtoc - 4 days ago

[dead]

slipperybeluga - 3 days ago

[dead]

sexylibrarian - 4 days ago

[dead]

6stringmerc - 3 days ago

Wow I wonder how fun conversations in the afterlife with Aaron are going to be for the OP. There are ways to improve broken systems. Pimping them out with glee is not one of them in my book.

Everything about this concept I hate and it’s difficult not to conflate that with the creator. I make comparisons and equivocations. This is an ethical discussion akin to “Can you enjoy Bill Cosby comedy knowing he was a Rapist” and I’m not being glib.