Show HN: Semantic search over the National Gallery of Art

nga.demo.mixedbread.com

142 points by breadislove 3 days ago


philipkglass - 3 days ago

How does this work? I thought it was probably powered by embeddings and maybe some more traditional search code, but I checked out the linked github repo and I didn't see any model/inference code. The public code is a wrapper that communicates with your commercial API?

Some searches work like magic and others seem to veer off target a lot. For example, "sculpture" and "watercolor" worked just about how I'd expect. "Lamb" showed lambs and sheep. But "otter" showed a random selection of animals.

pogilvie - 2 days ago

I built a toy version of something like this a couple-ish years ago for a hackathon. I wrote up a blog of how I did it back then for anyone interested: https://www.patrickogilvie.com/engineering/Image_Search_Engi...

Would be interesting to know how relevant that approach is now.

Trojanking - 9 hours ago

I created a similar website called https://artifair.com, where users can download high-quality artwork.

Computer0 - 3 days ago

This is neat, not sure how to report queries that are working poorly as you have mentioned. But when I search "Waltz" I am presented with Kitchen Utensils and only one piece of dancing folks. Presumably this is due to the Artist's name being 'Walton'.

kburman - 2 days ago

I recently learned that semantic search embeddings mostly represent topics and concepts, but they don’t handle negation or emotion very well.

For example, if you search for “paintings of winter landscapes but without sun and trees,” you’ll still get results with trees. That’s because embeddings capture the presence of concepts like “tree” or “landscape,” but not logical relationships like “without” or “not.”

Similarly, embeddings aren’t great at capturing how something feels. They can tell that “sad poem” and “happy poem” are different mainly because of the words used, not because they truly understand emotional tone.

This happens because most embedding models (like OpenAI’s or sentence-transformers) are trained to group things by semantic similarity, not logical meaning or sentiment. Negation, polarity, and affect aren’t explicitly represented in the vector space.

Might be common knowledge to some, but it was a cool TIL moment for me, realizing that embeddings are great at what something is about, but not how it feels or what it excludes.

nmitchko - 3 days ago

In case anyone wants to do this themselves, check out the pipeline here: https://github.com/isc-nmitchko/iris-document-search

Colnomic and nvidia models are great for embedding images and MUVERA can transform those to 1D vectors.

khaki54 - 2 days ago

Yale has an amazing one, worth looking at: https://lux.collections.yale.edu/

samdg - 2 days ago

I love old stereograms, and was happy to find a couple using this tool!

dfc - 3 days ago

It would be nice if took you to the NGA page about the item. I cant even copy the text easily for easy search.

"Images of german shepherds" never fails to provide some humor.

ted_dunning - 2 days ago

Works really well for some artist names (rembrandt, whistler) and exceedingly poorly for others (john singer sargent).

ulrikhansen54 - 2 days ago

Congrats on the launch guys. I remember meeting ya'll in SF. What happened to your HF model/project?

adamontherun - 2 days ago

love that a search for 'chill vibes sculpture' returned a very chill set of results. nice step change in art search capabilities

yawnxyz - 3 days ago

hey, your service is back up again!!! Mixedbread was my favorite tool for so long since your pivot, and I'm so glad y'all are back

kvsrh - 2 days ago

Is it possible to add other data sources?

joki77 - 2 days ago

Ketika kode dan kanvas bertemu — sebuah pencarian tak sekadar kata, tapi rasa. Di antara lukisan dan batang piksel, mesin mencoba memahami jawaban yang tak terucap.

joki77 - 2 days ago

[dead]

sfblah - 3 days ago

[flagged]