ThalamusDB: Query text, tables, images, and audio

github.com

52 points by itrummer 5 days ago


ilaksh - 2 days ago

Does this use CLIP or something to get embeddings for each image and normal text embeddings for the text fields, and then feed the top N results to a VLM (LLM) to select the best answer(s)?

What's the advantage of this over using llamaindex?

Although even asking that question I will be honest, the last thing I used llamaindex for, it seemed mostly everything had to be shoehorned in as using that library was a foregone conclusion, even though ChromaDB was doing just about all the work in the end because the built in test vector store that llamaindex has strangely bad performance with any scale.

I do like how simple the llamaindex DocumentStore or whatever is where you can just point it at a directory. But it seems when using a specific vectordb you often can't do that.

I guess the other thing people do is put everything in postgres. Do people use pgvector to store image embeddings?

AmazingTurtle - 2 days ago

You say it's a DB, given the execution time of up to 600s per query, I say: its an agent.

catlifeonmars - 2 days ago

Dumb question: why is this its own DB vs being a Postgres extension (for example).

- 5 days ago
[deleted]
tarwich - 2 days ago

What a cool idea

satisfice - 2 days ago

How is it tested?

cyanydeez - a day ago

Bizarre coding solutions that reqhire OPENAI

petre - 2 days ago

Seems like a good tool for police work.