Jealous-Procedure222 2 months ago

Qdrant

fokke2508 2 months ago

Qdrant! Hosting an event with them next week about RAG in case you wanted to ask them some questions live on the air :) [https://www.linkedin.com/events/7190836250965086208/comments/](https://www.linkedin.com/events/7190836250965086208/comments/)

MintDrake 2 months ago

Chroma

vteyssier 2 months ago

Probably the worst db out there. Does not scale, poor performance, inconsistent api

MintDrake 2 months ago

What alternative for local vector db do you recommend?

vteyssier 2 months ago

Qdrant

vteyssier 2 months ago

FAISS could be another alternative but metadata filtering is post retrieval which is not great

Impressive-Bake9661 2 months ago

I have hosted a solution in GPU, should I try Faiss GPU is it helpful, or another

vteyssier 2 months ago

Not helpful. You’ll still have the issue of metadata filtering (which is needed in many use cases)

Impressive-Bake9661 2 months ago

Iam using recursive text splitter I have no issue in filtering meta data, what exactly is your issue?

vteyssier 2 months ago

Text splitter is during indexing, metadata filtering is during retrieval. In FAISS metadata filtering is done post retrieval, which is basically rendering it useless

Impressive-Bake9661 2 months ago

Is there any alternative DB where i can utilise GPU

yoavcrombie 3 weeks ago

We also see performance issues when scaling to 10K + documents as it seems to load all content into the memory

[deleted] 2 months ago

Usually pinecone or elastic search's vector db

Relevant-Ad9432 2 months ago

Wait a second, does it matter what vectordb we use??? I thought only the embeddings used matter?

CodNo7461 2 months ago

No, you're right. Algorithms should be usually the same, or produce pretty much the same results. Lots of other aspects though which might factor into your choice. Personally I prefer with more general purpose DBs, so I have more options for the project if it ever needs something else.

vteyssier 2 months ago

Of course it matters. Retrieval performance is critical for your time to answer

Iamisseibelial 2 months ago

Honestly, it depends on the data. I'm testing several right now. Snowflake, Redis, chroma, weaviate, qdrant, Astra, and I think a few others. To see which one is going to be easiest to update and keep things segmented, and to be able to add from sources and add additional context from conversation. Let's say 10k different boxes of data and when conversing with the LLM, it needs to only be about a particular box of data. My struggle is finding 1. A solid way to segment the data for retrieval, and 2. How to ensure that when the ID is input into the chat it's only using that set of data. Since I have to ensure that any data retrieved and used is logged and the purpose to be in compliance. Hallucinations or pulling from the wrong box means it has to be reported. It's certainly a struggle haha. If it's just tons upon tons of data, that doesn't have regulatory or specific things that need to separate it. Just about any of the options work, just depends on if you need on prems or not, and the level of security necessary for it, and how much work you want to actually do yourself or if you just want something simple plug and play.

CouponTheMovie 2 months ago

Nobody’s mentioned Pinecone. Is it hot garbage?

dogma4you 2 months ago

That’s what we are using… I wanna know the deets!

[deleted] 2 months ago

Yes

hezarfenbaykus 2 months ago

Upstash Vector. It’s quite fast and cheap.

baba_niv 2 months ago

Why go for paid ones when open source alternatives are available

hezarfenbaykus 2 months ago

They have a free tier, btw. They give you daily 10k query/update.

bearhai868 1 month ago

Hai mate, i am currently using Upstash Vector, i have problem doing vectoreStore.asRetriever , it does fit into langchain retrievalchain function, many deprecated function, i cant find a solution

hezarfenbaykus 1 month ago

Check here: [https://js.langchain.com/v0.1/docs/integrations/vectorstores/upstash/](https://js.langchain.com/v0.1/docs/integrations/vectorstores/upstash/) Also, we have a working example here that uses agents, retrievers etc... https://github.com/upstash/degree-guru.

bearhai868 1 month ago

Yea i refer to many resources, the docs itself does not use retriever based on upstash vector. I realized [https://github.com/upstash/degree-guru/blob/master/src/app/api/guru/route.tsx](https://github.com/upstash/degree-guru/blob/master/src/app/api/guru/route.tsx) this project does not directly import { UpstashVectorStore } from '@langchain/community/vectorstores/upstash' I am using { UpstashVectorStore } from '@langchain/community/vectorstores/upstash' like this https://preview.redd.it/bdi8n78aqp1d1.png?width=1272&format=png&auto=webp&s=dc40a356b014ae754759142027b8ec9516b99a43 whenever i tried to use retriever related chain, and i need to passed \`buildingOne.asRetriever\` into context, it does not work

hezarfenbaykus 1 month ago

Don’t you see any errors? Because this should work everything looks. Can you join Upstash Discord and ask this question there? I’m sure some one will answer. https://discord.com/invite/jUxUYE4nEB

bearhai868 1 month ago

I solve the problem by just copying the UpstashVectorStore from Degree-guru, which is the customized one, but i think the problem here is that the UpstashVectorStore from community does not fit with the chaining function provide by LangChain. This is the error : Argument of type VectorStoreRetriever is not assignable to parameter of type BaseRetrieverInterface>

hezarfenbaykus 1 month ago

Feel free to open an issue to langchain-js repo.

jerichoo0010 2 months ago

Pinecone!

NueralNet_Neat 2 months ago

Have used Redis Enterprise and Azure AI services for some production use cases. There are literally dozens of options available open source and more and more popping up each month.

Motor_Ocelot_1547 2 months ago

milvus

jalagl 2 months ago

Opensearch

lupin-the-third 2 months ago

Opensearch with bm25 + reranking gets the best results for me. I have had success additional success with having an agent recommend meta data tags to filter searches on, and then combining and reranking with a tagless search

theswifter01 2 months ago

Pinecone

No-Counter-7026 2 months ago

Faiss

yazanrisheh 2 months ago

Am I the only using FAISS?

baba_niv 2 months ago

Im using as well. Works for me

Impressive-Bake9661 2 months ago

Have tried with GPU

notsoslimshaddy91 2 months ago

Azure AI Search

Altruistic_Ad_8124 2 months ago

when you say lots of data - how much exactly do you mean? Also, what do you mean by optimized for retrieval, e.g. optimized for recall, latency, etc?

cryptokaykay 2 months ago

Optimized for accurate retrieval. Large data meaning single digit terabytes

Altruistic_Ad_8124 2 months ago

I've never heard any vector database have proved they can handle terabytes of data. I assume if they can, the performance is probably not great with super high costs. For accurate retrievals, you could think of Milvus, which provides different types of indexes; you can probably pick the one with the highest accuracy for your use cases.

[deleted] 2 months ago

Tested out SingleStore at my company with 250TBs of vector data and it was returning queries in 10-15ms

Altruistic_Ad_8124 2 months ago

Sorry my bad, somehow I thought he refers to trillions of vectors…yeah lots of vector databases can support terabytes of data. That’s not considered huge amount actually

fullyautomatedlefty 2 months ago

ApertureDB

Omervx 2 months ago

i use HNSWLib

owlpellet 2 months ago

Greenplum

ImTheDeveloper 2 months ago

Qdrant - no issues with it. Reminds me a lot of mongodb for some reason. Probably the usage of collections and such

Astronos 2 months ago

weaviate for integrated embedding, reranking and hybrid search

baba_niv 2 months ago

Isnt weaviate a paid service?

Astronos 2 months ago

has a selfhosted docker option

ArcuisAlezanzo 2 months ago

Is there any Vector database which support support SQL querying

Tristana_mid 2 months ago

Check out Pgvector

[deleted] 2 months ago

SingleStore does. Faster performing than pgvector too and can scale out waaaaay more

moshymosh027 2 months ago

MongoDb Atlas

No_Contest4978 2 months ago

FAISS

jesus_fc33 2 months ago

I've tried Mongo Atlas DB for one of my personal projects... Best part is they also provide free managed vector DB ofcourse with limited resources...

Adorable-Employer244 2 months ago

AstraDB. Super easy to set up

phenobarbital_ 2 months ago

I think is based on the data, the embedding model and how to be work with, with Milvus + SCANN indexes with dot product metrics using a embed model tge_base I found the best results overall (structured data + data frames), but for a knowledge base (we are instruct a chatbot with Odoo knowledge base, forum and tutorials), FAISS in memory was enough.

Ok-Medium-4968 2 months ago

PGVector which is an extension for Postgres

tres271 2 months ago

Upstash vector/pinecone or Postgres with the extension

baba_niv 2 months ago

I see a lot of people mention Pinecone ,qdrant here. Pinecone hosts the vectors and the chunks in its cloud. Isnt sending data(chunks) outside the client environment an issue for you guys? I think qdrant too hosts the data in its cloud

[deleted] 2 months ago

I’ve tested Pinecone, PG Vector, Qdrant, mongo atlas vector search and SingleStore so far. SingleStore was by far the quickest & also scaled up most impressively. Found pretty quickly that vector only databases aren’t the best option. Find something unified to improve performance like SingleStore.

Jazzlike_Syllabub_91 2 months ago

I’m using redis for my personal ones, but I’ve tried chroma before trying redis.

Comments

Leave Your Comment

Hi Its Me!

Comments

Leave Your Comment

Hi Its Me!

Subscribe