HACKER Q&A
📣 behnamoh

Which Vector Database do you recommend for LLM applications?


There are many options out there:

- FAISS

- Pinecone

- Milvus

- Qdrant

- Weaviate

- Elasticsearch

- Vespa

- pgvector

- ScaNN

- Vald

I wonder which one is production-ready and has better features. I'm trying to stay away from Langchain, so if there are no integrations with Langchain that's fine.


  👤 andre-z Accepted Answer ✓
Qdrant Team here. It depends on you use case. For smaller data amount and just vector similarity search you can use libraries like FAISS, ScaNN, etc. If you are looking for a simple solution without high requirements on performance and just maybe a few millions of vectors you can use traditional solutions like Elastic or Postgres. Dedicated vector db solutions like Qdrant become relevant when you are looking for real-time performance, scalability up to billions and dedicated features that make your life easier. Qdrant works standalone in local (in memory), docker and (managed) cloud mode by just exchanging the host url. Best available performance https://qdrant.tech/benchmarks/ Easy to start with https://qdrant.tech/documentation/quick-start/ And there is a managed Cloud with 1GB free plan. https://cloud.qdrant.ok You are welcome to join our vibrant community on Discord http://qdrant.to/discord

👤 Chachadef
Have you tried Marqo? check the repo : https://github.com/marqo-ai/marqo

👤 zX41ZdbW
ClickHouse if you need search + analytics and full SQL support: https://clickhouse.com/blog/vector-search-clickhouse-p2

👤 abhi9u
I have been using FAISS for running https://drpawd.com. It is pretty solid, the only downside is that it's all in-memory so as your dataset grows you need to throw more memory at it.

👤 taf2
Using elasticsearch was easy since we already have it in our infrastructure - curious if others have been using it for similarity search?

👤 TalktoCrystal
Milvus seems to be the most popular one.

👤 Mockapapella
pgvector is an easy bolt-on if you’re already using Postgres