Translationaut 5 minutes ago

Why do we need vectors for search anyway? The results are often unrelated to the query. Aren't therefore exact matches better? One could also annotate the corpus with related tags and hypothetical questions, if we need more results.

minimaxir 13 hours ago

If you don't need a vector database, how are you storing the embeddings to compare the query against?

  • zcbenz 13 hours ago

    You can use any serialization that supports numbers, in the repo I used BSER (a binary format of JSON), you can even use raw JSON but it would be very inefficient for float numbers though.