Discussion about this post

User's avatar
Raunaq's avatar

Just implemented BM25 + Vector Search. You can check it out here - https://github.com/raunaqness/production-rag/blob/week-3-raunaq-implement-vectors/WEEK4_IMPLEMENTATION_GUIDE.md

This was such a good learning exercise!

Expand full comment
Raunaq's avatar

Some issues I found going through this why you might wanna update in the blog.

1. Delete and rebuild "rag-airflow" and "rag-api" services. I was getting DAG Import errors due to come missing python dependency. Rebuilding the image and container fixed it.

2. Remove the last comma in the query

{

"query": {

"multi_match": {

"query": "machine learning",

"fields": ["title^3", "abstract^2", "authors^1"],

"type": "best_fields"

}

},

"size": 10,

"_source": ["arxiv_id", "title", "authors", "abstract", "categories", "published_date"],

}

Expand full comment

No posts