This repo provides three things:
- Over 60 search simulations.
- A search query dataset based on Google Trends in 2023.
- A generated search query dataset based on LLama.
This was used for the paper Harnessing Retrieval-Augmented Generation (RAG) for Uncovering Knowledge Gaps