Query Expension for Better Query Embedding using LLMs
-
Updated
Feb 18, 2025 - Python
Query Expension for Better Query Embedding using LLMs
Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"
SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.
Evaluation of BEIR Datasets using ColBERT retrieval model
A genral RAG Search chatbot, with SoTA RAG techniques such as HyDE, Hybrid retrieval with BM25 + RRF and Cross encoder reranking. Evaluated on the BEIR scifact dataset and compared all the different pipelines i tried along the way
Physics-Inspired Reranking via Token-Level Point Clouds & PDE Fusion | NFCorpus NDCG@10 = 0.3232 (+47.2%) | 26ms CPU | Zero training
A decentralized cooperative memory & research layer for AI agents — collectively and cooperatively learning and advancing as a community.
GPU benchmark for RAG embedding compression and Faiss IVF-PQ ADC retrieval on FiQA / BEIR.
SERA-VQ: Discrete codes for extreme embedding compression — outperforms PCA+int8 at low memory budgets on BEIR/SciFact
RAG evaluation on BEIR SciFact: BM25, dense and hybrid retrieval with LLM answers.
Research-grade hybrid retrieval API — BM25 + FAISS + CDF calibration + entropy-weighted fusion + cross-encoder reranking. Benchmarked on BEIR SciFact with bootstrap significance tests.
A RAG system that replaces standard BM25/FAISS retrieval with a fully learned neural retrieval stack - including a fine-tuned bi-encoder, a cross-encoder reranker, ColBERT-style late interaction scoring, and a locally hosted LLM generator. Built entirely with free and open-source tools.
端到端 Hybrid RAG 系统 (BEIR/SciFact):BM25 + Dense + Weighted Fusion + RAGAS 评估 + LLM-as-judge 稳定性研究
Browser-side IR benchmark: BM25 vs Semantic vs Hybrid retrieval on SciFact (BEIR). Bauman MSTU NIR 2026.
Scripts to convert the LegalBench-RAG dataset into the standard IR format
BM25 & SBERT retrieval on the FiQA-2018 financial QA benchmark · Gradio demo
Dense retrieval + cross-encoder reranking pipeline benchmarked on BEIR datasets (NDCG, Recall@K, MRR)
Cross-Family LLM-Judge Agreement for Institutional RAG: 5 families, 9 judges. Validated on TREC RAG 2024 (kappa=0.4941) + BEIR scifact.
Argument-Aware RAG with ensemble retrieval and stance-aware structured generation for explainable fact verification (BEIR-FEVER)
Controlled depth ablation of a BERT bi-encoder across training budgets and seeds on three BEIR tasks (nfcorpus, scifact, fiqa). L3–L12 is flat within seed noise at 20K steps; 80K training degrades every depth on zero-shot transfer (−45% NDCG@10 on fiqa for L12).
Add a description, image, and links to the beir topic page so that developers can more easily learn about it.
To associate your repository with the beir topic, visit your repo's landing page and select "manage topics."