Tag: rag
All the articles with the tag "rag".
-
Implementing Spatially-Grounded Document Retrieval via Patch-to-Region Propagation
A deep dive into my recent research on spatially-grounded document retrieval using ColPali models and OCR bounding boxes, enabling precise region-level retrieval during inference time and without additional training.
-
Snappy: Your Vision Retrieval Buddy!
How Snappy evolved from the nextjs-fastapi-colpali template into a vision-first document retrieval system
-
The Most Beautiful RAG: Starring ColPali, Qdrant, Minio and Friends
Updated:An end-to-end, page-level Vision RAG template with ColPali-style embeddings, Qdrant multivector retrieval (with optional binary quantization), and MinIO-backed storage — dockerized and API-first.
-
You too can run the Vidore Benchmark with less than 32GB of GPU VRAM
Quick, practical notes to run the Vidore benchmark smoothly on a single 32GB GPU: dtype, batch size, and common OOM fixes.
-
Audio RAG with ColQwen2.5-Omni
An audio RAG system that processes video URLs and answers questions about their content using ColQwen2.5-Omni and OpenAI audio
Athrael.net