Rag

All Posts

Published on
3 December 2025
Implementing Spatially-Grounded Document Retrieval via Patch-to-Region Propagation
rag retrieval-augmented-generation vision multimodal colpali document-retrieval computer-vision embeddings vector-search
'A deep dive into my recent research on spatially-grounded document retrieval using ColPali models and OCR bounding boxes, enabling precise region-level retrieval during inference time and without additional training.'
Published on
28 October 2025
Snappy: Your Vision Retrieval Buddy!
rag retrieval-augmented-generation vision multimodal colpali qdrant minio fastapi python vector-search nextjs frontend document-retrieval
How Snappy evolved from the nextjs-fastapi-colpali template into a vision-first document retrieval system
Published on
22 August 2025
You too can run the Vidore Benchmark with less than 32GB of GPU VRAM
vidore benchmark colpali rag gpu-poor pytorch mteb
Quick, practical notes to run the Vidore benchmark smoothly on a single 32GB GPU: dtype, batch size, and common OOM fixes.
Published on
15 August 2025
The Most Beautiful RAG: Starring ColPali, Qdrant, Minio and Friends
rag retrieval-augmented-generation vision multimodal colpali qdrant minio fastapi python vector-search nextjs frontend binary-quantization
An end-to-end, page-level Vision RAG template with ColPali-style embeddings, Qdrant multivector retrieval (with optional binary quantization), and MinIO-backed storage — dockerized and API-first.
Published on
17 July 2025
Audio RAG with ColQwen2.5-Omni
rag retrieval-augmented-generation audio video-processing colqwen openai gradio little-scripts multimodal embeddings semantic-search python
An audio RAG system that processes video URLs and answers questions about their content using ColQwen2.5-Omni and OpenAI audio
Published on
3 July 2025
The Most Beautiful RAG: Starring Colnomic, Qdrant, Minio and Friends
rag retrieval-augmented-generation qdrant vector-search colbert late-interaction llm python little-scripts embeddings semantic-search colpali colnomic
Introducing the first project in my little-scripts monorepo - A simple, yet beautiful RAG implementation using Colnomic, Qdrant and Nomic
Published on
30 April 2025
Mapping Worlds into Graphs with Qdrant, Neo4j, RF-DETR, BLIP-2 and Kung Fu
rag knowledge-graphs vector-search computer-vision neo4j qdrant video-processing object-detection rf-detr blip-2 entity-tracking
Diving deeper into the GraphRAG rabbit hole, I explore how to transform real-world video data into knowledge graphs using RF-DETR for object detection and BLIP-2 for intelligent entity description - setting the foundation for context-aware retrieval systems.
Published on
26 March 2025
Down the Rabbit Hole - One step closer to Production Grade GraphRAG
rag knowledge-graphs vector-search llm python nlp neo4j qdrant spark-nlp transformers nltk
After my initial experiment with GraphRAG using Qdrant, Neo4j, and Ollama, I took on a journey to build a more dynamic and context-aware system. This post dives into the details of how I constructed a dynamic ontology for NLP GraphRag.
Published on
6 March 2025
GraphRAG with Qdrant, Neo4j, and Ollama (Using Qwen2.5:3b and Nomic text embeddings)
rag knowledge-graphs neo4j vector-search llm python
I've been playing with a new approach to RAG systems - combining vector search with knowledge graphs for more contextual, relationship-aware answers. Here's what I've built, how it works, and why you might want to try it yourself.
Published on
22 February 2024
How much would it cost to store a 1 hour, 60fps 4k Video in a RAG model?
generative-ai conversational-ai rag video-to-text titanium
A simple experiment to calculate the cost of storing a 1 hour, 60fps 4k Video in a RAG model. For no practical reason, whatsoever.

Rag

rag (10)