13331112522 / v-rag
Visual RAG using less than 300 lines of code.
☆24Updated 10 months ago
Alternatives and similar repositories for v-rag:
Users that are interested in v-rag are comparing it to the libraries listed below
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆33Updated 10 months ago
- ☆29Updated last year
- ☆12Updated last year
- ☆13Updated last year
- The open source community's implementation of the all-new Multi-Modal Causal Attention from "DeepSpeed-VisualChat: Multi-Round Multi-Imag…☆12Updated 10 months ago
- ☆12Updated 9 months ago
- ☆60Updated 3 months ago
- Lottery Ticket Adaptation☆37Updated last month
- BH hackathon☆14Updated 9 months ago
- ☆13Updated last year
- ☆13Updated 10 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆13Updated 11 months ago
- ☆20Updated 11 months ago
- Apps that run on modal.com☆12Updated 7 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 7 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆29Updated 2 months ago
- An alternative way to calculate self-attention☆18Updated 7 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆35Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated last month
- ☆13Updated last month
- Finetune any model on HF in less than 30 seconds☆56Updated 2 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 5 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆79Updated last year
- Seamless Voice Interactions with LLMs☆11Updated last year
- ☆30Updated last year
- ☆12Updated 4 months ago
- XmodelLM☆37Updated last month
- 🍳 AyaMCooking is a Voice-to-Voice Multi-lingual RAG Agent that makes a perfect sous chef for your kitchen, in up to 10 languages 🤌🧑‍🍳☆21Updated 2 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year