☆33Aug 28, 2024Updated last year
Alternatives and similar repositories for Speculative-RAG
Users that are interested in Speculative-RAG are comparing it to the libraries listed below.
- ☆15Sep 16, 2025Updated 8 months ago
- Implementation of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL 2024 Findings).☆28Feb 10, 2025Updated last year
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆16Mar 2, 2026Updated 3 months ago
- ☆27Feb 23, 2026Updated 3 months ago
- [EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners☆19Nov 17, 2025Updated 6 months ago
- Data and code for the paper: Finding Safety Neurons in Large Language Models☆28Jan 29, 2026Updated 4 months ago
- Enhancing contextual understanding in large language models through contrastive decoding☆19May 3, 2024Updated 2 years ago
- Source codes of uAgents and uAgent-based applications☆39May 28, 2026Updated last week
- Diff filtering, text mapping, and windowed transforms for LLM apps☆22Jun 2, 2026Updated last week
- 🔌 Want one client library for all your embeddings? 💙 Choose Catsu! 🐱☆69Apr 21, 2026Updated last month
- ☆23Mar 1, 2025Updated last year
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- This project is my attempt at automating work in Notion.☆17Aug 28, 2025Updated 9 months ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆29Jun 16, 2025Updated 11 months ago
- [CVPR 2025] QuartDepth☆18Mar 24, 2025Updated last year
- AAAI 2025: Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs☆18Nov 9, 2024Updated last year
- An open-source server implementation for running inference with Qwen2-VL series models using FastAPI.☆10Nov 20, 2024Updated last year
- A collection of example AI programs built using DSPy and maintained by the Langtrace AI team.☆54Nov 20, 2024Updated last year
- ☆16Dec 9, 2023Updated 2 years ago
- ☆14Jul 14, 2025Updated 10 months ago
- Code and data for paper "Large language models can rate news outlet credibility"☆13Aug 10, 2024Updated last year
- ☆15Apr 11, 2024Updated 2 years ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆49Apr 23, 2026Updated last month
- Stochastic Multiple Target Sampling Gradient Descent (NeurIPS 2022)☆13Sep 19, 2022Updated 3 years ago
- ☆11Mar 13, 2023Updated 3 years ago
- [CVPR 2026] MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent☆32Apr 30, 2026Updated last month
- ☆10Apr 24, 2022Updated 4 years ago
- ☆17Mar 10, 2025Updated last year
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 11 months ago
- ☆15Jul 25, 2024Updated last year
- ☆38Nov 13, 2025Updated 6 months ago
- ☆23Dec 16, 2025Updated 5 months ago
- ☆12Jul 30, 2025Updated 10 months ago
- A library for structural-semantic chunking of documents.☆13Oct 8, 2025Updated 8 months ago
- A Python implementation of an agent swarm system that works with local LLM servers. The system allows you to create multiple agents that …☆13Nov 20, 2024Updated last year
- Ssebowa is a free and open-source Python library that provides generative AI models.☆15Jan 31, 2024Updated 2 years ago
- ☆44Dec 14, 2024Updated last year
- ☆11Sep 20, 2024Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Jun 1, 2026Updated last week