☆33Aug 28, 2024Updated last year
Alternatives and similar repositories for Speculative-RAG
Users that are interested in Speculative-RAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementaiton of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings)".☆28Feb 10, 2025Updated last year
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated 2 months ago
- Data and code for the paper: Finding Safety Neurons in Large Language Models☆27Jan 29, 2026Updated 3 months ago
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆20Aug 5, 2025Updated 9 months ago
- Turning messy repos into weapons of mass structured context.☆22Feb 20, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 🔌 Want one client library for all your embeddings? 💙 Choose Catsu! 🐱☆66Apr 21, 2026Updated 3 weeks ago
- Source code of our paper MIND, ACL 2024 Long Paper☆65Nov 14, 2025Updated 6 months ago
- ☆11Feb 26, 2021Updated 5 years ago
- ☆13Jul 20, 2023Updated 2 years ago
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- ☆11Apr 5, 2023Updated 3 years ago
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆12Jul 9, 2025Updated 10 months ago
- This is the official repository for our paper "Doctor-R1: Mastering Clinical Inquiry with Experiential Agentic Reinforcement Learning" pu…☆32Apr 11, 2026Updated last month
- Implementation of "REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering"☆35Nov 21, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆16Nov 25, 2025Updated 5 months ago
- ☆15Apr 11, 2024Updated 2 years ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆47Apr 23, 2026Updated 3 weeks ago
- ☆13Apr 9, 2021Updated 5 years ago
- The backup repository for FairytaleQA dataset and paper "Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset f…☆10May 30, 2023Updated 2 years ago
- ☆10Apr 24, 2024Updated 2 years ago
- ☆30Apr 30, 2026Updated 2 weeks ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆25Jun 28, 2025Updated 10 months ago
- ☆14Jul 25, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆38Nov 13, 2025Updated 6 months ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Dec 7, 2024Updated last year
- ☆23Dec 16, 2025Updated 5 months ago
- ☆16Aug 19, 2024Updated last year
- ☆12Jul 30, 2025Updated 9 months ago
- A library for structural-semantic chunking of documents.☆12Oct 8, 2025Updated 7 months ago
- ☆44Dec 14, 2024Updated last year
- ☆11Sep 20, 2024Updated last year
- ☆10Jun 21, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Indexing/curating/documenting Siri Shortcuts on RoutineHub.☆14Oct 29, 2022Updated 3 years ago
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)☆30Apr 9, 2026Updated last month
- Optimization Case Studies: Generic Time Scheduling Problem (GTSP), Resource-Constrained Project Scheduling Problem (RCPSP) with Pulse Var…☆11Nov 7, 2018Updated 7 years ago
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆43Apr 21, 2026Updated 3 weeks ago
- 🌸 A collection of Vietnamese women who are currently working in the field of Computer Science.☆16May 1, 2026Updated 2 weeks ago
- [ICCAD 2025] Squant☆15Jul 3, 2025Updated 10 months ago
- 📆 iOS Todo calendar app☆14May 11, 2026Updated last week