☆33Aug 28, 2024Updated last year
Alternatives and similar repositories for Speculative-RAG
Users that are interested in Speculative-RAG are comparing it to the libraries listed below
- Implementation of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL 2024 Findings).☆29Feb 10, 2025Updated last year
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆19Aug 5, 2025Updated 7 months ago
- Enhancing contextual understanding in large language models through contrastive decoding☆20May 3, 2024Updated last year
- 🔌 Want one client library for all your embeddings? 💙 Choose Catsu! 🐱☆62Feb 20, 2026Updated last month
- Source code of our paper MIND, ACL 2024 Long Paper☆63Nov 14, 2025Updated 4 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated 10 months ago
- ☆23Mar 1, 2025Updated last year
- ☆13Jul 20, 2023Updated 2 years ago
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- Planning for Success: Exploring LLM Long-term Planning Capabilities in Table Understanding☆17Jun 17, 2025Updated 9 months ago
- ☆11Apr 5, 2023Updated 2 years ago
- [CVPR 2025] QuartDepth☆17Mar 24, 2025Updated 11 months ago
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution"☆12Apr 14, 2023Updated 2 years ago
- ☆16Dec 9, 2023Updated 2 years ago
- ☆13Jul 14, 2025Updated 8 months ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆15Nov 25, 2025Updated 3 months ago
- ☆15Apr 11, 2024Updated last year
- Code and data for paper "Large language models can rate news outlet credibility"☆13Aug 10, 2024Updated last year
- The backup repository for FairytaleQA dataset and paper "Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset f…☆10May 30, 2023Updated 2 years ago
- ☆19Aug 23, 2024Updated last year
- ☆10Apr 24, 2024Updated last year
- ☆39Dec 14, 2024Updated last year
- ☆10Apr 24, 2022Updated 3 years ago
- ☆37Jan 19, 2026Updated 2 months ago
- ☆13Jul 25, 2024Updated last year
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 8 months ago
- ☆15Aug 19, 2024Updated last year
- ☆12Jul 30, 2025Updated 7 months ago
- BESA is a differentiable weight pruning technique for large language models.☆17Mar 4, 2024Updated 2 years ago
- ☆11Sep 20, 2024Updated last year
- Indexing/curating/documenting Siri Shortcuts on RoutineHub.☆13Oct 29, 2022Updated 3 years ago
- Fork of Flame repo for training of some new stuff in development☆19Updated this week
- ☆10Jun 21, 2021Updated 4 years ago
- A dataset and CLIP baseline for unrepresentative news thumbnail detection (ACL 2022 workshop)☆12May 26, 2022Updated 3 years ago
- ☆16Oct 12, 2024Updated last year
- [ICCAD 2025] Squant☆15Jul 3, 2025Updated 8 months ago
- A Python implementation of an agent swarm system that works with local LLM servers. The system allows you to create multiple agents that …☆12Nov 20, 2024Updated last year
- RepGhostNetV2: When RepGhost meets MobileNetV4☆16May 29, 2024Updated last year
- This is the source code of IJCNN 2023 paper TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection (TieFake).☆16Dec 21, 2023Updated 2 years ago