Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
☆13 · Nov 19, 2024 · Updated last year
Alternatives and similar repositories for FS-GEN
Users that are interested in FS-GEN are comparing it to the libraries listed below
- Sliding Window Attention Training for Efficient Large Language Models ☆16 · Dec 8, 2025 · Updated 2 months ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code) ☆17 · May 15, 2025 · Updated 9 months ago
- [COLM 2024] Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation ☆15 · Jul 15, 2024 · Updated last year
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza… ☆20 · Nov 21, 2024 · Updated last year
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT. ☆25 · Dec 16, 2024 · Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆12 · Jun 28, 2025 · Updated 8 months ago
- A Text2SQL benchmark for evaluation of Large Language Models ☆41 · Feb 24, 2026 · Updated last week
- Codebase for Instruction Following without Instruction Tuning ☆36 · Sep 24, 2024 · Updated last year
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios ☆16 · Oct 18, 2024 · Updated last year
- A Sober Look at Language Model Reasoning ☆93 · Nov 18, 2025 · Updated 3 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton ☆42 · Feb 13, 2025 · Updated last year
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs ☆42 · Updated this week
- A Framework for Evaluating AI Agent Safety in Realistic Environments ☆30 · Oct 2, 2025 · Updated 5 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 · Jul 19, 2024 · Updated last year
- The official implementation of the paper "DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents" ☆29 · Oct 23, 2025 · Updated 4 months ago
- Symphony: A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi… ☆30 · Oct 30, 2025 · Updated 4 months ago
- [EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents ☆16 · Sep 16, 2025 · Updated 5 months ago
- The code implementation of Symbolic-MoE ☆46 · Sep 2, 2025 · Updated 6 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models ☆48 · Sep 18, 2025 · Updated 5 months ago
- A Practical Zoom-in GUI Grounding and Behavior-Based Evaluation method. ☆20 · Dec 8, 2025 · Updated 2 months ago
- Introduction to Machine Learning using scikit-learn and PyTorch ☆10 · Sep 26, 2019 · Updated 6 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity ☆22 · Aug 28, 2025 · Updated 6 months ago
- Anchored Diffusion Language Model (NeurIPS 2025) ☆27 · Oct 13, 2025 · Updated 4 months ago
- Continuous Pipelined Speculative Decoding ☆16 · Jan 4, 2026 · Updated 2 months ago
- Official Implementation of HIMA (COLM'25) ☆19 · Nov 25, 2025 · Updated 3 months ago
- Code for the paper "Semi-Conditional Normalizing Flows for Semi-Supervised Learning" ☆11 · Mar 30, 2020 · Updated 5 years ago