☆42Apr 23, 2024Updated last year
Alternatives and similar repositories for Faster-LLM-Survey
Users that are interested in Faster-LLM-Survey are comparing it to the libraries listed below
Sorting:
- BESA is a differentiable weight pruning technique for large language models.☆17Mar 4, 2024Updated 2 years ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14May 26, 2024Updated last year
- Replication package for evaluation of code generation metrics☆16Nov 24, 2025Updated 3 months ago
- Transmute AI Lab Model Efficiency Toolkit☆19Oct 2, 2023Updated 2 years ago
- Fork of Flame repo for training of some new stuff in development☆19Feb 27, 2026Updated last week
- Lightweight PDF Q&A tool powered by RAG (Retrieval-Augmented Generation) with MCP (Model Context Protocol) Support.☆22Oct 27, 2025Updated 4 months ago
- Iterate fast on your RAG pipelines☆24Jun 21, 2025Updated 8 months ago
- Evaluate your model using advanced prompt strategies☆21Jan 30, 2026Updated last month
- Are gradient information useful for pruning of LLMs?☆47Aug 23, 2025Updated 6 months ago
- This repo contains the code for studying the interplay between quantization and sparsity methods☆26Feb 26, 2025Updated last year
- 3-Pipeline LLMOps Financial advisor. Steaming pipeline deployed on AWS, 24/7 collects, embeds live-data into QdrantDB. Training pipeline …☆25Apr 12, 2025Updated 10 months ago
- ☆58Oct 6, 2023Updated 2 years ago
- Structural Pruning for LLaMA☆54May 20, 2023Updated 2 years ago
- WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction☆61Sep 3, 2025Updated 6 months ago
- [EMNLP 2023]Context Compression for Auto-regressive Transformers with Sentinel Tokens☆25Nov 6, 2023Updated 2 years ago
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated last year
- ☆29Jun 11, 2023Updated 2 years ago
- ☆34Aug 23, 2023Updated 2 years ago
- [AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models☆70Jan 6, 2024Updated 2 years ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- Pseudo-code Instructions dataset☆27Dec 18, 2023Updated 2 years ago
- Superposition Yields Robust Neural Scaling☆58Feb 12, 2026Updated 3 weeks ago
- BANG is a new pretraining model to Bridge the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation. AR and NAR generat…☆28Feb 6, 2022Updated 4 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Aug 6, 2023Updated 2 years ago
- ☆35May 24, 2024Updated last year
- A new benchmark for measuring LLM's capability to detect bugs in large codebase.☆32Jun 5, 2024Updated last year
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- ☆11Jan 3, 2024Updated 2 years ago
- Text Normalization utilities for normalizing text for TTS☆21Updated this week
- LMTuner: Make the LLM Better for Everyone☆38Sep 21, 2023Updated 2 years ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆79Oct 29, 2025Updated 4 months ago
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆40Feb 1, 2024Updated 2 years ago
- ☆13Feb 4, 2025Updated last year
- ☆10Feb 10, 2022Updated 4 years ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 3 months ago
- Vectorgraph Image Painter☆12Mar 24, 2019Updated 6 years ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆48Jan 4, 2025Updated last year
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models☆13Nov 2, 2023Updated 2 years ago
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases☆13Dec 2, 2024Updated last year