☆114Sep 13, 2025Updated 6 months ago
Alternatives and similar repositories for Multiverse
Users that are interested in Multiverse are comparing it to the libraries listed below
Sorting:
- ☆90Jun 16, 2025Updated 9 months ago
- ☆62Updated this week
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- Repo for collaboration on OSS agentic code search☆35Updated this week
- ☆12Sep 1, 2023Updated 2 years ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆883Jan 28, 2026Updated last month
- SuperDebug,debug如此简单!☆17Jul 19, 2022Updated 3 years ago
- ☆17Aug 5, 2025Updated 7 months ago
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials☆53Feb 21, 2025Updated last year
- [ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation☆251Dec 16, 2024Updated last year
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆134Jan 31, 2026Updated last month
- ☆27Jul 23, 2025Updated 7 months ago
- Kinetics: Rethinking Test-Time Scaling Laws☆86Jul 11, 2025Updated 8 months ago
- ☆29Oct 8, 2025Updated 5 months ago
- (NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"☆46Feb 11, 2026Updated last month
- [ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference☆286May 1, 2025Updated 10 months ago
- RND1: Scaling Diffusion Language Models☆176Feb 22, 2026Updated 3 weeks ago
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)☆45Jan 6, 2026Updated 2 months ago
- Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"☆30Jun 30, 2025Updated 8 months ago
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆41Jul 18, 2025Updated 8 months ago
- ☆47Apr 9, 2025Updated 11 months ago
- TinyNS: Platform-Aware Neurosymbolic Auto Tiny Machine Learning☆25Jun 2, 2023Updated 2 years ago
- ☆84Apr 3, 2025Updated 11 months ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆45Sep 19, 2025Updated 6 months ago
- Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.☆107Aug 5, 2025Updated 7 months ago
- ☆28Aug 13, 2025Updated 7 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Feb 4, 2026Updated last month
- Implementation of <Symbolic Graphics Programming with Large Language Models>☆38Sep 14, 2025Updated 6 months ago
- [ICML 2024] CLLMs: Consistency Large Language Models☆413Nov 16, 2024Updated last year
- Compression for Foundation Models☆35Jul 21, 2025Updated 7 months ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- [ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning☆42Feb 22, 2026Updated 3 weeks ago
- A Homework for Computer Architecture at SJTU☆14Jan 4, 2020Updated 6 years ago
- ☆14Nov 11, 2019Updated 6 years ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated 11 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆423Oct 4, 2025Updated 5 months ago
- Pytorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference☆48Jun 19, 2024Updated last year