s-sahoo / Eso-LMsView external linksLinks
Esoteric Language Models
☆111Updated this week
Alternatives and similar repositories for Eso-LMs
Users that are interested in Eso-LMs are comparing it to the libraries listed below
Sorting:
- Resa: Transparent Reasoning Models via SAEs☆47Sep 23, 2025Updated 4 months ago
- Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture. Training an MDM using GPT with this repo!☆33Jun 23, 2025Updated 7 months ago
- [ICML 2025] The Diffusion Duality☆187Dec 27, 2025Updated last month
- ☆14Oct 4, 2024Updated last year
- [NeurIPS 2024] Simple and Effective Masked Diffusion Language Model☆619Sep 29, 2025Updated 4 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated last month
- ☆19Aug 4, 2025Updated 6 months ago
- Implementation of SOAR☆49Sep 17, 2025Updated 4 months ago
- Code and data for paper "(How) do Language Models Track State?"☆21Mar 31, 2025Updated 10 months ago
- ☆12Apr 17, 2025Updated 9 months ago
- Work in progress.☆79Nov 25, 2025Updated 2 months ago
- Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation☆61Jul 22, 2025Updated 6 months ago
- Fork of Flame repo for training of some new stuff in development☆19Jan 5, 2026Updated last month
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆31Jun 5, 2025Updated 8 months ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Jun 13, 2024Updated last year
- ☆270Jun 6, 2025Updated 8 months ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- Generative Modeling via Drifting in MLX☆38Feb 6, 2026Updated last week
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆364Dec 22, 2024Updated last year
- Dream 7B, a large diffusion language model☆1,164Nov 21, 2025Updated 2 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆197Nov 17, 2025Updated 2 months ago
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization☆24Oct 5, 2025Updated 4 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆159Jun 26, 2025Updated 7 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆66Updated this week
- ☆19Mar 3, 2025Updated 11 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆21Nov 9, 2025Updated 3 months ago
- Multimodal RewardBench☆61Feb 21, 2025Updated 11 months ago
- Your favourite classical machine learning algos on the GPU/TPU☆21Dec 14, 2025Updated last month
- ☆34Jul 8, 2025Updated 7 months ago
- The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".☆755Feb 3, 2026Updated last week
- The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster".☆30Mar 19, 2025Updated 10 months ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆3,554Nov 12, 2025Updated 3 months ago
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆40Jul 18, 2025Updated 6 months ago
- An official PyTorch implementation for CLIPPR☆30Jul 22, 2023Updated 2 years ago
- ☆47Oct 2, 2025Updated 4 months ago
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models☆1,574Nov 16, 2025Updated 2 months ago
- [ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)☆701Feb 29, 2024Updated last year
- The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…☆42Jan 5, 2026Updated last month
- ☆67Mar 6, 2025Updated 11 months ago