s-sahoo / Eso-LMsLinks
Esoteric Language Models
☆103Updated 3 weeks ago
Alternatives and similar repositories for Eso-LMs
Users that are interested in Eso-LMs are comparing it to the libraries listed below
Sorting:
- ☆215Updated 2 weeks ago
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models☆222Updated last month
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆53Updated 2 weeks ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆139Updated last week
- ☆100Updated last month
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆289Updated 2 weeks ago
- ☆86Updated last year
- [EMNLP'2025 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆66Updated 6 months ago
- Official repo of paper LM2☆47Updated 8 months ago
- ☆108Updated last year
- SSRL: Self-Search Reinforcement Learning☆148Updated 2 months ago
- ☆65Updated 7 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆35Updated 2 weeks ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆176Updated 3 months ago
- Process Reward Models That Think☆60Updated 2 weeks ago
- ☆85Updated 9 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆43Updated 2 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆162Updated 6 months ago
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study☆55Updated 11 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆324Updated 5 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆102Updated this week
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 2 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆89Updated last year
- Physics of Language Models, Part 4☆252Updated 3 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Updated 7 months ago
- ☆33Updated 9 months ago
- RLP: Reinforcement as a Pretraining Objective☆195Updated 3 weeks ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆129Updated 2 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 5 months ago
- ☆29Updated 4 months ago