vicksEmmanuel / latent-gemma
☆25Updated 3 months ago
Alternatives and similar repositories for latent-gemma:
Users that are interested in latent-gemma are comparing it to the libraries listed below
- ☆60Updated last year
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆30Updated last month
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆165Updated 4 months ago
- Replicating O1 inference-time scaling laws☆84Updated 5 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 8 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 3 months ago
- A repository for research on medium sized language models.☆76Updated 11 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆91Updated 2 weeks ago
- ☆91Updated 7 months ago
- ☆170Updated 2 weeks ago
- ☆114Updated 2 months ago
- ☆78Updated 8 months ago
- ☆17Updated 2 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆29Updated last month
- This repo is based on https://github.com/jiaweizzhao/GaLore☆27Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆105Updated this week
- ☆31Updated 3 months ago
- prime-rl is a codebase for decentralized RL training at scale☆85Updated this week
- ☆46Updated 2 months ago
- Simple repository for training small reasoning models☆27Updated 3 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆33Updated last month
- ☆44Updated 11 months ago
- ☆97Updated 10 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆23Updated 2 weeks ago
- ☆33Updated 10 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆76Updated last month
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆45Updated 2 weeks ago
- ☆77Updated 3 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆27Updated this week
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆85Updated last month