jerber / lang-jepa
☆109Updated 4 months ago
Alternatives and similar repositories for lang-jepa:
Users that are interested in lang-jepa are comparing it to the libraries listed below
- ☆97Updated 6 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 3 months ago
- smolLM with Entropix sampler on pytorch☆151Updated 6 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆171Updated this week
- smol models are fun too☆92Updated 5 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated last month
- Simple Transformer in Jax☆136Updated 10 months ago
- Draw more samples☆189Updated 10 months ago
- Open source interpretability artefacts for R1.☆103Updated 2 weeks ago
- prime-rl is a codebase for decentralized RL training at scale☆85Updated this week
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆171Updated last week
- ☆130Updated last month
- Train your own SOTA deductive reasoning model☆91Updated last month
- ☆80Updated 3 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆189Updated 11 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆105Updated this week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆137Updated last month
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆307Updated 5 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆91Updated 2 weeks ago
- ☆94Updated 3 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 6 months ago
- ☆54Updated 3 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆150Updated this week
- DeMo: Decoupled Momentum Optimization☆186Updated 5 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆172Updated last month
- The history files when recording human interaction while solving ARC tasks☆107Updated last week
- look how they massacred my boy☆63Updated 6 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆82Updated last month
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆189Updated 5 months ago