jerber / lang-jepa
☆107Updated 3 months ago
Alternatives and similar repositories for lang-jepa:
Users that are interested in lang-jepa are comparing it to the libraries listed below
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆170Updated 2 months ago
- ☆97Updated 6 months ago
- smolLM with Entropix sampler on pytorch☆151Updated 5 months ago
- Simple Transformer in Jax☆136Updated 9 months ago
- Train your own SOTA deductive reasoning model☆83Updated last month
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆169Updated this week
- ☆80Updated 3 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆93Updated last month
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆79Updated last month
- smol models are fun too☆91Updated 5 months ago
- ☆128Updated last week
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 6 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆102Updated this week
- ☆129Updated 7 months ago
- Draw more samples☆189Updated 9 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆60Updated last week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆135Updated last month
- Repository for the paper Stream of Search: Learning to Search in Language☆144Updated 2 months ago
- An introduction to LLM Sampling☆77Updated 3 months ago
- ☆51Updated 2 months ago
- ☆126Updated last week
- A MAD laboratory to improve AI architecture designs 🧪☆109Updated 3 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆428Updated 6 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆103Updated 4 months ago
- Bootstrapping ARC☆108Updated 4 months ago
- ⚖️ Awesome LLM Judges ⚖️☆88Updated last month
- DeMo: Decoupled Momentum Optimization☆185Updated 4 months ago
- Long context evaluation for large language models☆206Updated last month
- ☆92Updated 2 months ago
- look how they massacred my boy☆63Updated 5 months ago