jerber / lang-jepa
☆98 Updated last month
Alternatives and similar repositories for lang-jepa:
Users who are interested in lang-jepa are comparing it to the libraries listed below.
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆167 Updated last month
- smolLM with Entropix sampler on pytorch ☆150 Updated 3 months ago
- ☆96 Updated 4 months ago
- ☆80 Updated last month
- Entropy Based Sampling and Parallel CoT Decoding ☆17 Updated 4 months ago
- look how they massacred my boy ☆63 Updated 4 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆129 Updated this week
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆164 Updated this week
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆184 Updated 8 months ago
- ☆121 Updated last week
- Draw more samples ☆186 Updated 7 months ago
- Simple Transformer in Jax ☆136 Updated 7 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆137 Updated 2 weeks ago
- A MAD laboratory to improve AI architecture designs 🧪 ☆102 Updated 2 months ago
- ☆106 Updated 3 weeks ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆63 Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆95 Updated 3 months ago
- smol models are fun too ☆88 Updated 3 months ago
- ☆123 Updated 6 months ago
- Long context evaluation for large language models ☆199 Updated last week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆161 Updated last week
- DeMo: Decoupled Momentum Optimization ☆180 Updated 2 months ago
- PyTorch implementation of models from the Zamba2 series. ☆176 Updated 3 weeks ago
- PyTorch library for Active Fine-Tuning ☆55 Updated this week
- Understand and test language model architectures on synthetic tasks. ☆181 Updated last month
- Extract full next-token probabilities via language model APIs ☆228 Updated 11 months ago
- ☆161 Updated last month