cloneofsimo / minVJEPALinks
☆22Updated last month
Alternatives and similar repositories for minVJEPA
Users that are interested in minVJEPA are comparing it to the libraries listed below
Sorting:
- ☆33Updated 7 months ago
- Focused on fast experimentation and simplicity☆75Updated 6 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆27Updated last month
- ☆17Updated 7 months ago
- ☆56Updated 3 months ago
- ☆33Updated 5 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated last month
- Code accompanying the paper "Generalized Interpolating Discrete Diffusion"☆85Updated 2 weeks ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Updated last month
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆43Updated 7 months ago
- research impl of Native Sparse Attention (2502.11089)☆54Updated 4 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆53Updated 3 weeks ago
- Official implementation of ECCV24 paper: POA☆24Updated 10 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆52Updated 3 months ago
- ☆23Updated last year
- ☆24Updated last year
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 3 months ago
- RS-IMLE☆40Updated 6 months ago
- ☆28Updated 10 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆47Updated 3 months ago
- ☆19Updated last month
- ☆47Updated 4 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- GoldFinch and other hybrid transformer components☆45Updated 11 months ago
- GoldFinch and other hybrid transformer components☆10Updated last month
- Simple repository for training small reasoning models☆33Updated 4 months ago
- ☆34Updated 9 months ago
- ☆24Updated last month
- Resa: Transparent Reasoning Models via SAEs☆36Updated 2 weeks ago
- ☆65Updated last year