cloneofsimo / minVJEPALinks
☆22Updated 2 weeks ago
Alternatives and similar repositories for minVJEPA
Users that are interested in minVJEPA are comparing it to the libraries listed below
Sorting:
- ☆33Updated 5 months ago
- Focused on fast experimentation and simplicity☆73Updated 5 months ago
- ☆23Updated 11 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated last month
- ☆33Updated 7 months ago
- ☆56Updated 2 months ago
- Code accompanying the paper "Generalized Interpolating Discrete Diffusion"☆80Updated last week
- Official implementation of ECCV24 paper: POA☆24Updated 9 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 3 months ago
- Code for the paper "Function-Space Learning Rates"☆20Updated last month
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆27Updated last week
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆52Updated 2 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆42Updated 6 months ago
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆31Updated last month
- ☆19Updated 2 weeks ago
- ☆33Updated 8 months ago
- RS-IMLE☆39Updated 5 months ago
- ☆24Updated last year
- ☆28Updated 10 months ago
- ☆15Updated 6 months ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Updated 2 weeks ago
- ☆24Updated last month
- research impl of Native Sparse Attention (2502.11089)☆54Updated 3 months ago
- ☆30Updated 7 months ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated last month
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆50Updated this week
- NanoGPT-speedrunning for the poor T4 enjoyers☆66Updated last month
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- ☆63Updated 8 months ago
- alternative way to calculating self attention☆18Updated last year