☆279Apr 15, 2026Updated last month
Alternatives and similar repositories for llm-jepa
Users that are interested in llm-jepa are comparing it to the libraries listed below.
- ☆160Mar 6, 2026Updated 2 months ago
- A fast, lightweight, and extensible RWKV chat UI powered by Flutter. Offline-ready, multi-backend support, ideal for local RWKV inference…☆96May 18, 2026Updated last week
- ☆14Dec 12, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- ☆13Apr 23, 2025Updated last year
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT☆32Feb 6, 2026Updated 3 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆59Dec 12, 2025Updated 5 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 5 months ago
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- ☆11Oct 11, 2023Updated 2 years ago
- Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"☆46May 13, 2026Updated 2 weeks ago
- ☆176Apr 23, 2025Updated last year
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs"☆35Oct 12, 2025Updated 7 months ago
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/)☆83Mar 22, 2026Updated 2 months ago
- ☆23Mar 7, 2025Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆37Aug 14, 2024Updated last year
- RePo: Language Models with Context Re-Positioning☆75Mar 30, 2026Updated last month
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆19Sep 13, 2024Updated last year
- Reliable, minimal and scalable library for pretraining foundation and world models☆234May 17, 2026Updated last week
- Distributed file system☆13May 10, 2011Updated 15 years ago
- Hub for Open Source AGiXT Extensions, Chains, Prompts, and Agents.☆17Sep 27, 2023Updated 2 years ago
- Official code for SongEcho☆63Mar 3, 2026Updated 2 months ago
- The official repo for the code and data of paper SMART☆40Feb 20, 2025Updated last year
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Nov 21, 2024Updated last year
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆31Nov 14, 2023Updated 2 years ago
- Experiments in Joint Embedding Predictive Architectures (JEPAs).☆60Jan 5, 2024Updated 2 years ago
- AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse A…☆25Jan 26, 2026Updated 4 months ago
- Survey and Benchmark of Anomaly Detection in Business Processes☆18Jan 23, 2026Updated 4 months ago
- The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity".☆16Jul 2, 2024Updated last year
- MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots☆95Dec 5, 2025Updated 5 months ago
- ☆28Nov 18, 2022Updated 3 years ago
- ☆11Jun 22, 2025Updated 11 months ago
- Official repository for Flash Local Linear Attention☆23Apr 23, 2026Updated last month
- ☆1,146Jan 25, 2026Updated 4 months ago
- Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)"☆179May 14, 2026Updated last week
- Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".☆13Feb 28, 2026Updated 2 months ago
- Code for the EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- Code, data and weights for the paper **What drives success in physical planning with Joint-Embedding Predictive World Models?**☆270Apr 11, 2026Updated last month