☆262Apr 15, 2026Updated 2 weeks ago
Alternatives and similar repositories for llm-jepa
Users interested in llm-jepa are comparing it to the libraries listed below.
- A fast, lightweight, and extensible RWKV chat UI powered by Flutter. Offline-ready, multi-backend support, ideal for local RWKV inference…☆93Updated this week
- ☆14Dec 12, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT☆32Feb 6, 2026Updated 2 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆59Dec 12, 2025Updated 4 months ago
- The repository contains code for Adaptive Data Optimization☆36Dec 9, 2024Updated last year
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 4 months ago
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"☆44Sep 12, 2025Updated 7 months ago
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/)☆71Mar 22, 2026Updated last month
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs"☆33Oct 12, 2025Updated 6 months ago
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- ☆23Mar 7, 2025Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆35Aug 14, 2024Updated last year
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆18Sep 13, 2024Updated last year
- Reliable, minimal and scalable library for pretraining foundation and world models☆210Apr 29, 2026Updated last week
- Effect of tokenization on transformers for biological sequence☆23Dec 31, 2025Updated 4 months ago
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆24Mar 16, 2025Updated last year
- Hub for Open Source AGiXT Extensions, Chains, Prompts, and Agents.☆17Sep 27, 2023Updated 2 years ago
- The official repo for the code and data of paper SMART☆40Feb 20, 2025Updated last year
- Experiments in Joint Embedding Predictive Architectures (JEPAs).☆54Jan 5, 2024Updated 2 years ago
- The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity".☆16Jul 2, 2024Updated last year
- Survey and Benchmark of Anomaly Detection in Business Processes☆18Jan 23, 2026Updated 3 months ago
- RWKV-7 mini☆12Mar 29, 2025Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57May 17, 2024Updated last year
- ☆11Jun 22, 2025Updated 10 months ago
- Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)"☆172Mar 15, 2026Updated last month
- ☆1,084Jan 25, 2026Updated 3 months ago
- Code, data and weights for the paper **What drives success in physical planning with Joint-Embedding Predictive World Models?**☆236Apr 11, 2026Updated 3 weeks ago
- This repository includes the code for HiCo (PyTorch version).☆11Sep 24, 2022Updated 3 years ago
- rl from zero pretrain, can it be done? yes.☆292Sep 28, 2025Updated 7 months ago
- Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".☆12Feb 28, 2026Updated 2 months ago
- All information and news about the Falcon-H1 series☆116Oct 9, 2025Updated 6 months ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 6 months ago
- Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding☆217Jan 12, 2026Updated 3 months ago
- REverse-Engineered Reasoning for Open-Ended Generation☆95Sep 10, 2025Updated 7 months ago
- ☆19Mar 31, 2024Updated 2 years ago
- SimKO: Simple Pass@K Policy Optimization☆31Oct 24, 2025Updated 6 months ago