☆295 · Apr 15, 2026 · Updated 2 months ago
Alternatives and similar repositories for llm-jepa
Users that are interested in llm-jepa are comparing it to the libraries listed below.
- ☆14 · Dec 12, 2024 · Updated last year
- ☆13 · Apr 23, 2025 · Updated last year
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML ☆59 · Dec 12, 2025 · Updated 6 months ago
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT ☆32 · Feb 6, 2026 · Updated 4 months ago
- The repository contains code for Adaptive Data Optimization ☆36 · Dec 9, 2024 · Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆113 · Nov 25, 2025 · Updated 6 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation ☆33 · Dec 22, 2025 · Updated 5 months ago
- GoldFinch and other hybrid transformer components ☆46 · Jul 20, 2024 · Updated last year
- Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling" ☆46 · May 13, 2026 · Updated last month
- ☆176 · Apr 23, 2025 · Updated last year
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs" ☆35 · Oct 12, 2025 · Updated 8 months ago
- Official implementation of ECCV24 paper: POA ☆24 · Aug 8, 2024 · Updated last year
- ☆24 · Mar 7, 2025 · Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆37 · Aug 14, 2024 · Updated last year
- RePo: Language Models with Context Re-Positioning ☆77 · Mar 30, 2026 · Updated 2 months ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo… ☆15 · Nov 11, 2024 · Updated last year
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/) ☆88 · Mar 22, 2026 · Updated 2 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation ☆19 · Sep 13, 2024 · Updated last year
- Repository for ‘Anomaly Detection and Generation with Diffusion Models: A Survey’. ☆39 · Jun 15, 2025 · Updated last year
- Effect of tokenization on transformers for biological sequence ☆23 · Dec 31, 2025 · Updated 5 months ago
- Distributed file system ☆13 · May 10, 2011 · Updated 15 years ago
- Reliable, minimal and scalable library for pretraining foundation and world models ☆258 · Jun 5, 2026 · Updated last week
- Hub for Open Source AGiXT Extensions, Chains, Prompts, and Agents. ☆17 · Sep 27, 2023 · Updated 2 years ago
- [NeurIPS 2023] Latent Graph Inference with Limited Supervision ☆33 · Feb 1, 2024 · Updated 2 years ago
- The official repo for the code and data of paper SMART ☆40 · Feb 20, 2025 · Updated last year
- CADEvolve: Creating Realistic CAD via Program Evolution ☆43 · Feb 19, 2026 · Updated 3 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference… ☆31 · Nov 14, 2023 · Updated 2 years ago
- The official GitHub page for "Unleashing the Potential of Large Language Models as Prompt Optimizers: An Analogical Analysis with … ☆29 · Dec 12, 2024 · Updated last year
- AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse A… ☆25 · Jan 26, 2026 · Updated 4 months ago
- The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity". ☆17 · Jul 2, 2024 · Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto ☆57 · May 17, 2024 · Updated 2 years ago
- ☆11 · Jun 22, 2025 · Updated 11 months ago
- Official repository for Flash Local Linear Attention ☆36 · May 28, 2026 · Updated 2 weeks ago
- This repository includes the code for HiCo (PyTorch version). ☆12 · Sep 24, 2022 · Updated 3 years ago
- rl from zero pretrain, can it be done? yes. ☆294 · Sep 28, 2025 · Updated 8 months ago
- Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)" ☆188 · May 28, 2026 · Updated 2 weeks ago
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M… ☆13 · Nov 17, 2024 · Updated last year
- MLX Implementation of Recursive Reasoning with Tiny Networks ☆78 · Oct 11, 2025 · Updated 8 months ago
- All information and news with respect to Falcon-H1 series ☆119 · Oct 9, 2025 · Updated 8 months ago