minyoungg / platonic-repLinks

☆619

Alternatives and similar repositories for platonic-rep

Users that are interested in platonic-rep are comparing it to the libraries listed below

Sorting:

test-time-training / ttt-lm-jax
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
☆423Updated last year
Haiyang-W / TokenFormer
[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
☆574Updated 8 months ago
louaaron / Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
☆651Updated last year
openai / sparse_autoencoder
☆532Updated last year
Prisma-Multimodal / ViT-Prisma
ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).
☆315Updated 3 months ago
srush / annotated-mamba
Annotated version of the Mamba paper
☆489Updated last year
ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆330Updated 11 months ago
goombalab / hnet
H-Net: Hierarchical Network with Dynamic Chunking
☆760Updated 3 weeks ago
lucidrains / nGPT-pytorch
Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI
☆291Updated 4 months ago
kuleshov-group / mdlm
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
☆537Updated 3 weeks ago
ML-GSAI / SMDM
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆326Updated 10 months ago
alexiglad / EBT
PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning
☆534Updated last month
lucidrains / st-moe-pytorch
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch
☆366Updated last year
seal-rg / recurrent-pretraining
Pretraining and inference code for a large-scale depth-recurrent language model
☆836Updated last week
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆320Updated 4 months ago
apple / ml-sigmoid-attention
☆302Updated 6 months ago
jzhang38 / LongMamba
Some preliminary explorations of Mamba's context scaling.
☆216Updated last year
lucidrains / ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
☆542Updated 5 months ago
facebookresearch / jepa-intuitive-physics
This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"
☆189Updated 8 months ago
kuleshov-group / awesome-discrete-diffusion-models
A curated list for awesome discrete diffusion models resources.
☆476Updated last month
MadryLab / modelcomponents
Decomposing and Editing Predictions by Modeling Model Computation
☆138Updated last year
lucidrains / coconut-pytorch
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
☆179Updated 4 months ago
tatsu-lab / gpt_paper_assistant
GPT4 based personalized ArXiv paper assistant bot
☆536Updated last year
NVlabs / DoRA
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
☆866Updated last year
xu3kev / BARC
Bootstrapping ARC
☆144Updated 11 months ago
HKUNLP / diffusion-of-thoughts
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
☆182Updated 7 months ago
dingo-actual / infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention…
☆292Updated last year
NVIDIA / ngpt
Normalized Transformer (nGPT)
☆192Updated 11 months ago
kuleshov-group / bd3lms
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆856Updated 3 months ago
iliao2345 / CompressARC
☆194Updated 2 months ago