ShadeAlsha / IConLinks

ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"

☆109

Alternatives and similar repositories for ICon

Users that are interested in ICon are comparing it to the libraries listed below

Sorting:

s-sahoo / Eso-LMs
Esoteric Language Models
☆89Updated last week
KindXiaoming / grow-crystals
Getting crystal-like representations with harmonic loss
☆192Updated 4 months ago
apoorvkh / academic-pretraining
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆143Updated 2 months ago
lucidrains / mind-evolution
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆56Updated 2 months ago
ml-jku / EVA
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆41Updated 9 months ago
ml-jku / hopfield-boosting
☆31Updated last year
PolymathicAI / xVal
Repository for code used in the xVal paper
☆140Updated last year
zaydzuhri / softpick-attention
Implementations of attention with the softpick function, naive and FlashAttention-2
☆81Updated 3 months ago
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆101Updated 7 months ago
facebookresearch / Mixture-of-Transformers
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.
☆88Updated 2 months ago
alexiglad / EBT
PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning
☆400Updated 2 weeks ago
GenRobo / MatMamba
Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"
☆60Updated 8 months ago
Zyphra / Zamba2
PyTorch implementation of models from the Zamba2 series.
☆184Updated 6 months ago
jfpuget / ARC-AGI-Challenge-2024
☆56Updated 8 months ago
Think-a-Tron / evolve
open source alpha evolve
☆66Updated 2 months ago
facebookresearch / collaborative-reasoner
Source code for the collaborative reasoner research project at Meta FAIR.
☆99Updated 3 months ago
fangyuan-ksgk / Mini-LLaVA
A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.
☆94Updated 7 months ago
jacobmarks / awesome-neurips-2023
Conference schedule, top papers, and analysis of the data for NeurIPS 2023!
☆119Updated last year
Zyphra / BlackMamba
Code repository for Black Mamba
☆252Updated last year
NVlabs / hymba
☆190Updated 7 months ago
epfml / DenseFormer
☆81Updated last year
EPFL-VILAB / fm-vision-evals
☆68Updated 2 weeks ago
ChenWu98 / algorithmic-creativity
[ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction
☆56Updated 2 months ago
hyperevolnet / Terminator
The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.
☆38Updated 4 months ago
lucidrains / PEER-pytorch
Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind
☆127Updated 11 months ago
bluorion-com / ZClip
Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".
☆131Updated last month
rjha18 / vec2vec
☆216Updated last month
ariG23498 / mmdp
☆27Updated 3 weeks ago
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆130Updated 2 months ago
g-luo / vlm_cross_modal_reps
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆29Updated 3 months ago