thomasahle / cce
Clustered Compositional Embeddings
☆11 Updated 2 years ago
Alternatives and similar repositories for cce
Users who are interested in cce are comparing it to the repositories listed below
- nanoGPT-like codebase for LLM training ☆109 Updated last week
- A MAD laboratory to improve AI architecture designs 🧪 ☆133 Updated 10 months ago
- gzip Predicts Data-dependent Scaling Laws ☆34 Updated last year
- ☆34 Updated 11 months ago
- ☆53 Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆89 Updated last year
- Minimum Description Length probing for neural network representations ☆20 Updated 9 months ago
- ☆81 Updated last year
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State ☆20 Updated 3 weeks ago
- Universal Neurons in GPT2 Language Models ☆31 Updated last year
- ☆61 Updated last year
- Sparse and discrete interpretability tool for neural networks ☆64 Updated last year
- PyTorch implementation for MRL ☆19 Updated last year
- Sparse Autoencoder Training Library ☆55 Updated 6 months ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024) ☆15 Updated last year
- Evaluation of neuro-symbolic engines ☆39 Updated last year
- Understanding how features learned by neural networks evolve throughout training ☆39 Updated last year
- LLM training in simple, raw C/CUDA ☆15 Updated 11 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆85 Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆172 Updated 4 months ago
- Attribution-based Parameter Decomposition ☆31 Updated 5 months ago
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo). ☆91 Updated 3 months ago
- ☆55 Updated last year
- ☆52 Updated 7 months ago
- ☆23 Updated 9 months ago
- An annotated implementation of the Hyena Hierarchy paper ☆34 Updated 2 years ago
- ☆87 Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization" ☆91 Updated last year
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok … ☆27 Updated last week