Clustered Compositional Embeddings
☆11Oct 25, 2023Updated 2 years ago
Alternatives and similar repositories for cce
Users that are interested in cce are comparing it to the libraries listed below
Sorting:
- Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems☆14Nov 11, 2023Updated 2 years ago
- ☆12Sep 16, 2024Updated last year
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- [AAAI'23] FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction https://arxiv.org/abs/2304.00902☆11Apr 9, 2023Updated 2 years ago
- ☆35Apr 12, 2024Updated last year
- ☆33Oct 4, 2024Updated last year
- PyTorch implementation for MRL☆22Feb 22, 2024Updated 2 years ago
- Code for lin-RFM used for sparse recovery tasks☆16Mar 13, 2025Updated last year
- MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI tr…☆71Mar 15, 2026Updated last week
- ☆63Oct 3, 2024Updated last year
- Implementations of various linear RNN layers using pytorch and triton☆54Aug 4, 2023Updated 2 years ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated last year
- ☆15Jul 13, 2025Updated 8 months ago
- Official Pytorch implementation of Chromatic Graph Transformers☆10Jun 14, 2023Updated 2 years ago
- Ἀνατομή is a PyTorch library to analyze representation of neural networks☆13Jan 31, 2024Updated 2 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- ☆15Apr 26, 2022Updated 3 years ago
- Don't just regulate gradients like in Muon, regulate the weights too☆31Jul 30, 2025Updated 7 months ago
- An Open Source Implementation of Anthropic's Paper: "Towards Monosemanticity: Decomposing Language Models with Dictionary Learning"☆59May 12, 2024Updated last year
- A Zen approach to configuring your Python project☆16Feb 27, 2026Updated 3 weeks ago
- ☆10Oct 24, 2024Updated last year
- Implementation of the WWW'23 paper "Toward Degree Bias in Embedding-Based Knowledge Graph Completion"☆15Jun 17, 2023Updated 2 years ago
- Triton-based Symmetric Memory operators and examples☆91Jan 15, 2026Updated 2 months ago
- Unofficial Scalable-Softmax Is Superior for Attention☆20May 30, 2025Updated 9 months ago
- ☆29Oct 24, 2025Updated 4 months ago
- ☆16Apr 10, 2024Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆12Jan 17, 2024Updated 2 years ago
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- ☆20Jul 19, 2024Updated last year
- Data science with Pandas and NumPy: EDA, binning, distribution functions, simulations, regression analysis☆11Dec 26, 2024Updated last year
- 📦 Cerebro plugin for applications search and launch on windows and linux☆12Aug 27, 2022Updated 3 years ago
- Personal solutions to the Triton Puzzles☆20Jul 18, 2024Updated last year
- Least Squares Regression for subspace clustering☆10May 27, 2018Updated 7 years ago
- ☆12Mar 19, 2021Updated 5 years ago
- Relation-aware Ensemble Learning for Knowledge Graph Embedding. EMNLP. 2023☆25Dec 1, 2023Updated 2 years ago
- iADMM for a low-rank representation optimization problem☆13Feb 5, 2021Updated 5 years ago
- Conditional Linear Dynamical Systems☆15Oct 7, 2025Updated 5 months ago
- Muon fsdp 2☆55Aug 8, 2025Updated 7 months ago