fxlrnrpt / ml_misfits
Study group / research-padawan community for the misfits
☆33 Oct 15, 2025 Updated 4 months ago
Alternatives and similar repositories for ml_misfits
Users who are interested in ml_misfits are comparing it to the libraries listed below
- Skoltech NLA 2024 course. ☆36 Dec 10, 2024 Updated last year
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs" ☆10 Dec 30, 2024 Updated last year
- A Zen approach to configuring your Python project ☆15 Feb 5, 2026 Updated last week
- Clustered Compositional Embeddings ☆11 Oct 25, 2023 Updated 2 years ago
- Don't just regulate gradients like in Muon, regulate the weights too ☆31 Jul 30, 2025 Updated 6 months ago
- ☆12 Sep 16, 2024 Updated last year
- iADMM for a low-rank representation optimization problem ☆13 Feb 5, 2021 Updated 5 years ago
- ☆15 Jul 13, 2025 Updated 7 months ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization ☆18 Mar 7, 2025 Updated 11 months ago
- Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems ☆14 Nov 11, 2023 Updated 2 years ago
- Ἀνατομή is a PyTorch library to analyze representation of neural networks ☆13 Jan 31, 2024 Updated 2 years ago
- Least Squares Regression for subspace clustering ☆10 May 27, 2018 Updated 7 years ago
- ☆12 Mar 19, 2021 Updated 4 years ago
- Application to generate an RSS feed from your GitHub notifications. ☆13 Dec 8, 2022 Updated 3 years ago
- ☆12 Jan 17, 2024 Updated 2 years ago
- Design system built with A11Y in mind ☆18 Jan 20, 2026 Updated 3 weeks ago
- This code is used to populate the "ODS jobs dump" Telegram bot, and it can be used for any other dumped Slack channel ☆14 Sep 12, 2022 Updated 3 years ago
- Find context neurons in Pythia models. ☆13 Jun 13, 2023 Updated 2 years ago
- Personal solutions to the Triton Puzzles ☆20 Jul 18, 2024 Updated last year
- Codes for the paper The emergence of clusters in self-attention dynamics. ☆17 Dec 18, 2023 Updated 2 years ago
- Seminars from 2024 Machine Learning course ☆14 Mar 9, 2024 Updated last year
- Repository containing lectures from 2024 Machine Learning course ☆18 Feb 29, 2024 Updated last year
- PyTorch implementation of "Towards k-means-friendly spaces: Simultaneous deep learning and clustering," Bo Yang et al., 2017. ☆17 Jan 15, 2021 Updated 5 years ago
- Code for verifying deep neural feature ansatz ☆21 May 3, 2023 Updated 2 years ago
- Simple crawler for telegram channels ☆17 Dec 22, 2023 Updated 2 years ago
- Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA. ☆15 Feb 8, 2023 Updated 3 years ago
- Code for experiments on transformers using Markovian data. ☆22 Nov 22, 2024 Updated last year
- ☆19 Dec 12, 2023 Updated 2 years ago
- Search index algorithm for GitHub code search ☆27 Mar 24, 2023 Updated 2 years ago
- A state management library for redux ☆25 Feb 16, 2019 Updated 6 years ago
- u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab… ☆19 Jul 2, 2020 Updated 5 years ago
- Distributed Optimization: Analysis and Synthesis via Circuits ☆22 Nov 6, 2024 Updated last year
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?" ☆21 Oct 26, 2023 Updated 2 years ago
- Open Statistics and Probability Theory course ☆22 Aug 31, 2025 Updated 5 months ago
- possibly useful materials for learning RWKV language model. ☆26 Jun 8, 2023 Updated 2 years ago
- ☆25 Dec 20, 2023 Updated 2 years ago
- Standalone implementation of RealMLP-TD-S and its data preprocessing for tabular data classification and regression ☆35 Jul 8, 2024 Updated last year
- Angular builder that allows Terser (Uglify) customization ☆21 Jan 7, 2023 Updated 3 years ago
- Flash Attention in 300-500 lines of CUDA/C++ ☆36 Aug 22, 2025 Updated 5 months ago