Study group / research-padawan community for the misfits
☆34 · Oct 15, 2025 · Updated 6 months ago
Alternatives and similar repositories for ml_misfits
Users that are interested in ml_misfits are comparing it to the libraries listed below.
- This code is used to populate the "ODS jobs dump" Telegram bot, and it can be used for any other dumped Slack channel ☆14 · Sep 12, 2022 · Updated 3 years ago
- Skoltech NLA 2024 course. ☆37 · Dec 10, 2024 · Updated last year
- Search index algorithm for GitHub code search ☆31 · Mar 24, 2023 · Updated 3 years ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization ☆19 · Mar 7, 2025 · Updated last year
- ☆14 · Jul 13, 2025 · Updated 9 months ago
- Official PyTorch implementation of Chromatic Graph Transformers ☆10 · Jun 14, 2023 · Updated 2 years ago
- Clustered Compositional Embeddings ☆13 · Oct 25, 2023 · Updated 2 years ago
- Code and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs" ☆11 · Dec 30, 2024 · Updated last year
- Don't just regulate gradients like in Muon, regulate the weights too ☆32 · Jul 30, 2025 · Updated 9 months ago
- "Functional Java by Example" articles, which you can find on my blog. ☆11 · Nov 30, 2019 · Updated 6 years ago
- Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems ☆15 · Nov 11, 2023 · Updated 2 years ago
- A Zen approach to configuring your Python project ☆17 · Feb 27, 2026 · Updated 2 months ago
- ☆12 · Sep 16, 2024 · Updated last year
- ☆12 · Jun 30, 2021 · Updated 4 years ago
- Unofficial Scalable-Softmax Is Superior for Attention ☆20 · May 30, 2025 · Updated 11 months ago
- Find context neurons in Pythia models. ☆13 · Jun 13, 2023 · Updated 2 years ago
- Open Statistics and Probability Theory course ☆22 · Aug 31, 2025 · Updated 8 months ago
- Least Squares Regression for subspace clustering ☆11 · May 27, 2018 · Updated 7 years ago
- ☆12 · Mar 19, 2021 · Updated 5 years ago
- iADMM for a low-rank representation optimization problem ☆13 · Feb 5, 2021 · Updated 5 years ago
- Basic Webpack config (Pug, SCSS, JS, TS, SVG support) ☆14 · Mar 1, 2021 · Updated 5 years ago
- Simple crawler for Telegram channels ☆18 · Dec 22, 2023 · Updated 2 years ago
- Conditional Linear Dynamical Systems ☆16 · Oct 7, 2025 · Updated 7 months ago
- Personal solutions to the Triton Puzzles ☆21 · Jul 18, 2024 · Updated last year
- PyTorch implementation of "Towards k-means-friendly spaces: Simultaneous deep learning and clustering," Bo Yang et al., 2017. ☆17 · Jan 15, 2021 · Updated 5 years ago
- Code for PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization, NeurIPS 2022 ☆18 · Nov 23, 2022 · Updated 3 years ago
- ☆27 · Dec 20, 2020 · Updated 5 years ago
- Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA. ☆14 · Feb 8, 2023 · Updated 3 years ago
- OSX and Ubuntu phonetic Russian keyboards on Windows ☆13 · Mar 23, 2020 · Updated 6 years ago
- 1.2% test error on MNIST using only least squares and numpy calls. ☆22 · Sep 13, 2023 · Updated 2 years ago
- u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab… ☆19 · Jul 2, 2020 · Updated 5 years ago
- Data Science, Visualization, and Predictive Analytics: Kaggle Dataset - 1.6M accidents & traffic flow over 16 years ☆17 · Feb 10, 2018 · Updated 8 years ago
- Possibly useful materials for learning the RWKV language model. ☆26 · Jun 8, 2023 · Updated 2 years ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode… ☆19 · Nov 19, 2024 · Updated last year
- Linear Algebra course taught at HSE in 2020/2021 (in Russian) ☆32 · Apr 25, 2022 · Updated 4 years ago
- Schema-based HTTP client powered by axios. Written in TypeScript. Heavily inspired by AngularJS' $resource. ☆17 · Feb 17, 2025 · Updated last year
- ☆19 · Dec 12, 2023 · Updated 2 years ago
- ☆26 · Dec 20, 2023 · Updated 2 years ago
- ☆55 · Jul 24, 2025 · Updated 9 months ago