☆23 · Jan 5, 2025 · Updated last year
Alternatives and similar repositories for diloco-sim
Users that are interested in diloco-sim are comparing it to the libraries listed below.
- ☆48 · Jan 18, 2024 · Updated 2 years ago
- DeMo: Decoupled Momentum Optimization ☆198 · Dec 2, 2024 · Updated last year
- ☆14 · Apr 16, 2025 · Updated 11 months ago
- ☆16 · Oct 20, 2025 · Updated 5 months ago
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆19 · Jul 24, 2025 · Updated 8 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆17 · Oct 9, 2024 · Updated last year
- ☆11 · Jul 21, 2024 · Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework. ☆118 · Feb 12, 2024 · Updated 2 years ago
- Universal Neurons in GPT2 Language Models ☆30 · May 28, 2024 · Updated last year
- [ICML2024] "FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical Guarantees" by Jiaha… ☆14 · Sep 22, 2024 · Updated last year
- Code for the paper "Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages" (N… ☆17 · Apr 13, 2025 · Updated 11 months ago
- ☆22 · Aug 25, 2025 · Updated 6 months ago
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications ☆52 · Oct 30, 2025 · Updated 4 months ago
- ☆14 · Jan 23, 2025 · Updated last year
- A Python interface to the MAGMA libraries ☆10 · Sep 3, 2016 · Updated 9 years ago
- ☆14 · Jan 12, 2023 · Updated 3 years ago
- Download ebooks from Project Gutenberg ☆13 · Dec 30, 2024 · Updated last year
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections" ☆16 · May 31, 2024 · Updated last year
- A NEW VERSION OF MIXING SECRETS DATASET FOR MUSIC SOURCE SEPARATION ☆21 · Mar 3, 2023 · Updated 3 years ago
- Have an LLM write your biography, probably incorrectly ☆14 · Dec 26, 2024 · Updated last year
- A blitz project to try to mint an onchain NFT in the last ever POW block and first ever POS block. ☆30 · Sep 14, 2022 · Updated 3 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and JAX ☆16 · Jun 16, 2024 · Updated last year
- ☆20 · Dec 23, 2025 · Updated 3 months ago
- The world's fastest Python package for calculating integrated loudness (LUFS) from audio data as NumPy arrays ☆25 · Dec 26, 2025 · Updated 2 months ago
- ☆13 · May 7, 2023 · Updated 2 years ago
- ☆13 · Jan 15, 2025 · Updated last year
- My attempt to improve the speed of the Newton-Schulz algorithm, starting from the dion implementation. ☆33 · Dec 5, 2025 · Updated 3 months ago
- Source code for the experiments of the Trainable Fractional Fourier Transform paper submitted to IEEE Signal Processing Letters. ☆19 · Jun 27, 2024 · Updated last year
- Code related to "Beyond spectral gap: The role of the topology in decentralized learning". ☆14 · Jun 7, 2022 · Updated 3 years ago
- Repository for "Training Language Models To Explain Their Own Computations" ☆21 · Dec 22, 2025 · Updated 3 months ago
- Extensive time series analysis of Chinese PM2.5 content, using models from ARMA and VAR to LSTMs and dynamic time warping clustering ☆11 · Aug 17, 2019 · Updated 6 years ago
- Tools for the LLaMA language model ☆12 · Apr 4, 2023 · Updated 2 years ago
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning" ☆17 · Aug 4, 2020 · Updated 5 years ago
- An implementation of LoRa for EmComm (Emergency Communication) or TacComm (Tactical Communication) ☆19 · Jul 23, 2025 · Updated 8 months ago
- look how they massacred my boy ☆63 · Oct 16, 2024 · Updated last year
- ☆24 · Apr 29, 2025 · Updated 10 months ago
- Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models ☆34 · Sep 19, 2025 · Updated 6 months ago
- Two separate tutorial series for the OpenAI Swarm library, each with 10 files ranging from basic to advanced ☆14 · Jan 14, 2025 · Updated last year
- torch implementation of DiLoCo ☆22 · May 31, 2024 · Updated last year