☆23Jan 5, 2025Updated last year
Alternatives and similar repositories for diloco-sim
Users that are interested in diloco-sim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- EXO Gym is an open-source Python toolkit that facilitates distributed AI research.☆108Dec 1, 2025Updated 5 months ago
- ☆50Jan 18, 2024Updated 2 years ago
- DeMo: Decoupled Momentum Optimization☆201Dec 2, 2024Updated last year
- ☆19May 16, 2026Updated last week
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆120Feb 12, 2024Updated 2 years ago
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 3 years ago
- ☆19Apr 16, 2025Updated last year
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications☆52Oct 30, 2025Updated 6 months ago
- ☆16Jan 23, 2025Updated last year
- ☆37Nov 14, 2025Updated 6 months ago
- WorldModel is a MaskGIT model trained on 8x8x8 Minecraft voxel volumes. Beyond generating blocks from scratch, it excels in filling space…☆14Sep 12, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separat…☆15Apr 23, 2026Updated last month
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆15Apr 27, 2023Updated 3 years ago
- ☆10Jun 11, 2019Updated 6 years ago
- Examples to control the Opal C1 from within python.☆17May 7, 2023Updated 3 years ago
- 🔮 Alexa integration with Google Assistant☆10Nov 30, 2018Updated 7 years ago
- Download ebooks from the Project Gutenberg☆14Dec 30, 2024Updated last year
- ☆15Jul 6, 2022Updated 3 years ago
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"☆16May 31, 2024Updated last year
- Have an LLM write your biography, probably incorrectly☆14Dec 26, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- ☆20Dec 23, 2025Updated 5 months ago
- ☆13May 7, 2023Updated 3 years ago
- ☆11May 8, 2024Updated 2 years ago
- Substrate TypeScript SDK☆10Sep 20, 2024Updated last year
- A basic pure pytorch implementation of flash attention☆16Oct 28, 2024Updated last year
- Grams: Gradient Descent with Adaptive Momentum Scaling (ICLR 2025 Workshop)☆17Mar 6, 2025Updated last year
- ☆16Oct 9, 2023Updated 2 years ago
- ☆14Dec 15, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code related to ’Beyond spectral gap: The role of the topology in decentralized learning‘.☆14Jun 7, 2022Updated 3 years ago
- Extensive time series analysis of chinese PM2.5 content, using models from ARMA and VAR to LSTMs and dynamic time warping clustering☆12Aug 17, 2019Updated 6 years ago
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated last month
- Efficient misspecification uncertainties for linear regression☆18Updated this week
- Public repo for the paper: "Modeling Intensification for Sign Language Generation: A Computational Approach" by Mert Inan*, Yang Zhong*, …☆14Mar 15, 2022Updated 4 years ago
- ☆23Apr 29, 2025Updated last year