Google DeepMind: Mixture of Depths Unofficial Implementation.
☆12May 29, 2024Updated last year
Alternatives and similar repositories for Mixture-of-Depths
Users that are interested in Mixture-of-Depths are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆116Updated this week
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆47Jul 4, 2024Updated last year
- [ICLR 2024] Official pytorch implementation of "Denoising Task Routing for Diffusion Models"☆25Feb 19, 2024Updated 2 years ago
- Explanation of the llama2 repo.☆12Jul 18, 2024Updated last year
- EnSaaS document☆11Oct 1, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Decoupled Neural Interfaces Using Synthetic Gradients - under develeopment☆11Jun 27, 2025Updated 9 months ago
- Testing Difference Target Propagation (DTP) on MNIST.☆12Oct 12, 2020Updated 5 years ago
- ☆15Apr 11, 2024Updated last year
- Advanced audio player component (audix) for Streamlit with waveform visualization and region selection☆14Jun 24, 2025Updated 9 months ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆175Jun 20, 2024Updated last year
- The Go driver for MongoDB☆15Nov 3, 2021Updated 4 years ago
- DLL注入工具☆12Nov 9, 2020Updated 5 years ago
- developing tools for LIAF-SNNs and LIF-SNNs☆10Sep 14, 2022Updated 3 years ago
- "Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)☆12Jan 17, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This codes presents examples of constructing primitives for data structures with Hyperdimensional Computing/Vector Symbolic Architectures☆15Jun 4, 2021Updated 4 years ago
- ☆15Jun 26, 2024Updated last year
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5☆16Sep 19, 2024Updated last year
- Unreal Engine 5 3D Platformer game prototype☆17May 27, 2024Updated last year
- Repository for Sparse Universal Transformers☆20Oct 23, 2023Updated 2 years ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 3 years ago
- [NeurIPS 2024] Advancing Training Efficiency of Deep Spiking Neural Networks through Rate-based Backpropagation☆19Jan 16, 2025Updated last year
- AI Studio by Metric Coders: A No-Code Software to train, download and deploy Large Language Models.☆12Jul 5, 2024Updated last year
- [3DV 2025] CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences☆19Jan 5, 2026Updated 2 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- PilotFish harvests the free GPU cycles of cloud gaming with deep learning training☆14Jul 2, 2022Updated 3 years ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆27Dec 17, 2024Updated last year
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆26Jan 21, 2026Updated 2 months ago
- Implementation of spiking DQN training using different conversion techniques and backpropagation with surrogate gradients employed on the…☆11Feb 11, 2023Updated 3 years ago
- 基于FISCO-BCOS区块链的供应链demo,使用node.js构建后端☆10Jan 28, 2021Updated 5 years ago
- HACSurv: A Hierarchical Copula-based Approach for Survival Analysis with Dependent Competing Risks☆12Mar 5, 2025Updated last year
- [ECCV 2024] Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning☆16Sep 23, 2024Updated last year
- ☆20Nov 5, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆19Nov 5, 2024Updated last year
- Symbioism: A Third Path for the Intelligence Age☆54Nov 26, 2025Updated 4 months ago
- Create a new backward path for more accurate SNN gradients.☆17Aug 19, 2024Updated last year
- ☆18Nov 26, 2024Updated last year
- ☆16May 23, 2024Updated last year
- ZJU standard C Compiler☆11Dec 18, 2016Updated 9 years ago
- OpenGL 学习代码☆15Jun 25, 2023Updated 2 years ago