Google DeepMind: Mixture of Depths Unofficial Implementation.
☆12May 29, 2024Updated last year
Alternatives and similar repositories for Mixture-of-Depths
Users that are interested in Mixture-of-Depths are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆117Apr 13, 2026Updated last week
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆47Jul 4, 2024Updated last year
- [ICLR 2024] Official pytorch implementation of "Denoising Task Routing for Diffusion Models"☆25Feb 19, 2024Updated 2 years ago
- Explanation of the llama2 repo.☆12Jul 18, 2024Updated last year
- EnSaaS document☆11Oct 1, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Decoupled Neural Interfaces Using Synthetic Gradients - under develeopment☆11Jun 27, 2025Updated 9 months ago
- Testing Difference Target Propagation (DTP) on MNIST.☆13Oct 12, 2020Updated 5 years ago
- ☆15Apr 11, 2024Updated 2 years ago
- Advanced audio player component (audix) for Streamlit with waveform visualization and region selection☆14Jun 24, 2025Updated 9 months ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆175Jun 20, 2024Updated last year
- The Go driver for MongoDB☆15Nov 3, 2021Updated 4 years ago
- DLL注入工具☆12Nov 9, 2020Updated 5 years ago
- developing tools for LIAF-SNNs and LIF-SNNs☆10Sep 14, 2022Updated 3 years ago
- "Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)☆13Jan 17, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- This codes presents examples of constructing primitives for data structures with Hyperdimensional Computing/Vector Symbolic Architectures☆15Jun 4, 2021Updated 4 years ago
- ☆15Jun 26, 2024Updated last year
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5☆16Sep 19, 2024Updated last year
- Unreal Engine 5 3D Platformer game prototype☆18May 27, 2024Updated last year
- Repository for Sparse Universal Transformers☆20Oct 23, 2023Updated 2 years ago
- ☆78Mar 12, 2026Updated last month
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 3 years ago
- [NeurIPS 2024] Advancing Training Efficiency of Deep Spiking Neural Networks through Rate-based Backpropagation☆19Jan 16, 2025Updated last year
- AI Studio by Metric Coders: A No-Code Software to train, download and deploy Large Language Models.☆12Jul 5, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [3DV 2025] CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences☆19Jan 5, 2026Updated 3 months ago
- PilotFish harvests the free GPU cycles of cloud gaming with deep learning training☆14Jul 2, 2022Updated 3 years ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆27Dec 17, 2024Updated last year
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆26Jan 21, 2026Updated 2 months ago
- Implementation of spiking DQN training using different conversion techniques and backpropagation with surrogate gradients employed on the…☆11Feb 11, 2023Updated 3 years ago
- 基于FISCO-BCOS区块链的供应链demo,使用node.js构建后端☆10Jan 28, 2021Updated 5 years ago
- HACSurv: A Hierarchical Copula-based Approach for Survival Analysis with Dependent Competing Risks☆13Mar 5, 2025Updated last year
- [ECCV 2024] Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning☆18Sep 23, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Nov 5, 2019Updated 6 years ago
- ☆19Nov 5, 2024Updated last year
- Create a new backward path for more accurate SNN gradients.☆17Aug 19, 2024Updated last year
- ☆18Nov 26, 2024Updated last year
- ZJU standard C Compiler☆11Dec 18, 2016Updated 9 years ago
- ☆16May 23, 2024Updated last year
- Symbioism: A Third Path for the Intelligence Age☆53Nov 26, 2025Updated 4 months ago