Google DeepMind: Mixture of Depths Unofficial Implementation.
☆12May 29, 2024Updated 2 years ago
Alternatives and similar repositories for Mixture-of-Depths
Users who are interested in Mixture-of-Depths are comparing it to the libraries listed below.
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆121May 19, 2026Updated 3 weeks ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆47Jul 4, 2024Updated last year
- [ICLR 2024] Official pytorch implementation of "Denoising Task Routing for Diffusion Models"☆25Feb 19, 2024Updated 2 years ago
- Explanation of the llama2 repo.☆12Jul 18, 2024Updated last year
- EnSaaS document☆11Oct 1, 2025Updated 8 months ago
- Decoupled Neural Interfaces Using Synthetic Gradients - under development☆11Jun 27, 2025Updated 11 months ago
- Testing Difference Target Propagation (DTP) on MNIST.☆13Oct 12, 2020Updated 5 years ago
- ☆15Apr 11, 2024Updated 2 years ago
- Advanced audio player component (audix) for Streamlit with waveform visualization and region selection☆14Jun 24, 2025Updated 11 months ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆175Jun 20, 2024Updated last year
- The Go driver for MongoDB☆15Nov 3, 2021Updated 4 years ago
- DLL injection tool☆12Nov 9, 2020Updated 5 years ago
- developing tools for LIAF-SNNs and LIF-SNNs☆10Sep 14, 2022Updated 3 years ago
- "Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)☆13Jan 17, 2023Updated 3 years ago
- This code presents examples of constructing primitives for data structures with Hyperdimensional Computing/Vector Symbolic Architectures☆15Jun 4, 2021Updated 5 years ago
- ☆15Jun 26, 2024Updated last year
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5☆16Sep 19, 2024Updated last year
- Unreal Engine 5 3D Platformer game prototype☆19May 27, 2024Updated 2 years ago
- ☆85Mar 12, 2026Updated 2 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 3 years ago
- Repository for Sparse Universal Transformers☆20Oct 23, 2023Updated 2 years ago
- [NeurIPS 2024] Advancing Training Efficiency of Deep Spiking Neural Networks through Rate-based Backpropagation☆20Jan 16, 2025Updated last year
- AI Studio by Metric Coders: A No-Code Software to train, download and deploy Large Language Models.☆12Jul 5, 2024Updated last year
- [3DV 2025] CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences☆20Jan 5, 2026Updated 5 months ago
- PilotFish harvests the free GPU cycles of cloud gaming with deep learning training☆14Jul 2, 2022Updated 3 years ago
- [Preprint] Self-Adversarial One Step Generation via Condition Shifting☆52Apr 15, 2026Updated last month
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆29Dec 17, 2024Updated last year
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- Supply chain demo based on the FISCO-BCOS blockchain, with a node.js backend☆10Jan 28, 2021Updated 5 years ago
- Implementation of spiking DQN training using different conversion techniques and backpropagation with surrogate gradients employed on the…☆11Feb 11, 2023Updated 3 years ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆27Jan 21, 2026Updated 4 months ago
- HACSurv: A Hierarchical Copula-based Approach for Survival Analysis with Dependent Competing Risks☆13Updated this week
- [ECCV 2024] Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning☆18Sep 23, 2024Updated last year
- ☆20Nov 5, 2019Updated 6 years ago
- ☆20Nov 5, 2024Updated last year
- ☆18Nov 26, 2024Updated last year
- Create a new backward path for more accurate SNN gradients.☆17Aug 19, 2024Updated last year
- ZJU standard C Compiler☆11Dec 18, 2016Updated 9 years ago
- ☆16May 23, 2024Updated 2 years ago