PyTorch implementation of the PEER block from the paper "Mixture of A Million Experts" by Xu Owen He at DeepMind
☆136 · Nov 1, 2025 · Updated 6 months ago
Alternatives and similar repositories for PEER-pytorch
Users that are interested in PEER-pytorch are comparing it to the libraries listed below.
- Mixture of A Million Experts ☆54 · Jul 30, 2024 · Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆104 · Dec 22, 2024 · Updated last year
- Explorations into adversarial losses on top of autoregressive loss for language modeling ☆41 · Dec 21, 2025 · Updated 4 months ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch ☆137 · Apr 28, 2026 · Updated last week
- Implementation of a multimodal diffusion transformer in Pytorch ☆107 · Jun 24, 2024 · Updated last year
- Implementation of a framework for Genie2 in Pytorch ☆157 · Jan 7, 2025 · Updated last year
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models" ☆120 · Apr 13, 2026 · Updated 3 weeks ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- new optimizer ☆20 · Aug 4, 2024 · Updated last year
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch ☆516 · Dec 20, 2025 · Updated 4 months ago
- This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024. ☆107 · Jul 1, 2024 · Updated last year
- This repo is text to speech with learnable audio encoder without alignment with transcript reference ☆53 · Sep 20, 2025 · Updated 7 months ago
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI ☆293 · Jun 3, 2025 · Updated 11 months ago
- A pitch detection model trained to be robust against noise and reverberation environments. ☆27 · Jan 21, 2025 · Updated last year
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch ☆548 · May 16, 2025 · Updated 11 months ago
- Forced alignment decoder for Whisper. ☆15 · Mar 13, 2024 · Updated 2 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆35 · Aug 14, 2024 · Updated last year
- An implementation of the Llama architecture, to instruct and delight ☆21 · May 31, 2025 · Updated 11 months ago
- JAX Scalify: end-to-end scaled arithmetics ☆18 · Oct 30, 2024 · Updated last year
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024) ☆13 · Mar 8, 2024 · Updated 2 years ago
- Implementation of a Light Recurrent Unit in Pytorch ☆49 · Oct 6, 2024 · Updated last year
- A neural speech codec based on discrete WavLM representations ☆26 · Aug 28, 2024 · Updated last year
- Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise ☆40 · Aug 29, 2024 · Updated last year
- Implementation of Autoregressive Diffusion in Pytorch ☆437 · Dec 4, 2025 · Updated 5 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto ☆57 · May 17, 2024 · Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling ☆35 · Mar 27, 2024 · Updated 2 years ago
- VS Code Extension for Kaggle ☆22 · Dec 9, 2024 · Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆22 · Jun 7, 2025 · Updated 10 months ago
- Implementation of the Mamba SSM with hf_integration. ☆55 · Aug 31, 2024 · Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators ☆19 · Apr 24, 2026 · Updated last week
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-… ☆26 · Feb 1, 2026 · Updated 3 months ago
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch ☆104 · Apr 3, 2026 · Updated last month
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch ☆345 · Apr 2, 2025 · Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆149 · Aug 22, 2022 · Updated 3 years ago
- ☆13 · Jun 11, 2024 · Updated last year
- Exploration into the Firefly algorithm in Pytorch ☆41 · Feb 14, 2025 · Updated last year
- Source code for the paper "Do Deep Neural Network Solutions form a Star Domain?" ☆12 · May 26, 2024 · Updated last year
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge ☆21 · Jul 25, 2022 · Updated 3 years ago
- (WIP) Parallel inference for black-forest-labs' FLUX model. ☆19 · Nov 18, 2024 · Updated last year