An open source community implementation of the model from "DIFFERENTIAL TRANSFORMER" paper by Microsoft.
☆41May 12, 2026Updated last week
Alternatives and similar repositories for DifferentialTransformer
Users that are interested in DifferentialTransformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model …☆86Oct 27, 2024Updated last year
- ☆13Oct 14, 2024Updated last year
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆17May 11, 2026Updated last week
- ☆23Apr 16, 2025Updated last year
- [WOSAC 2025] Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework☆22Jul 26, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The code will come soon.☆16Sep 12, 2025Updated 8 months ago
- ☆11Oct 24, 2024Updated last year
- 实时交互输入辅助工具☆10Apr 7, 2022Updated 4 years ago
- Automatic Modulation Classification implemented on different deep learning frameworks☆10Nov 17, 2020Updated 5 years ago
- ☆18Apr 7, 2025Updated last year
- Investigating and Mitigating the Side Effects of Noisy Views for Self-Supervised Clustering Algorithms in Practical Multi-View Scenarios☆11Mar 21, 2024Updated 2 years ago
- Implementation of the paper: "Aurora: A Foundation Model of the Atmosphere" in PyTorch☆24May 12, 2026Updated last week
- Components loss for neural networks in mask-based speech enhancement☆33Nov 20, 2020Updated 5 years ago
- ☆21Mar 25, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Defending AI-Based Automatic Modulation Recognition Models Against Adversarial Attacks☆11Jan 11, 2025Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Apr 24, 2026Updated 3 weeks ago
- ☆13Sep 26, 2023Updated 2 years ago
- [L4DC 2025] Morphological-Symmetry-Equvariant Heterogeneous Graph Neural Network for Robotic Dynamics Learning☆19Dec 6, 2025Updated 5 months ago
- ☆11Nov 24, 2023Updated 2 years ago
- 🛠Robust SSH: auto-reconnect SSH session that preserves your running shell and command. Intuitive, no server-side setup, aimed at simplic…☆13Nov 14, 2025Updated 6 months ago
- A ROS 2 framework for humanoid robot simulation and control, developed by the Computational Robotics Lab (CRL) at ETH Zurich.☆46Feb 10, 2026Updated 3 months ago
- decouped imitation for whole-body humanoid natural locomotion☆16Apr 1, 2025Updated last year
- ACL 2024 (SRW), Official Codebase of our Paper: "MoExtend: Tuning New Experts for Modality and Task Extension"☆15Dec 3, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Isaac Sim 4.5 intellisense and AI context for VSCode & Cursor☆18Aug 13, 2025Updated 9 months ago
- ☆15Jan 2, 2025Updated last year
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆182Jul 21, 2025Updated 10 months ago
- 北邮课程设计与大作业合集☆12Mar 25, 2024Updated 2 years ago
- Phoneme recognizer based on long temporal context (with ALIZE VAD command added)☆17Apr 7, 2012Updated 14 years ago
- ☆14Mar 6, 2023Updated 3 years ago
- Official repository of "Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion" (ACMMM 2024)☆15Oct 31, 2024Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆30May 12, 2026Updated last week
- In this project, we have developed a basic CNN model which is used for "Automatic Modulation Classification" using constellation diagrams…☆17Jun 29, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆16Feb 5, 2024Updated 2 years ago
- Official repository for the paper "Automating Continual Learning"☆19Jun 11, 2025Updated 11 months ago
- [IROS'25] TrajFlow: Multi-modal Motion Prediction via Flow Matching☆71Dec 24, 2025Updated 4 months ago
- Unified framework for robot learning built on NVIDIA Isaac Sim☆15Apr 29, 2026Updated 3 weeks ago
- Leveraging BERT to Improve Spoken Language Identification☆18Nov 22, 2022Updated 3 years ago
- Model-Agnostic Adaptive Testing☆10Dec 16, 2020Updated 5 years ago
- [IEEE TKDE] A LLM-based Recommender System with user&item Tokenizers and a generative retrieval paradigm.☆26Mar 11, 2026Updated 2 months ago