Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise
☆40Aug 29, 2024Updated last year
Alternatives and similar repositories for sinkhorn-router-pytorch
Users that are interested in sinkhorn-router-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆55Nov 25, 2024Updated last year
- 삼각형의 실전! Triton☆16Feb 15, 2024Updated 2 years ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆37Sep 23, 2024Updated last year
- Implementation of Infini-Transformer in Pytorch☆112Jan 4, 2025Updated last year
- Experimental GPU language with meta-programming☆31Sep 6, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Fork of HyenaDNA, a long-range genomic foundation model built with Hyena☆10Aug 14, 2023Updated 2 years ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆137Apr 28, 2026Updated last week
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI☆1,356Jan 27, 2026Updated 3 months ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- ☆50Jan 18, 2024Updated 2 years ago
- ☆21Mar 3, 2025Updated last year
- ☆13Sep 13, 2025Updated 7 months ago
- Implementation of various equivariant models in JAX☆19Apr 12, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining☆12Dec 4, 2023Updated 2 years ago
- PaiNN in jax☆11Jan 14, 2025Updated last year
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind☆178Sep 12, 2024Updated last year
- Implementation of a multimodal diffusion transformer in Pytorch☆107Jun 24, 2024Updated last year
- The source code of paper "Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph" in KDD2022.☆15Jan 9, 2023Updated 3 years ago
- Implementation of the proposed Spline-Based Transformer from Disney Research☆105Nov 9, 2024Updated last year
- Implementation of the proposed MaskBit from Bytedance AI☆82Nov 12, 2024Updated last year
- Research code of Cycle Generative Adversarial Networks for Complementary Item Recommendations.☆21Mar 9, 2023Updated 3 years ago
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code for Novel View Acoustic Synthesis paper☆54Aug 14, 2023Updated 2 years ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆104Dec 22, 2024Updated last year
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- ☆15Oct 19, 2024Updated last year
- E(n) Equivariant GNN in jax☆14Aug 31, 2023Updated 2 years ago
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆345Apr 2, 2025Updated last year
- Unofficial Implementation of Selective Attention Transformer☆20Oct 31, 2024Updated last year
- Aerial Detection Toolbox☆11Jan 18, 2023Updated 3 years ago
- ☆14Apr 26, 2018Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- (TGRS 2024) OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images☆48Jul 14, 2025Updated 9 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- ☆10Apr 24, 2023Updated 3 years ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆126Jul 26, 2024Updated last year
- Repository for the code of the paper "Neural Networks Regularization Through Class-wise Invariant Representation Learning".☆12Oct 1, 2017Updated 8 years ago
- ☆12Oct 10, 2023Updated 2 years ago
- OLD REPOSITORY, new one at repo.rumpkernel.org/rumprun☆44Apr 13, 2015Updated 11 years ago