☆48Mar 31, 2024Updated 2 years ago
Alternatives and similar repositories for M2
Users that are interested in M2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch implementation for Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability [Neur…☆17Jul 7, 2025Updated 11 months ago
- Simba☆219Mar 24, 2024Updated 2 years ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆216May 11, 2026Updated last month
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- ☆19Dec 12, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Contains the implementation of the EDAIN and EDAIN-KL methods proposed in our paper. The research was also part of the thesis I wrote as …☆16Feb 19, 2024Updated 2 years ago
- TimeMachine: A Time Series is Worth 4 Mambas for Long-term Forecasting☆221Jul 27, 2024Updated last year
- Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models☆13Mar 9, 2024Updated 2 years ago
- ☆27Jun 4, 2024Updated 2 years ago
- ☆13Jul 10, 2024Updated last year
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆28Apr 1, 2026Updated 2 months ago
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".☆18Apr 25, 2025Updated last year
- ☆23Oct 22, 2025Updated 7 months ago
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series…☆29Nov 11, 2024Updated last year
- Just a repository that will house some MLPs and their variants, so to avoid having to reimplement them again and again for different proj…☆50Apr 27, 2026Updated last month
- ☆35Aug 19, 2023Updated 2 years ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆14Mar 7, 2026Updated 3 months ago
- Code and Data for Paper: PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation☆83May 31, 2023Updated 3 years ago
- A holistic framework for advancing LLMs as data science agents☆49May 19, 2026Updated 3 weeks ago
- An operation trying to do the opposite of F.grid_sample☆20Aug 8, 2023Updated 2 years ago
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 8 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆232Oct 16, 2025Updated 7 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Implementation of RevIN is based on TF2.Keras and PyTorch.☆30Aug 4, 2023Updated 2 years ago
- Unsupervised diverse image generation via GANs: Partition Guided Mixture of Generative Adversarial Networks☆13Nov 3, 2021Updated 4 years ago
- ☆49Dec 1, 2025Updated 6 months ago
- ☆18May 13, 2025Updated last year
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"☆50Aug 19, 2024Updated last year
- Bayesian optimization with Standard Gaussian Processes on high dimensional benchmarks☆23Jun 29, 2025Updated 11 months ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- [ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks☆31Feb 5, 2026Updated 4 months ago
- [NeurIPS 2024] The implementation for the paper "Geometric Trajectory Diffusion Models".☆39Jul 22, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- The official repo of continuous speculative decoding☆34Mar 28, 2025Updated last year
- An End-to-end Transformer for Alzheimer's Disease Detection☆22Aug 5, 2025Updated 10 months ago
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…☆17May 27, 2024Updated 2 years ago
- Official implementation of AAAI-2024 paper "Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain"☆13Jun 17, 2024Updated last year
- This is official Pytorch implementation of SPGFusion☆21Sep 2, 2025Updated 9 months ago