☆19Jan 10, 2025Updated last year
Alternatives and similar repositories for flex_gen
Users that are interested in flex_gen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 8 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆30Jul 24, 2025Updated 10 months ago
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models☆20Oct 2, 2024Updated last year
- ☆24Sep 25, 2024Updated last year
- HGRN2: Gated Linear RNNs with State Expansion☆57Aug 20, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆16Feb 4, 2025Updated last year
- [CVPR'25] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization☆49Jul 22, 2025Updated 10 months ago
- PyTorch implementation of "Kernel Neural Optimal Transport" (ICLR 2023)☆34Oct 3, 2023Updated 2 years ago
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 3 years ago
- ☆11Jul 30, 2025Updated 9 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆19May 23, 2025Updated last year
- The official repo for the DanQing dataset.☆36Mar 25, 2026Updated 2 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- Official implementation of (ICML 2026) Vector Quantization using Gaussian Variational Autoencoder☆23Jan 3, 2026Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official Implementation (Pytorch) of "DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Represe…☆27Jun 24, 2024Updated last year
- Limited automatic tabular ML pipelines for generic MEDS datasets.☆18Apr 14, 2026Updated last month
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)☆42Sep 8, 2025Updated 8 months ago
- ☆13Sep 13, 2023Updated 2 years ago
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…☆29May 3, 2025Updated last year
- Awesome autoregressive vision foundation models☆26Dec 24, 2024Updated last year
- ☆16Dec 23, 2023Updated 2 years ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆16May 13, 2025Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Log-Polar Space Convolution for Convolutional Neural Networks☆13Dec 12, 2022Updated 3 years ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆19Oct 18, 2025Updated 7 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated last year
- [ICME 2021 Oral] Official implementation for "FGF-GAN: A Lightweight Generative Adversarial Network for Pansharpening via Fast Guided Fil…☆10Mar 29, 2022Updated 4 years ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 9 months ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- Repository for the NeurIPS 2023 paper "Beyond Confidence: Reliable Models Should Also Consider Atypicality"☆13Apr 21, 2024Updated 2 years ago
- 根据Qwen2(Qwen1.5)模型生成qwen2 MoE模型的工具☆15Mar 29, 2024Updated 2 years ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆23Jun 15, 2025Updated 11 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- High-performance Image Tokenizers for VAR and AR☆307Apr 25, 2025Updated last year
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- ☆145Jun 28, 2024Updated last year
- MoVQGAN - model for the image encoding and reconstruction☆265Oct 31, 2023Updated 2 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- ☆21Mar 18, 2026Updated 2 months ago
- Latest and fastest EigenPro that scales to billions of examples☆9Apr 18, 2026Updated last month