Sphere-AI-Lab / fdaLinks
Model Merging with Functional Dual Anchors
☆31Updated last week
Alternatives and similar repositories for fda
Users that are interested in fda are comparing it to the libraries listed below
Sorting:
- The official repo of continuous speculative decoding☆30Updated 7 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆103Updated last week
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆31Updated 6 months ago
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling" (NeurIPS 2025).☆47Updated last month
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆26Updated 3 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆114Updated 5 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆40Updated 7 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆145Updated this week
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆47Updated 8 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆51Updated 8 months ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆46Updated 2 months ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆74Updated last year
- Multimodal RewardBench☆54Updated 8 months ago
- ☆35Updated 7 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆43Updated last year
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆77Updated 10 months ago
- 🕹️ Explore cutting-edge techniques in game generation☆49Updated 2 months ago
- [ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆103Updated last year
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 9 months ago
- Train vector quantized CLIP models using pytorch lightning☆20Updated last year
- ☆17Updated 2 years ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated last year
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆46Updated 4 months ago
- ☆76Updated 4 months ago
- Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆166Updated last month
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆75Updated 3 months ago
- [ICLR 2024 Spotlight] Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Communi…☆11Updated last year
- ☆13Updated 9 months ago
- Geometric-Mean Policy Optimization☆89Updated 3 weeks ago
- The official implementation of Recurrent Diffusion for Large-Scale Parameter Generation.☆70Updated last month