code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning
β20Jul 16, 2024Updated last year
Alternatives and similar repositories for MCL
Users that are interested in MCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- πΌ Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Expertsβ41Sep 29, 2024Updated last year
- CLIP-MoE: Mixture of Experts for CLIPβ58Oct 10, 2024Updated last year
- The official code repository for the FullFront benchmarkβ27May 16, 2025Updated 11 months ago
- βοΈ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraintsβ79Jul 10, 2025Updated 9 months ago
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shapingβ64May 22, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Open-Pandora: On-the-fly Control Video Generationβ35Nov 28, 2024Updated last year
- β70Jul 8, 2025Updated 9 months ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"β10Jul 1, 2024Updated last year
- Test-time preferenece optimization (ICML 2025).β182May 8, 2025Updated 11 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoningβ73Jul 13, 2025Updated 9 months ago
- [CVPR' 25] Official repo for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calβ¦β22Jun 6, 2025Updated 10 months ago
- [ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmaβ¦β70Jul 17, 2025Updated 9 months ago
- [FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarksβ170Mar 2, 2026Updated last month
- β130Feb 4, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2025] Official codebase for T2DA: Offline Meta-RL from Natural Language Supervisionβ16Jun 1, 2025Updated 11 months ago
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"β43Oct 9, 2025Updated 6 months ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets andβ¦β67May 16, 2025Updated 11 months ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, β¦β18Dec 30, 2021Updated 4 years ago
- Caffe/Neon prototxt training file for our Neurocomputing2017 work: Fuzzy Quantitative Deep Compression Networkβ11May 30, 2018Updated 7 years ago
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"β12Oct 31, 2024Updated last year
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025β30Apr 8, 2025Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong Cβ¦β25Mar 9, 2022Updated 4 years ago
- π LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Trainingβ93Dec 3, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.β363Jun 1, 2025Updated 11 months ago
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025β34Feb 22, 2026Updated 2 months ago
- [CVPR2025] Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Thinkβ24Jul 1, 2025Updated 10 months ago
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Maβ¦β13Sep 13, 2024Updated last year
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)β42May 13, 2022Updated 3 years ago
- β40Jul 20, 2024Updated last year
- β13Oct 21, 2021Updated 4 years ago
- Official implementation of TACCO (Task-guided Co-clustering).β15Aug 31, 2024Updated last year
- Simplified implementation for Domain Seperation Networksβ13Feb 11, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ACM MM 2025 π₯π₯ ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contexβ¦β22Aug 28, 2025Updated 8 months ago
- [NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse rewardβ36Sep 19, 2025Updated 7 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"β442Mar 20, 2026Updated last month
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replayβ33Apr 14, 2024Updated 2 years ago
- βοΈ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".β56Feb 23, 2026Updated 2 months ago
- Domain Adaptive Text Style Transfer, EMNLP 2019β70Oct 15, 2019Updated 6 years ago
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoningβ150Updated this week