code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning
β20Jul 16, 2024Updated last year
Alternatives and similar repositories for MCL
Users that are interested in MCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- πΌ Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Expertsβ41Sep 29, 2024Updated last year
- CLIP-MoE: Mixture of Experts for CLIPβ58Oct 10, 2024Updated last year
- The official code repository for the FullFront benchmarkβ27May 16, 2025Updated last year
- βοΈ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraintsβ80Jul 10, 2025Updated 11 months ago
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shapingβ66May 22, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Open-Pandora: On-the-fly Control Video Generationβ35Nov 28, 2024Updated last year
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.β92Feb 15, 2025Updated last year
- β71Jul 8, 2025Updated 11 months ago
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)β26Oct 23, 2024Updated last year
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"β11Jul 1, 2024Updated last year
- Test-time preferenece optimization (ICML 2025).β185May 8, 2025Updated last year
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoningβ75Jul 13, 2025Updated 11 months ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalizationβ19Mar 7, 2025Updated last year
- Adversarial Category Alignment Network for Cross-domain Sentiment Classification (NAACL 2019)β23Jul 4, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarksβ180May 12, 2026Updated last month
- β138Feb 4, 2026Updated 4 months ago
- β139Jun 6, 2025Updated last year
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"β18Mar 15, 2024Updated 2 years ago
- [NeurIPS 2025] Official codebase for T2DA: Offline Meta-RL from Natural Language Supervisionβ17Jun 1, 2025Updated last year
- β18Aug 7, 2024Updated last year
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"β50Oct 9, 2025Updated 8 months ago
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025β31Apr 8, 2025Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong Cβ¦β25Mar 9, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)β32Jul 3, 2024Updated last year
- The official repository for TensorFlow 2.0 implementation of MetaTTE.β10Mar 9, 2022Updated 4 years ago
- π LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Trainingβ93Dec 3, 2024Updated last year
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.β392Jun 1, 2025Updated last year
- β10Aug 3, 2021Updated 4 years ago
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025β35Feb 22, 2026Updated 4 months ago
- [CVPR2025] Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Thinkβ24Jul 1, 2025Updated last year
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Maβ¦β12Sep 13, 2024Updated last year
- Reproduction of LLaVA-v1.5 based on Llama-3-8b LLM backbone.β64Oct 25, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Repository for ''Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale''β14Apr 30, 2024Updated 2 years ago
- β16Sep 2, 2023Updated 2 years ago
- A video retrieval dataset How2R and a video QA dataset How2QAβ24Oct 15, 2020Updated 5 years ago
- Official implementation of TACCO (Task-guided Co-clustering).β16Aug 31, 2024Updated last year
- β13Oct 21, 2021Updated 4 years ago
- TKDE'23: A Survey and Experimental Study on Privacy-Preserving Trajectory Data Publishingβ12May 5, 2023Updated 3 years ago
- [ACM MM 2025 π₯π₯ ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contexβ¦β23Aug 28, 2025Updated 10 months ago