code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning
β20Jul 16, 2024Updated last year
Alternatives and similar repositories for MCL
Users that are interested in MCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- πΌ Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Expertsβ41Sep 29, 2024Updated last year
- CLIP-MoE: Mixture of Experts for CLIPβ58Oct 10, 2024Updated last year
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shapingβ66May 22, 2025Updated last year
- Open-Pandora: On-the-fly Control Video Generationβ35Nov 28, 2024Updated last year
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.β92Feb 15, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β70Jul 8, 2025Updated 11 months ago
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)β26Oct 23, 2024Updated last year
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"β10Jul 1, 2024Updated last year
- [CVPR' 25] Official repo for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calβ¦β22Jun 6, 2025Updated last year
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalizationβ19Mar 7, 2025Updated last year
- Adversarial Category Alignment Network for Cross-domain Sentiment Classification (NAACL 2019)β23Jul 4, 2019Updated 6 years ago
- β132Feb 4, 2026Updated 4 months ago
- β137Jun 6, 2025Updated last year
- [KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations [ICMLβ¦β177Mar 29, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"β18Mar 15, 2024Updated 2 years ago
- [NeurIPS 2025] Official codebase for T2DA: Offline Meta-RL from Natural Language Supervisionβ17Jun 1, 2025Updated last year
- β18Aug 7, 2024Updated last year
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"β48Oct 9, 2025Updated 8 months ago
- A Theano implementation of a CNN DSEBM (deep structured energy-based model) described in https://arxiv.org/pdf/1605.07717v2.pdfβ10Oct 13, 2016Updated 9 years ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets andβ¦β69May 16, 2025Updated last year
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, β¦β18Dec 30, 2021Updated 4 years ago
- Caffe/Neon prototxt training file for our Neurocomputing2017 work: Fuzzy Quantitative Deep Compression Networkβ11May 30, 2018Updated 8 years ago
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"β12Oct 31, 2024Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025β31Apr 8, 2025Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong Cβ¦β25Mar 9, 2022Updated 4 years ago
- β27Jul 10, 2025Updated 11 months ago
- The official repository for TensorFlow 2.0 implementation of MetaTTE.β10Mar 9, 2022Updated 4 years ago
- π LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Trainingβ93Dec 3, 2024Updated last year
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.β390Jun 1, 2025Updated last year
- β10Aug 3, 2021Updated 4 years ago
- β13Apr 4, 2024Updated 2 years ago
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025β35Feb 22, 2026Updated 3 months ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)β42May 13, 2022Updated 4 years ago
- β40Jul 20, 2024Updated last year
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Maβ¦β12Sep 13, 2024Updated last year
- Repository for ''Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale''β14Apr 30, 2024Updated 2 years ago
- β16Sep 2, 2023Updated 2 years ago
- A video retrieval dataset How2R and a video QA dataset How2QAβ24Oct 15, 2020Updated 5 years ago
- TKDE'23: A Survey and Experimental Study on Privacy-Preserving Trajectory Data Publishingβ12May 5, 2023Updated 3 years ago