Code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning
☆20 · Jul 16, 2024 · Updated last year
Alternatives and similar repositories for MCL
Users that are interested in MCL are comparing it to the libraries listed below
- Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts ☆41 · Sep 29, 2024 · Updated last year
- CLIP-MoE: Mixture of Experts for CLIP ☆56 · Oct 10, 2024 · Updated last year
- The official code repository for the FullFront benchmark ☆27 · May 16, 2025 · Updated 10 months ago
- [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints ☆79 · Jul 10, 2025 · Updated 8 months ago
- Open-Pandora: On-the-fly Control Video Generation ☆35 · Nov 28, 2024 · Updated last year
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models. ☆88 · Feb 15, 2025 · Updated last year
- ☆68 · Jul 8, 2025 · Updated 8 months ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts" ☆10 · Jul 1, 2024 · Updated last year
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆71 · Jul 13, 2025 · Updated 8 months ago
- [CVPR' 25] Official repo for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Cal… ☆21 · Jun 6, 2025 · Updated 9 months ago
- [ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchma… ☆69 · Jul 17, 2025 · Updated 8 months ago
- [FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks ☆163 · Mar 2, 2026 · Updated 2 weeks ago
- Adversarial Category Alignment Network for Cross-domain Sentiment Classification (NAACL 2019) ☆23 · Jul 4, 2019 · Updated 6 years ago
- [KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations ☆148 · Feb 6, 2026 · Updated last month
- ☆125 · Feb 4, 2026 · Updated last month
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns" ☆18 · Mar 15, 2024 · Updated 2 years ago
- ☆17 · Aug 7, 2024 · Updated last year
- A Theano implementation of a CNN DSEBM (deep structured energy-based model) described in https://arxiv.org/pdf/1605.07717v2.pdf ☆10 · Oct 13, 2016 · Updated 9 years ago
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting" ☆41 · Oct 9, 2025 · Updated 5 months ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and… ☆65 · May 16, 2025 · Updated 10 months ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, … ☆18 · Dec 30, 2021 · Updated 4 years ago
- Caffe/Neon prototxt training file for our Neurocomputing2017 work: Fuzzy Quantitative Deep Compression Network ☆12 · May 30, 2018 · Updated 7 years ago
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C… ☆25 · Mar 9, 2022 · Updated 4 years ago
- ☆18 · Nov 5, 2016 · Updated 9 years ago
- ☆26 · Jul 10, 2025 · Updated 8 months ago
- 15th National University Students Intelligent Car Race: Sound Beacon Group ☆11 · Mar 24, 2022 · Updated 3 years ago
- LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training ☆93 · Dec 3, 2024 · Updated last year
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025 ☆32 · Feb 22, 2026 · Updated last month
- ☆13 · Apr 4, 2024 · Updated last year
- [CVPR2025] Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think ☆23 · Jul 1, 2025 · Updated 8 months ago
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Ma… ☆13 · Sep 13, 2024 · Updated last year
- ☆40 · Jul 20, 2024 · Updated last year
- Repository for "Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale" ☆14 · Apr 30, 2024 · Updated last year
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022) ☆42 · May 13, 2022 · Updated 3 years ago
- ☆14 · Sep 2, 2023 · Updated 2 years ago
- TKDE'23: A Survey and Experimental Study on Privacy-Preserving Trajectory Data Publishing ☆12 · May 5, 2023 · Updated 2 years ago
- ☆13 · Oct 21, 2021 · Updated 4 years ago
- Simplified implementation for Domain Separation Networks ☆13 · Feb 11, 2023 · Updated 3 years ago
- A LaTeX PDF (PPT) template for SUSTech that you can use for your presentations. ☆15 · Sep 14, 2021 · Updated 4 years ago