JerryYin777 / Jerry_CVLinks
☆10Updated 2 years ago
Alternatives and similar repositories for Jerry_CV
Users that are interested in Jerry_CV are comparing it to the libraries listed below
Sorting:
- ☆19Updated last year
- ☆125Updated last year
- ☆152Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆233Updated last year
- Sharing my research toolchain☆87Updated 2 years ago
- toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts☆28Updated last year
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆168Updated 7 months ago
- 🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…☆64Updated 8 months ago
- Unleashing Reasoning in Medical Large Language Models☆12Updated 10 months ago
- Code release for VTW (AAAI 2025 Oral)☆64Updated 2 months ago
- Recent Advances on MLLM's Reasoning Ability☆26Updated 9 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆152Updated 6 months ago
- The official repository for "Rongsheng Wang's Arxiv Template"☆55Updated 8 months ago
- ☆20Updated 8 months ago
- Awesome Low-Rank Adaptation☆59Updated 5 months ago
- ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence☆40Updated 5 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆85Updated 7 months ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆60Updated last year
- ☆43Updated last year
- 学术双语简历模 板,涵盖教育背景、论文发表、项目经历、竞赛经历和个人陈述等关键部分,可适用于申请研究生项目、学术职位或相关行业岗位。☆172Updated 7 months ago
- Awesome-Low-Rank-Adaptation☆127Updated last year
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆60Updated 6 months ago
- Large language model review prompts☆366Updated last month
- OOD Generalization相关文章的阅读笔记☆35Updated last year
- ☆56Updated last year
- CLIP-MoE: Mixture of Experts for CLIP☆55Updated last year
- ☆37Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆89Updated 11 months ago
- classification and solutions for PKU-CSSummerCamp-OnlineJudge☆22Updated 2 years ago
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆67Updated last year