aassxun / Understanding-Vision-TasksLinks
☆208Updated last month
Alternatives and similar repositories for Understanding-Vision-Tasks
Users that are interested in Understanding-Vision-Tasks are comparing it to the libraries listed below
Sorting:
- Efficient controlnet for DiTs☆379Updated last month
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs☆157Updated 3 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆59Updated 3 months ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆49Updated 11 months ago
- [ICRA 2024] Official code for BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection☆2Updated 11 months ago
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy☆67Updated 5 months ago
- [ICML2025] Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment☆104Updated last week
- ☆320Updated 3 months ago
- CVPR2025☆39Updated 3 months ago
- ☆206Updated 2 months ago
- ☆150Updated 8 months ago
- Improving Generalist Model with Domain-Specific Experts☆85Updated 5 months ago
- Official code of the paper "Relational Representation Learning Network for Cross-Spectral Image Patch Matching"☆33Updated 4 months ago
- [ACL 2025] FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation☆27Updated last week
- [Arxiv 2024] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation☆66Updated 11 months ago
- Official implementation of paper "Multi-Level Collaboration in Model Merging"☆41Updated 2 months ago
- ☆161Updated 8 months ago
- Workflow runner engine for argo framework☆100Updated 4 months ago
- A PyTorch implementation of diffusion models built from scratch☆38Updated 2 months ago
- Official repository of MMGenBench☆121Updated 3 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆309Updated 5 months ago
- Tokenize The Virtual Agents Onchain☆240Updated 3 weeks ago
- ☆237Updated 2 weeks ago
- Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆50Updated 2 months ago
- Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate (NeurIPS 2024)☆31Updated last year
- Building a Q&A LLM Agent to Answer Questions about Your Dataset☆103Updated 3 months ago
- ☆50Updated 3 months ago
- This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, c…☆95Updated 3 months ago
- 职星学院企业培训系统是一套基于点播、直播、考试、培训、面授等功能完善的在线教育系统,开源版是基于商业版精简实现的一个企业员工培训系统,致力于打造一个各行业都适用的在线培训系统、企业培训平台、员工培训系统、企业内部培训系统。☆113Updated last week
- 回国VPN推荐 - 翻墙回国的最佳选择☆53Updated last week