A unified multimodal model toolkit
☆133May 18, 2026Updated last month
Alternatives and similar repositories for TorchUMM
Users that are interested in TorchUMM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."☆17May 23, 2025Updated last year
- ☆15Nov 18, 2025Updated 7 months ago
- Template for project development.☆14Updated this week
- ☆15Feb 11, 2025Updated last year
- [CVPR 2026 Highlight] PersonaVLM: Long-Term Personalized Multimodal LLMs☆108Apr 16, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Oct 26, 2021Updated 4 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- Official PyTorch Implementation for Testing of TransZero++(TPAMI'22)☆11Aug 25, 2023Updated 2 years ago
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆44Mar 8, 2026Updated 3 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆22Sep 24, 2025Updated 9 months ago
- PyTorch Implementation for InMaP☆12Oct 28, 2023Updated 2 years ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆13Feb 22, 2025Updated last year
- [ACL'25 Main] SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence! | 让你的LLM更好地利用上下文文档:一个基于注意力的简单方案☆29Feb 17, 2025Updated last year
- ☆55Sep 26, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Oct 8, 2022Updated 3 years ago
- official PyTorch implementation of paper "Adversarial Bipartite Graph Learning for Video Domain Adaptation" (MM2020 Oral)☆11Jun 16, 2022Updated 4 years ago
- ICML-2024 highlight paper "Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization"☆19Jul 18, 2024Updated last year
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"☆11Jun 18, 2024Updated 2 years ago
- Official implementation for "CONVIQT: Contrastive Video Quality Estimator"☆25Jun 14, 2022Updated 4 years ago
- Connectivity-contrastive learning (CCL)☆12Feb 16, 2023Updated 3 years ago
- Code for our tutorial on Discrete Variational Autoencoders☆33May 19, 2025Updated last year
- One Discrete Word for Visual Reasoning Overtakes Agentic and Latent Methods☆133Jun 9, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [WACV 2024] Instruct Me More! Random Prompting for Visual In-Context Learning☆18May 7, 2025Updated last year
- PyTorch Implementation of "BOOTPLACE: Bootstrapped Object Placement with Detection Transformers", CVPR 2025☆28May 18, 2026Updated last month
- 一个面向中国学生(尤其受10043政策影响)的香港、澳门、新加坡等地区导师信息库。An open-source database of professors in HK/MO/SG/etc. for Chinese students (esp. those affected…☆57Nov 26, 2025Updated 7 months ago
- ☆63Apr 8, 2026Updated 2 months ago
- Official code for "Audio-Guided Attention Network for Weakly Supervised Violence Detection" (ICCECE2022).☆13Mar 25, 2022Updated 4 years ago
- ☆15Jun 17, 2026Updated last week
- WHUT Bachelor's Degree Thesis LaTeX Template Wuhan University of Technology Bachelor's Degree Thesis LaTeX Template 武汉理工大学本科生毕业设计(论…☆20Jul 2, 2023Updated 2 years ago
- [NeurIPS 2023] "Learning to Augment Distributions for Out-of-distribution Detection"☆11Nov 14, 2023Updated 2 years ago
- Collection of forcing related autoregressive video Gen☆98Mar 31, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for BYOP [CVPR 2023]☆12Sep 25, 2023Updated 2 years ago
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"☆22Dec 8, 2024Updated last year
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆42Jul 23, 2025Updated 11 months ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆16Feb 22, 2025Updated last year
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆56Feb 1, 2024Updated 2 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated 2 years ago
- Code for the Joint Part-of-Speech Embedding model☆14Feb 16, 2023Updated 3 years ago