A unified multimodal model toolkit
☆119May 18, 2026Updated 3 weeks ago
Alternatives and similar repositories for TorchUMM
Users that are interested in TorchUMM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."☆17May 23, 2025Updated last year
- ☆15Nov 18, 2025Updated 6 months ago
- Template for project development.☆14Jun 1, 2026Updated last week
- ☆15Feb 11, 2025Updated last year
- [CVPR 2026 Highlight] PersonaVLM: Long-Term Personalized Multimodal LLMs☆104Apr 16, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts"☆16Jul 4, 2022Updated 3 years ago
- Official PyTorch Implementation for Testing of TransZero++(TPAMI'22)☆11Aug 25, 2023Updated 2 years ago
- ☆15Jan 9, 2026Updated 4 months ago
- [ACL'25 Main] SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence! | 让你的LLM更好地利用上下文文档:一个基于注意力的简单方案☆29Feb 17, 2025Updated last year
- ☆54Sep 26, 2025Updated 8 months ago
- ☆55Apr 8, 2026Updated 2 months ago
- official PyTorch implementation of paper "Adversarial Bipartite Graph Learning for Video Domain Adaptation" (MM2020 Oral)☆11Jun 16, 2022Updated 3 years ago
- ICML-2024 highlight paper "Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization"☆19Jul 18, 2024Updated last year
- Official implementation for "CONVIQT: Contrastive Video Quality Estimator"☆25Jun 14, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Connectivity-contrastive learning (CCL)☆12Feb 16, 2023Updated 3 years ago
- Full model implementation for Flow Equivariant World Models (ICML 2026), world models with memory for dynamic scenes☆43May 21, 2026Updated 2 weeks ago
- 一个面向中国学生(尤其受10043政策影响)的香港、澳门、新加坡等地区导师信息库。An open-source database of professors in HK/MO/SG/etc. for Chinese students (esp. those affected…☆55Nov 26, 2025Updated 6 months ago
- Official code for "Audio-Guided Attention Network for Weakly Supervised Violence Detection" (ICCECE2022).☆13Mar 25, 2022Updated 4 years ago
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆16Aug 26, 2022Updated 3 years ago
- WHUT Bachelor's Degree Thesis LaTeX Template Wuhan University of Technology Bachelor's Degree Thesis LaTeX Template 武汉理工大学本科生毕业设计(论…☆20Jul 2, 2023Updated 2 years ago
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆41Jul 23, 2025Updated 10 months ago
- ICML2025☆65Aug 28, 2025Updated 9 months ago
- Code for the Joint Part-of-Speech Embedding model☆14Feb 16, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [AAAI 2026] Segment Anything Across Shots: A Method and Benchmark☆30Nov 16, 2025Updated 6 months ago
- ☆17Jun 17, 2021Updated 4 years ago
- Python library (C++ backend) for degree-preserving network randomization☆14Oct 14, 2019Updated 6 years ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆22Jul 20, 2024Updated last year
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆107Dec 9, 2024Updated last year
- the code is used to do face detection☆19Jul 19, 2017Updated 8 years ago
- ☆12Dec 10, 2019Updated 6 years ago
- Fast CUDA implementation of (differentiable) otam for PyTorch using Numba☆16Jun 21, 2021Updated 4 years ago
- Identify hierarchical community structure in networks using consensus clustering☆37Mar 13, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Graph Convolutional Network Hashing for Cross-Modal Retrieval, IJCAI2019☆13Mar 14, 2021Updated 5 years ago
- ☆18Jun 15, 2019Updated 6 years ago
- Gantry provides an API that streamlines running experiments in Beaker☆33Apr 8, 2026Updated last month
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 8 months ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆18Jul 21, 2024Updated last year
- The official repo for [AAAI 2024] "SimDistill: Simulated Multi-modal Distillation for BEV 3D Object Detection""☆43May 16, 2024Updated 2 years ago
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆23Feb 1, 2025Updated last year