iOPENCap / awesome-unimodal-trainingView external linksLinks
text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)
☆12Oct 15, 2024Updated last year
Alternatives and similar repositories for awesome-unimodal-training
Users that are interested in awesome-unimodal-training are comparing it to the libraries listed below
Sorting:
- ☆11Oct 2, 2024Updated last year
- Extract features and bounding boxes using the original Bottom-up Attention Faster-RCNN in a few lines of Python code☆11Sep 18, 2022Updated 3 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆29Sep 27, 2024Updated last year
- ☆35Feb 15, 2024Updated 2 years ago
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 5 months ago
- Concurrency library☆16Oct 13, 2024Updated last year
- ☆11Dec 23, 2024Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Develop C++/CUDA extensions with PyTorch like Python scripts☆10Jan 7, 2026Updated last month
- An active inference model of Lacanian psychoanalysis☆15Jun 7, 2025Updated 8 months ago
- ☆10Apr 7, 2024Updated last year
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆13Feb 21, 2025Updated 11 months ago
- Python Inference Script(PyIS)☆19Aug 30, 2022Updated 3 years ago
- Models for packages and the resources they contain.☆14Mar 10, 2024Updated last year
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago
- ☆18Aug 16, 2025Updated 6 months ago
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- CANdle - a library for using USB-FDCAN dongle and communicating with md80 drives☆14Sep 15, 2025Updated 5 months ago
- A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)☆11Aug 11, 2025Updated 6 months ago
- Official Implementation of "The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thou…☆14Jul 2, 2025Updated 7 months ago
- ☆11Jan 19, 2025Updated last year
- Interactive, GPU accelerated computation graphs☆12Nov 21, 2024Updated last year
- The source code for "UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All"☆49Apr 4, 2024Updated last year
- Smallest ellipse covering a finite set of points☆14Jan 3, 2025Updated last year
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- ☆12Jun 11, 2024Updated last year
- R package for metabolic enzyme enrichment anaylsis☆13Oct 24, 2025Updated 3 months ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆14Nov 25, 2024Updated last year
- SketchINR: A First Look into Sketches as Implicit Neural Representations [CVPR 2024]☆12Aug 19, 2024Updated last year
- 3D geoms for plotnine (grammar of graphics in Python)☆12Aug 5, 2022Updated 3 years ago
- Artifact for TOSEM Submission: GiantRepair☆12Jun 26, 2024Updated last year
- Tascell: Backtrcking-based load balancing framework☆13Jan 1, 2026Updated last month
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…☆14Oct 26, 2025Updated 3 months ago
- This repo is for CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering.☆14Mar 6, 2024Updated last year
- [GSI 2023] Learning Lagrangian Fluid Mechanics with E(3)-Equivariant GNNs☆15Jun 3, 2024Updated last year
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- [IROS2025]Adjacent-view Transformers for Supervised Surround-view Depth Estimation☆14Nov 14, 2025Updated 3 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year