Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Model
☆47Jul 23, 2025Updated 10 months ago
Alternatives and similar repositories for Awesome-Large-Vision-Language-Model
Users that are interested in Awesome-Large-Vision-Language-Model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆16Oct 27, 2024Updated last year
- ☆13Sep 12, 2017Updated 8 years ago
- ☆17Oct 30, 2022Updated 3 years ago
- PyTorch CTC Decoder bindings☆14Nov 2, 2017Updated 8 years ago
- Awesome Incremental / Continual / Lifelong Generative Learning☆21Aug 7, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- tensorrt部署教程☆11Aug 1, 2025Updated 9 months ago
- Official code space for "SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development"☆60Oct 24, 2025Updated 7 months ago
- Awesome Mixture of Experts (MoE): A Curated List of Mixture of Experts (MoE) and Mixture of Multimodal Experts (MoME)☆62Oct 6, 2025Updated 7 months ago
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆23Jul 29, 2024Updated last year
- The official codebase of FineAction dataset. We will update the data and code of our FineAction.☆24Apr 10, 2025Updated last year
- ☆14Jun 22, 2025Updated 11 months ago
- [ECCV 2024] GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation☆19Oct 5, 2024Updated last year
- ☆15Aug 26, 2024Updated last year
- [ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model☆24Nov 1, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- PyTorch implementation of Graph Attention Networks☆21Sep 10, 2019Updated 6 years ago
- Official repository for "Safety in Large Reasoning Models: A Survey" - Exploring safety risks, attacks, and defenses for Large Reasoning …☆90Aug 25, 2025Updated 9 months ago
- [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models☆18Jan 11, 2026Updated 4 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- ☆13Oct 13, 2025Updated 7 months ago
- DALI datasets split used to train models presented in the paper Multilingual lyrics-to-audio alignment (ISMIR 2020).☆13May 25, 2021Updated 5 years ago
- this repo include paper review, code in face recognition☆17Feb 2, 2020Updated 6 years ago
- ☆13Oct 14, 2025Updated 7 months ago
- geolocation of Twitter users based on text and network information☆14Apr 16, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 😎 Awesome lists of papers and codes about Large Vision-Language Models☆13Apr 1, 2024Updated 2 years ago
- 批量删库,取消star☆12Jan 6, 2021Updated 5 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago
- This repository provides you the details of how speech recognition is done from end to end.☆25Apr 22, 2019Updated 7 years ago
- Source code for the paper "Memory-Efficient Fine-Tuning via Low-Rank Activation Compression"☆14Aug 1, 2025Updated 9 months ago
- An implementation of a HMM Ngram language model.☆11Mar 12, 2015Updated 11 years ago
- A comprehensive and efficient long-context model evaluation framework☆31Feb 25, 2026Updated 3 months ago
- solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning☆23Jan 19, 2026Updated 4 months ago
- ☆11Sep 16, 2025Updated 8 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This is the official implementation of "Hard-aware Instance Adaptive Self-training for Unsupervised Cross-domain Semantic Segmentation".☆15Mar 29, 2025Updated last year
- ☆18Apr 11, 2021Updated 5 years ago
- Text-based Geolocation Prediction of Social Media Users with Neural Networks