Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Model
☆42 · Jul 23, 2025 · Updated 7 months ago
Alternatives and similar repositories for Awesome-Large-Vision-Language-Model
Users that are interested in Awesome-Large-Vision-Language-Model are comparing it to the libraries listed below
- ☆13 · Sep 12, 2017 · Updated 8 years ago
- ☆17 · Oct 30, 2022 · Updated 3 years ago
- Transformer Encoder with Char information for text classification ☆15 · Jan 17, 2020 · Updated 6 years ago
- PyTorch CTC Decoder bindings ☆14 · Nov 2, 2017 · Updated 8 years ago
- This repo includes paper reviews and code for face recognition ☆17 · Feb 2, 2020 · Updated 6 years ago
- PyTorch implementation of Graph Attention Networks ☆21 · Sep 10, 2019 · Updated 6 years ago
- This repository provides you the details of how speech recognition is done from end to end. ☆25 · Apr 22, 2019 · Updated 6 years ago
- Source code for the paper "Memory-Efficient Fine-Tuning via Low-Rank Activation Compression" ☆13 · Aug 1, 2025 · Updated 7 months ago
- Software library RLCM (recursively low-rank compressed matrices) ☆14 · Apr 15, 2021 · Updated 4 years ago
- Collect and filter location information from social network services. ☆11 · Jun 14, 2020 · Updated 5 years ago
- Jupyter notebook templates for processing and analyzing neuroscience data. ☆14 · Updated this week
- Data Programming for Text Detection in Documents using SPEAR ☆12 · Mar 26, 2025 · Updated 11 months ago
- PyTorch version of IEEE Transactions on Multimedia 2019: "Naturalness-Aware Deep No-Reference Image Quality Assessment." ☆12 · Jun 30, 2020 · Updated 5 years ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning ☆14 · Jul 9, 2025 · Updated 8 months ago
- Implementation of various handwritten text line segmentation ☆10 · Jan 6, 2020 · Updated 6 years ago
- ☆12 · Oct 4, 2021 · Updated 4 years ago
- A graph based image processing and generation tool. ☆14 · Nov 18, 2025 · Updated 3 months ago
- Fully automatic skin lesion segmentation using the Berkeley wavelet transform and UNet algorithm. ☆12 · Jun 1, 2021 · Updated 4 years ago
- TensorRT In Docker ☆11 · Dec 7, 2024 · Updated last year
- TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation ☆12 · Jul 14, 2022 · Updated 3 years ago
- ☆11 · Sep 16, 2025 · Updated 5 months ago
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition ☆10 · Mar 19, 2025 · Updated 11 months ago
- Code for Neural Style Transfer. ☆12 · Sep 10, 2020 · Updated 5 years ago
- CUDA Tensor Transpose (cuTT) library ☆10 · Sep 24, 2021 · Updated 4 years ago
- SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition ☆10 · Apr 8, 2024 · Updated last year
- Denoising method based on Deep Image Prior and Neural Image Assessment ☆10 · Mar 10, 2020 · Updated 6 years ago
- Git Deployment with PHP ☆35 · May 16, 2014 · Updated 11 years ago
- View recent highly-rated albums in the terminal ☆11 · Mar 20, 2023 · Updated 2 years ago
- Mathematical modelling of Magic the Gathering ☆10 · Aug 9, 2021 · Updated 4 years ago
- This is a task on Chinese chat title NER via a BERT-BiLSTM-CRF model. ☆13 · Dec 15, 2020 · Updated 5 years ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings ☆11 · Feb 24, 2025 · Updated last year
- Implementation of the DocLLM paper for Llama models. ☆13 · Apr 6, 2025 · Updated 11 months ago
- Heterogeneous graph attention network for SMEs bankruptcy prediction ☆12 · Feb 26, 2021 · Updated 5 years ago
- ☆10 · May 24, 2020 · Updated 5 years ago
- Official GitHub repository of the lecture "Multimodal Deep Learning for Recommendation", at the 2024 ACM RecSys Summer School ☆12 · Oct 12, 2024 · Updated last year
- [NeurIPS2022] Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop ☆14 · Apr 13, 2023 · Updated 2 years ago
- ☆12 · Dec 15, 2022 · Updated 3 years ago
- STRExp is a framework that provides Explainability (XAI) to Scene Text Recognition (STR) models. ☆11 · Nov 27, 2023 · Updated 2 years ago
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)" ☆10 · May 15, 2024 · Updated last year