Adding a Randeng translation model on top of the instructBLIP model to enable Chinese testing of instructBLIP functionality.
☆16May 30, 2023Updated 2 years ago
Alternatives and similar repositories for Chinese_InstructBLIP
Users that are interested in Chinese_InstructBLIP are comparing it to the libraries listed below
Sorting:
- Change Detection towards Bitemporal Quality Difference via Hierarchical Correlation Distillation☆10Apr 30, 2024Updated last year
- the implementation of the paper"Building Change Detection for Remote Sensing Images Using a Dual Task Constrained Deep Siamese Convolutio…☆40Nov 16, 2024Updated last year
- The implementation of the paper "Detecting Building Changes with Off-Nadir Aerial Images"☆75Feb 1, 2024Updated 2 years ago
- MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder☆51Aug 16, 2025Updated 6 months ago
- 李鲁鲁老师的 Copilot-Python 学习。和ChatGPT等大语言模型协同进化。☆10Jun 3, 2025Updated 9 months ago
- CCL2025中文语音关系三元组抽取任务(CSRTE)的评 测网站☆11Mar 6, 2025Updated last year
- ☆13Sep 7, 2023Updated 2 years ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- DeepEarth: AI Foundation Model for Planetary Science & Sustainability☆26Feb 26, 2026Updated last week
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆16Apr 3, 2025Updated 11 months ago
- ☆12Sep 19, 2021Updated 4 years ago
- Part-of-speech tagging using BERT☆10Nov 14, 2019Updated 6 years ago
- 😎 Awesome lists of papers and codes about Large Vision-Language Models☆13Apr 1, 2024Updated last year
- Facebook Hatebook Memes Challenge☆12Jan 28, 2021Updated 5 years ago
- image caption with semantic attention☆11Apr 1, 2017Updated 8 years ago
- Interactive, high-performance 3D visualization app. With Computer Vision in mind.☆13Mar 18, 2023Updated 2 years ago
- Codebase for the paper Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models☆13Oct 3, 2023Updated 2 years ago
- Chinese CLIP models with SOTA performance.☆60Aug 28, 2023Updated 2 years ago
- Code for our EMNLP 2023 paper - Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Mode…☆15May 5, 2024Updated last year
- ☆13Feb 17, 2023Updated 3 years ago
- https://aiisc.ai/defactify2/factify.html☆15Nov 27, 2023Updated 2 years ago
- ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation☆27May 27, 2025Updated 9 months ago
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence"☆12Apr 14, 2021Updated 4 years ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- Official implementation of "On the Effectiveness of Lipschitz-Driven Rehearsal in Continual Learning"☆15Oct 13, 2022Updated 3 years ago
- [NeurIPS2024] BoostAdapter: Improving Test-Time Adaptation via Regional Bootstrapping☆18Mar 13, 2025Updated 11 months ago
- Paper List on Earth Observation in the Foundation Model Era☆28Updated this week
- ☆14Sep 30, 2023Updated 2 years ago
- ☆16Mar 23, 2021Updated 4 years ago
- 非官方的科大讯飞语音合成(用于朗读,配音场景)python API (基于官方demo增加了:超过2000字上限自动分割再合并音频的功能)☆17Mar 21, 2024Updated last year
- 人工智能实验五:多模态情感分类☆16Jul 14, 2022Updated 3 years ago
- Run CLIP inference on the ImageNet dataset and use these inferences as labels to train other models and again evaluate the trained model …☆12Jun 21, 2021Updated 4 years ago
- Recent vision transformer-based domain adaptation papers☆15Mar 17, 2022Updated 3 years ago
- This repo provides the code for volumetric tsdf fusion for scannet dataset☆17Dec 12, 2019Updated 6 years ago
- Repo of NeurIPS23☆18Oct 25, 2023Updated 2 years ago
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)☆15Jan 2, 2023Updated 3 years ago
- This is the official repo for Contrastive Vision-Language Alignment Makes Efficient Instruction Learner.☆20Dec 1, 2023Updated 2 years ago
- ☆18Nov 12, 2024Updated last year
- Shaping Visual Representations with Language for Few-shot Classification, ACL 2020☆16May 9, 2021Updated 4 years ago