Adding a Randeng translation model on top of the instructBLIP model to enable Chinese testing of instructBLIP functionality.
☆16May 30, 2023Updated 2 years ago
Alternatives and similar repositories for Chinese_InstructBLIP
Users that are interested in Chinese_InstructBLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Change Detection towards Bitemporal Quality Difference via Hierarchical Correlation Distillation☆10Apr 30, 2024Updated last year
- the implementation of the paper"Building Change Detection for Remote Sensing Images Using a Dual Task Constrained Deep Siamese Convolutio…☆40Nov 16, 2024Updated last year
- The implementation of the paper "Detecting Building Changes with Off-Nadir Aerial Images"☆76Feb 1, 2024Updated 2 years ago
- New Modeling The Background CodeBase☆15Jan 7, 2022Updated 4 years ago
- CCL2025中文语音关系三元组抽取任务(CSRTE)的评测网站☆11Mar 6, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder☆51Aug 16, 2025Updated 8 months ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆113Mar 25, 2026Updated 3 weeks ago
- DeepEarth: AI Foundation Model for Planetary Science & Sustainability☆28Apr 9, 2026Updated last week
- clip retrieval benchmark☆17May 4, 2022Updated 3 years ago
- (TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation☆22Aug 8, 2024Updated last year
- This repo provides the code for volumetric tsdf fusion for scannet dataset☆17Dec 12, 2019Updated 6 years ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- ☆14Sep 7, 2023Updated 2 years ago
- Code for our EMNLP 2023 paper - Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Mode…☆15May 5, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- TALLO database☆39Oct 24, 2022Updated 3 years ago
- OpenAI CLIP based image generator with complex config file controlled transformation and training pipelines☆19Jan 4, 2022Updated 4 years ago
- Managed L2D tool libs. (In Dev)☆13Apr 20, 2019Updated 6 years ago
- ☆12Sep 19, 2021Updated 4 years ago
- https://aiisc.ai/defactify2/factify.html☆15Nov 27, 2023Updated 2 years ago
- A challenge on Semi-Supervised and Reinforced Task-Oriented Dialog Systems, Co-located with EMNLP2022 SereTOD Workshop☆26Oct 1, 2022Updated 3 years ago
- Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023.☆21Jan 2, 2024Updated 2 years ago
- Run CLIP inference on the ImageNet dataset and use these inferences as labels to train other models and again evaluate the trained model …☆12Jun 21, 2021Updated 4 years ago
- ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation☆28May 27, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆17May 16, 2022Updated 3 years ago
- Chinese CLIP models with SOTA performance.☆60Aug 28, 2023Updated 2 years ago
- ☆13Feb 17, 2023Updated 3 years ago
- Codebase for the paper Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models☆13Oct 3, 2023Updated 2 years ago
- Data backup to several cloud storage in security.☆23Jun 4, 2012Updated 13 years ago
- Words and their images in 98 languages☆14Mar 1, 2019Updated 7 years ago
- ☆12Mar 13, 2020Updated 6 years ago
- Large-Scale High-quality Chinese Web Text with Multi-dimensional and fine-grained information☆38Dec 2, 2024Updated last year
- Large Multimodal Model☆15Apr 8, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official Code for the WWW'24 Paper: "Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models"☆25Apr 16, 2025Updated last year
- 😎 Awesome lists of papers and codes about Large Vision-Language Models☆13Apr 1, 2024Updated 2 years ago
- Paper List on Earth Observation in the Foundation Model Era☆30Updated this week
- 仇恨言论语料库☆28Jun 12, 2023Updated 2 years ago
- Pytorch implementation of SCAN: Learning Hierarchical Compositional Visual Concepts, Higgins et al., ICLR 2018☆11Oct 10, 2018Updated 7 years ago
- Shaping Visual Representations with Language for Few-shot Classification, ACL 2020☆16May 9, 2021Updated 4 years ago
- ☆17Feb 22, 2024Updated 2 years ago