Adding a Randeng translation model on top of the instructBLIP model to enable Chinese testing of instructBLIP functionality.
☆16May 30, 2023Updated 2 years ago
Alternatives and similar repositories for Chinese_InstructBLIP
Users that are interested in Chinese_InstructBLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- the implementation of the paper"Building Change Detection for Remote Sensing Images Using a Dual Task Constrained Deep Siamese Convolutio…☆40Nov 16, 2024Updated last year
- The implementation of the paper "Detecting Building Changes with Off-Nadir Aerial Images"☆76Feb 1, 2024Updated 2 years ago
- Grouping and Recognize speaker from an animation video. 从动漫中提取每一个说话人。☆13May 8, 2024Updated last year
- New Modeling The Background CodeBase☆15Jan 7, 2022Updated 4 years ago
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆16Apr 3, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 骆驼大乱斗: Massive Game Content Generated by LLM☆19Oct 15, 2023Updated 2 years ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆116Mar 25, 2026Updated last month
- clip retrieval benchmark☆17May 4, 2022Updated 4 years ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- Code for our EMNLP 2023 paper - Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Mode…☆15May 5, 2024Updated 2 years ago
- OpenAI CLIP based image generator with complex config file controlled transformation and training pipelines☆19Jan 4, 2022Updated 4 years ago
- Managed L2D tool libs. (In Dev)☆14Apr 20, 2019Updated 7 years ago
- ☆12Sep 19, 2021Updated 4 years ago
- 人工智能实验五:多模态情感分类☆16Jul 14, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Dataset and code implementation for the paper "Decoding the Underlying Meaning of Multimodal Hateful Memes" (IJCAI'23).☆20Jun 15, 2023Updated 2 years ago
- A challenge on Semi-Supervised and Reinforced Task-Oriented Dialog Systems, Co-located with EMNLP2022 SereTOD Workshop☆26Oct 1, 2022Updated 3 years ago
- 基于langchain设计的智能体任务,包含规划会话场景资源,构建子任务,任务执行器包含(MCTS)☆33Nov 10, 2025Updated 5 months ago
- image caption with semantic attention☆11Apr 1, 2017Updated 9 years ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆20Jul 24, 2023Updated 2 years ago
- ☆95Feb 2, 2026Updated 3 months ago
- ✅4g GPU可用 | 简易实现ChatGLM单机调用多个计算设备(GPU、CPU)进行推理☆34Apr 20, 2023Updated 3 years ago
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆40Nov 5, 2023Updated 2 years ago
- Systematic generalization test for CLEVR☆15Mar 11, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Codebase for the paper Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models☆13Oct 3, 2023Updated 2 years ago
- ☆14Feb 17, 2023Updated 3 years ago
- [ACM MM 2023] QA-CLIMS: Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation☆13Jun 14, 2024Updated last year
- Interactive, high-performance 3D visualization app. With Computer Vision in mind.☆14Mar 18, 2023Updated 3 years ago
- Words and their images in 98 languages☆14Mar 1, 2019Updated 7 years ago
- Large Multimodal Model☆15Apr 8, 2024Updated 2 years ago
- 😎 Awesome lists of papers and codes about Large Vision-Language Models☆13Apr 1, 2024Updated 2 years ago
- Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition☆25Apr 24, 2024Updated 2 years ago
- Pytorch implementation of SCAN: Learning Hierarchical Compositional Visual Concepts, Higgins et al., ICLR 2018☆11Oct 10, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Shaping Visual Representations with Language for Few-shot Classification, ACL 2020☆16May 9, 2021Updated 4 years ago
- ☆17Feb 22, 2024Updated 2 years ago
- 📄 ACL 2024: RGCL, Retrieval-Guided Contrastive Learning for Hateful Meme Detection 📄 EMNLP 2025 (Oral): RA-HMD, Robust Adaptation of La…☆37Mar 1, 2026Updated 2 months ago
- Recent vision transformer-based domain adaptation papers☆15Mar 17, 2022Updated 4 years ago
- Official implementation and checkpoints of GeoLink remote sensing foundation model in NeurIPS2025.☆59Oct 6, 2025Updated 7 months ago
- ☆15Sep 30, 2023Updated 2 years ago
- ☆26Apr 5, 2024Updated 2 years ago