yangjianxin1 / OFA-ChineseView external linksLinks
transformers结构的中文OFA模型
☆139Feb 13, 2023Updated 3 years ago
Alternatives and similar repositories for OFA-Chinese
Users that are interested in OFA-Chinese are comparing it to the libraries listed below
Sorting:
- 中文CLIP预训练模型☆423Dec 5, 2022Updated 3 years ago
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…☆2,555Apr 24, 2024Updated last year
- SCRFD face detection based on MNN inference framework☆17Sep 22, 2021Updated 4 years ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆190Nov 17, 2023Updated 2 years ago
- DPNet: Dual-Path Network for Real-time Object Detection with Lightweight Attention☆21May 18, 2022Updated 3 years ago
- VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)☆194Mar 13, 2023Updated 2 years ago
- 基于ClipCap的看图说话Image Caption模型☆321Apr 1, 2022Updated 3 years ago
- Hopenet: deep head pose estimator on ncnn☆10Jun 18, 2020Updated 5 years ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101May 17, 2024Updated last year
- 基于baichuan-7b的开源多模态大语言模型☆72Dec 7, 2023Updated 2 years ago
- ☆13Apr 2, 2024Updated last year
- Finetune Bloom big language model with Lora method☆32Jun 9, 2023Updated 2 years ago
- 支持中英文双语视觉-文本对话的开源可商用多模态模型。☆378Sep 23, 2023Updated 2 years ago
- Karras et al. (2022) diffusion models for PyTorch☆17Oct 5, 2023Updated 2 years ago
- Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval And Synthesis For SLMs☆54Oct 7, 2025Updated 4 months ago
- OmniSyn: Synthesizing 360 Videos with Wide-baseline Panoramas (VRW 2022)☆14Mar 5, 2024Updated last year
- ☆13Dec 23, 2019Updated 6 years ago
- ☆32Aug 26, 2025Updated 5 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Jan 29, 2024Updated 2 years ago
- Optimized pose detector inference for edge devices☆15Feb 23, 2023Updated 2 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆5,795Aug 29, 2025Updated 5 months ago
- ☆88Jul 4, 2024Updated last year
- Lion: Kindling Vision Intelligence within Large Language Models☆51Jan 25, 2024Updated 2 years ago
- Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。☆4,148Aug 13, 2024Updated last year
- PromptCLUE, 全中文任务支持零样本学习模型☆665Jun 16, 2023Updated 2 years ago
- 🌟Official code of our AAAI26 paper 🔍WebFilter☆36Nov 9, 2025Updated 3 months ago
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆98Oct 29, 2022Updated 3 years ago
- A unified tokenization tool for Images, Chinese and English.☆153Mar 23, 2023Updated 2 years ago
- something like visual-chatgpt, 文心一言的开源版☆1,199Feb 24, 2024Updated last year
- ☆17Feb 19, 2024Updated last year
- ☆13Jun 3, 2020Updated 5 years ago
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 2 years ago
- ☆16Jan 30, 2020Updated 6 years ago
- ☆17May 24, 2022Updated 3 years ago
- ☆19May 11, 2024Updated last year
- Concise, Modular, Human-friendly PyTorch implementation of MixNet with Pre-trained Weights.☆19Mar 24, 2020Updated 5 years ago
- The code in "SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design"☆42Oct 20, 2025Updated 3 months ago
- [CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding☆36Jul 22, 2025Updated 6 months ago
- chatglm 6b finetuning and alpaca finetuning☆1,537Mar 9, 2025Updated 11 months ago