A simple MLLM that surpasses QwenVL-Max using only open-source data, built on a 14B LLM.
☆38Sep 9, 2024Updated last year
Alternatives and similar repositories for Bumblebee
Users who are interested in Bumblebee are comparing it to the libraries listed below.
- LLaVA combined with the Magvit image tokenizer, training an MLLM without a vision encoder. Unifies image understanding and generation.☆39Jun 20, 2024Updated last year
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆20Oct 17, 2024Updated last year
- ☆13Nov 5, 2024Updated last year
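- University LaTeX thesis-defense templates; currently includes Sichuan University, Harbin Institute of Technology, and the University of Science and Technology of China.☆11Jul 22, 2024Updated last year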
- An Implementation of Deep Exhaustive Model for Nested NER☆15Jul 19, 2019Updated 6 years ago
- Collection of evaluation code for natural language generation.☆12Jan 6, 2021Updated 5 years ago
- Lion: Kindling Vision Intelligence within Large Language Models☆51Jan 25, 2024Updated 2 years ago
- ☆13Jun 10, 2025Updated last year
- Generates table images via browser rendering.☆238Apr 10, 2024Updated 2 years ago
- ☆20May 14, 2024Updated 2 years ago
- Applies DINOv2, the large pre-trained vision model from Meta AI, to feature point matching, with a ViT decoder used for an autoencoder☆18Apr 27, 2023Updated 3 years ago
- Free-form Description-guided 3D Visual Graph Networks for Object Grounding in Point Cloud☆17Jun 23, 2022Updated 3 years ago
- A demo app that shows you how to use Vue & the Typesense InstantSearch adapter, to build rich search interfaces.☆11Jan 23, 2024Updated 2 years ago
- Official implementation of ViT-VS: A visual servoing approach that leverages pretrained vision transformers for semantic feature extracti…☆33Dec 10, 2025Updated 6 months ago
- ☆14Apr 23, 2025Updated last year
- ✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models☆164Dec 26, 2024Updated last year
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆21Dec 14, 2025Updated 6 months ago
- The official PyTorch implementation of SEMv3.☆52May 26, 2024Updated 2 years ago
- GIAC 2019 (Global Internet Architecture Conference)☆14Feb 28, 2020Updated 6 years ago
- Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)☆171Jul 30, 2024Updated last year
- ☆62Jul 21, 2025Updated 10 months ago
- A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.☆256Apr 22, 2025Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- Data and codes for BioBERT-MRC☆11Oct 5, 2021Updated 4 years ago
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Mar 21, 2022Updated 4 years ago
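- This project aims to build a decision-support intelligent agent system that is reusable across multiple scenarios. By extracting key information from user input and matching it intelligently against historical data, the system can provide personalized decision recommendations in domains such as education pathways, legal consultation, financial investment, mental health, business operations, supply chain optimization, crisis response, and intelligent customer service. The system uses a unified decision-flow design and offers high…☆27Mar 7, 2026Updated 3 months ago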
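- Steps for survival analysis in Python, including survival analysis (stepwise and univariate), KM curves, decision curves, ROC curves, and a comparison of training and test sample distributions☆11Dec 21, 2020Updated 5 years ago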
- Using the Python Imaging Library (PIL, now Pillow) to generate colors and animate Moiré patterns.☆18Sep 9, 2025Updated 9 months ago
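- Zhishu (智枢), a multimodal intelligent platform for emergency response and disaster mitigation. Building on the leading disciplines of Harbin Institute of Technology, it deeply integrates multi-source heterogeneous data such as satellite remote sensing, industrial distribution, IoT sensing, and social media, and builds an agent cluster that includes flood, meteorological, earthquake, and wildfire models. It identifies disaster conditions precisely, quantifies damage, and supports disaster management, filling the gap in multi-agent catastrophe-model platforms in China☆35Aug 15, 2025Updated 9 months ago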
- [NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context☆174Sep 25, 2024Updated last year
- ☆41Feb 8, 2026Updated 4 months ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆13Apr 28, 2024Updated 2 years ago
- Graduation project: a cloud application platform that fuses smart-city and meteorological data, based on AI + GraphCast☆16May 5, 2024Updated 2 years ago
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding☆21Aug 23, 2025Updated 9 months ago
- Formal geometric problem solver based on FormalGeo.☆19Apr 18, 2024Updated 2 years ago
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 9 months ago
- ☆48Feb 7, 2025Updated last year
- A purer tokenizer with a higher compression rate☆488Nov 27, 2024Updated last year
- ☆21Feb 29, 2024Updated 2 years ago