A simple MLLM that surpasses QwenVL-Max using only open-source data, built on a 14B LLM.
☆38Sep 9, 2024Updated last year
Alternatives and similar repositories for Bumblebee
Users that are interested in Bumblebee are comparing it to the libraries listed below.
- LLaVA combined with the Magvit image tokenizer, training an MLLM without a vision encoder. Unifies image understanding and generation.☆38Jun 20, 2024Updated 2 years ago
- imagetokenizer is a Python package that helps you encode visuals and generate visual token IDs from a codebook; supports both image and video…☆40Jun 22, 2024Updated 2 years ago
- Rust standalone inference of Namo-500M series models. Extremely tiny, running VLMs on CPU.☆24Mar 12, 2025Updated last year
- Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"☆11Aug 10, 2023Updated 2 years ago
- Collection of evaluation code for natural language generation.☆12Jan 6, 2021Updated 5 years ago
- Lion: Kindling Vision Intelligence within Large Language Models☆51Jan 25, 2024Updated 2 years ago
- ☆13Jun 10, 2025Updated last year
- Generate table images via browser rendering☆238Apr 10, 2024Updated 2 years ago
- ✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models☆163Dec 26, 2024Updated last year
- [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"☆154Sep 10, 2024Updated last year
- Third-party toolkit for Rope3D dataset☆13Jun 13, 2022Updated 4 years ago
- ☆62Jul 21, 2025Updated 11 months ago
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Mar 21, 2022Updated 4 years ago
- Unsupervised Anomaly Detection via Deep Metric Learning with End-to-End Optimization☆12Mar 23, 2023Updated 3 years ago
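- Vulgar image detection implemented in TensorFlow☆14Apr 26, 2019Updated 7 years ago
- This project aims to build a reusable decision-support intelligent Agent system for multiple scenarios. By extracting key information from user input and intelligently matching it against historical data, the system can provide personalized decision recommendations in domains such as education pathways, legal consulting, financial investment, mental health, business operations, supply chain optimization, crisis response, and intelligent customer service. The system adopts a unified decision-flow design and offers high…☆27Mar 7, 2026Updated 3 months ago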
- ☆13Mar 16, 2021Updated 5 years ago
- PyTorch dataset for large-scale data loading☆13May 30, 2022Updated 4 years ago
- [NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context☆174Sep 25, 2024Updated last year
- ☆44Feb 8, 2026Updated 4 months ago
- ☆48Feb 7, 2025Updated last year
- A purer tokenizer with a higher compression rate☆489Nov 27, 2024Updated last year
- ☆21Feb 29, 2024Updated 2 years ago
- ☆10Apr 8, 2022Updated 4 years ago
- ☆23Jan 8, 2024Updated 2 years ago
- Composed Video Retrieval☆62May 2, 2024Updated 2 years ago
- The official code for "OG-HFYOLO: Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…☆13Jul 28, 2025Updated 11 months ago
- [TCSVT 2026] Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception☆30May 24, 2026Updated last month
- paper: https://arxiv.org/abs/2307.02469 page: https://lynx-llm.github.io/☆272Aug 9, 2023Updated 2 years ago
- Implementation of the Spatio-Temporal Hierarchical Matching Pursuit (ST-HMP) descriptor presented in the paper: M. Madry, L. Bo, D. Kragi…☆14Aug 4, 2014Updated 11 years ago
- A dead-simple, modularized multi-modal training and fine-tuning framework. Compatible with any LLaVA/Flamingo/QwenVL/MiniGemini etc. series …☆19Apr 24, 2024Updated 2 years ago
- Multilingual and Multiculture Benchmark and LLM☆42May 18, 2026Updated last month
- A collection of visual instruction tuning datasets.☆77Mar 14, 2024Updated 2 years ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models☆79Jul 13, 2024Updated last year
- 🛩️ Soothing pastel theme for File Pilot☆22Jun 20, 2026Updated 2 weeks ago
- The official implementation of CTRNet++.☆15Dec 30, 2024Updated last year
- ☆30May 13, 2024Updated 2 years ago
- ☆70Jun 26, 2024Updated 2 years ago
- ☆13Sep 25, 2020Updated 5 years ago