360CVGroup/360VL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/360CVGroup/360VL)

360CVGroup / 360VL

Our 2nd-gen LMM

☆34

Alternatives and similar repositories for 360VL

Users that are interested in 360VL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

360CVGroup / Inner-Adaptor-Architecture
View on GitHub
LMM solved catastrophic forgetting, AAAI2025
☆45Apr 15, 2025Updated last year
lucasjinreal / LLaVA-Magvit2
View on GitHub
LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.
☆38Jun 20, 2024Updated 2 years ago
360CVGroup / RzenEmbed
View on GitHub
Embedding model prioritized towards Multimodal RAG, overall + VisDoc double top1 on MMEB benchmark
☆36Jun 16, 2026Updated last month
iszry / DI2N-PTQ4DM
View on GitHub
Improved the performance of 8-bit PTQ4DM expecially on FID.
☆11Aug 30, 2023Updated 2 years ago
360CVGroup / SEEChat
View on GitHub
Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM
☆101May 17, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ezosa / M3L-topic-model
View on GitHub
Multimodal and multilingual topic model with pretrained embeddings
☆12Apr 11, 2023Updated 3 years ago
EthanLeo-LYX / LLMQA
View on GitHub
[WWW2024 Oral] Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering
☆15Apr 22, 2025Updated last year
jefferyZhan / GThinker
View on GitHub
[CVPR 2026] GThinker, Reasoning MLLM, Visual Cues, Visual Rethinking
☆18Mar 9, 2026Updated 4 months ago
ant-research / M2-Miner
View on GitHub
[ICLR 2026] M2-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining
☆55Apr 22, 2026Updated 2 months ago
bobxwu / CNPMI
View on GitHub
Cross-lingual Normalized Pointwise Mutual Information for cross-lingual topic evaluation.
☆15Sep 6, 2022Updated 3 years ago
DMU-ITREC / CSRTE-CCL2025
View on GitHub
CCL2025中文语音关系三元组抽取任务（CSRTE）的评测网站
☆10Mar 6, 2025Updated last year
franciscoliu / SKU
View on GitHub
Official code implementation of SKU, Accepted by ACL 2024 Findings
☆20Dec 18, 2024Updated last year
OpenGVLab / OmniCorpus
View on GitHub
[ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
☆425May 5, 2025Updated last year
Alsace08 / OOD-Math-Reasoning
View on GitHub
[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"
☆28May 28, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
VITA-Group / triple-wins
View on GitHub
[ICLR 2020] ”Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference“
☆24Dec 30, 2021Updated 4 years ago
percent4 / multi-modal-image-search
View on GitHub
本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。
☆28Feb 26, 2024Updated 2 years ago
will-singularity / Skywork-MM
View on GitHub
Empirical Study Towards Building An Effective Multi-Modal Large Language Model
☆22Oct 25, 2023Updated 2 years ago
t6am3 / law_glm_baseline
View on GitHub
☆15Jun 20, 2024Updated 2 years ago
Kwai-YuanQi / MM-RLHF
View on GitHub
The Next Step Forward in Multimodal LLM Alignment
☆198May 1, 2025Updated last year
vanity1129 / AttriCLIP
View on GitHub
CVPR2023: AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning
☆18May 19, 2023Updated 3 years ago
john-hewitt / implicit-ins
View on GitHub
Codebase for Instruction Following without Instruction Tuning
☆36Sep 24, 2024Updated last year
RWKV / ZeroCoT
View on GitHub
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28May 4, 2025Updated last year
princeton-nlp / ELIZA-Transformer
View on GitHub
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆23Feb 9, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
360CVGroup / LMM-Det
View on GitHub
Make Large Multimodal Models excel in object detection, ICCV 2025
☆65Aug 1, 2025Updated 11 months ago
zai-org / GLM-Edge
View on GitHub
GLM Series Edge Models
☆163Jun 12, 2025Updated last year
language-agent-tutorial / language-agent-tutorial.github.io
View on GitHub
[EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks
☆10Nov 27, 2024Updated last year
FrankYang-17 / Mavors
View on GitHub
☆16May 30, 2025Updated last year
yuxie11 / R2D2
View on GitHub
☆170Nov 9, 2023Updated 2 years ago
DeepLink-org / DeepTrace
View on GitHub
DeepTrace: A lightweight, scalable real-time diagnostic and analysis tool for distributed training tasks.
☆18Nov 4, 2025Updated 8 months ago
ltttpku / CMMP
View on GitHub
☆23Oct 21, 2024Updated last year
heng840 / AMIG
View on GitHub
Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…
☆28Mar 19, 2024Updated 2 years ago
WePOINTS / WePOINTS
View on GitHub
☆189Mar 13, 2026Updated 4 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
mindspore-ai / zidongtaichu
View on GitHub
☆22Jan 6, 2023Updated 3 years ago
Nihukat / Concept-Conductor
View on GitHub
☆17Feb 21, 2025Updated last year
e4s2022 / SegNeXt-FaceParser
View on GitHub
A pre-trained face parser based on SegNeXt
☆51May 16, 2023Updated 3 years ago
TencentARC-QQ / QA-CLIP
View on GitHub
Chinese CLIP models with SOTA performance.
☆63Aug 28, 2023Updated 2 years ago
eric-haibin-lin / verl-data
View on GitHub
☆14May 12, 2025Updated last year
richjjj / cuvid-tensorrt-multi
View on GitHub
ffmpeg+cuvid+tensorrt+multicamera
☆12Dec 31, 2024Updated last year
tregtyu78 / Real-time-Object-Detection-Flask-OpenCV-YoloV3
View on GitHub
Web application for real-time object detection 🔎 using Flask 🌶, OpenCV, and YoloV3 weights. It uses the COCO Dataset 🖼.
☆16Apr 19, 2021Updated 5 years ago