Our 2nd-gen LMM
☆34May 22, 2024Updated last year
Alternatives and similar repositories for 360VL
Users that are interested in 360VL are comparing it to the libraries listed below
Sorting:
- LMM solved catastrophic forgetting, AAAI2025☆46Apr 15, 2025Updated 11 months ago
- ☆14Jul 5, 2024Updated last year
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101May 17, 2024Updated last year
- Code for “Adversarial Learning with Local Coordinate Coding”☆16Dec 17, 2019Updated 6 years ago
- [WWW2024 Oral] Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering☆15Apr 22, 2025Updated 11 months ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- CVPR2023: AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning☆18May 19, 2023Updated 2 years ago
- A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid …☆23Mar 9, 2026Updated last week
- 本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。☆28Feb 26, 2024Updated 2 years ago
- ☆15Jun 20, 2024Updated last year
- ☆30Aug 21, 2025Updated 7 months ago
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model☆22Oct 25, 2023Updated 2 years ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated 10 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆42Jan 9, 2026Updated 2 months ago
- DeepTrace: A lightweight, scalable real-time diagnostic and analysis tool for distributed training tasks.☆18Nov 4, 2025Updated 4 months ago
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated 2 years ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated last year
- ☆168Nov 9, 2023Updated 2 years ago
- GLM Series Edge Models☆160Jun 12, 2025Updated 9 months ago
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆34Sep 25, 2025Updated 5 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 5 months ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version☆11Apr 18, 2021Updated 4 years ago
- ☆22Oct 21, 2024Updated last year
- ☆190Mar 13, 2026Updated last week
- ☆20Jan 6, 2023Updated 3 years ago
- A pre-trained face parser based on SegNeXt☆50May 16, 2023Updated 2 years ago
- ☆28Oct 21, 2025Updated 5 months ago
- ☆19Dec 28, 2020Updated 5 years ago
- [ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation☆24Oct 2, 2024Updated last year
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆24Oct 22, 2025Updated 4 months ago
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- Chinese CLIP models with SOTA performance.☆60Aug 28, 2023Updated 2 years ago
- There are some python scripts processing dataset, inferencing etc. wrote when I am using OpenMMLab.☆18Feb 13, 2023Updated 3 years ago
- Pytorch implementation of WGAN with gradient penalty (WGAN-GP),☆12Feb 7, 2022Updated 4 years ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆82Jul 4, 2024Updated last year
- 2nd place solution to Google Universal Image Embedding Challenge!☆43Nov 11, 2022Updated 3 years ago
- New generation of CLIP with strong fine grained discrimination capability, ICML2025☆559Oct 27, 2025Updated 4 months ago