MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.
☆271 · Jun 18, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for MOSS-VL
Users who are interested in MOSS-VL are comparing it to the repositories listed below.
- [ICLR 2025] BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments ☆38 · Feb 17, 2025 · Updated last year
- A minimal, educational HEVC (H.265) encoder written in Python. ☆53 · Feb 23, 2026 · Updated 4 months ago
- Use the tokenizer in parallel to achieve superior acceleration ☆20 · Mar 21, 2024 · Updated 2 years ago
- Official code of "RoboOmni: Proactive Robot Manipulation in Omni-modal Context" ☆113 · Mar 28, 2026 · Updated 3 months ago
- Code for "Exponential Family Estimation via Adversarial Dynamics Embedding" (NeurIPS 2019) ☆14 · Nov 26, 2019 · Updated 6 years ago
- Summaries of ICML 2024 papers ☆12 · Jul 31, 2024 · Updated last year
- ☆163 · Mar 30, 2026 · Updated 3 months ago
- Chroma key (green screen removal) algorithms with Python ☆11 · Jul 14, 2024 · Updated last year
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning ☆155 · Jun 19, 2026 · Updated 2 weeks ago
- ☆14 · Nov 24, 2023 · Updated 2 years ago
- [COLING22] Text-to-Text Extraction and Verbalization of Biomedical Event Graphs ☆10 · Nov 5, 2022 · Updated 3 years ago
- [NeurIPS 2024] Can Language Models Learn to Skip Steps? ☆21 · Jan 25, 2025 · Updated last year
- ☆95 · Oct 21, 2025 · Updated 8 months ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models ☆23 · Jul 27, 2024 · Updated last year
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026] ☆90 · May 8, 2026 · Updated last month
- ☆50 · Jun 4, 2026 · Updated last month
- Explaining audio differences using language ☆16 · Feb 11, 2025 · Updated last year
- Music Language Model Generation, Optimization, and Practice ☆61 · Apr 20, 2026 · Updated 2 months ago
- Java interview summary ☆19 · May 11, 2020 · Updated 6 years ago
- OpenMMLab Detection Toolbox and Benchmark ☆11 · Aug 1, 2023 · Updated 2 years ago
- Repository for "Training Audio Captioning Models without Audio" ☆10 · Sep 26, 2023 · Updated 2 years ago
- Just prepare config file and start training your metric learning model with ease ☆16 · May 20, 2026 · Updated last month
- SimKO: Simple Pass@K Policy Optimization ☆31 · Oct 24, 2025 · Updated 8 months ago
- ☆10 · Sep 25, 2024 · Updated last year
- The official repo for the DanQing dataset. ☆36 · Mar 25, 2026 · Updated 3 months ago
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding] ☆16 · Jul 15, 2025 · Updated 11 months ago
- ☆12 · Aug 10, 2022 · Updated 3 years ago
- dataset ☆19 · Jul 20, 2023 · Updated 2 years ago
- The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module… ☆30 · Nov 18, 2025 · Updated 7 months ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆15 · Jun 23, 2024 · Updated 2 years ago
- Agent cockpit for the Qizhi platform (qz.sii.edu.cn): Skill + CLI, one command takes you straight there. Agent cockpit for the Inspire ML platform: one command, every operation, straight from… ☆153 · Updated this week
- Audio Entailment: Deductive Reasoning for Audio Understanding ☆17 · Dec 10, 2024 · Updated last year
- Biological Information Extraction from Large Language Models (LLMs) (Journal of Computational Biology 2025) ☆13 · Jun 18, 2025 · Updated last year
- ☆11 · Oct 16, 2023 · Updated 2 years ago
- [ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1… ☆18 · Jul 22, 2025 · Updated 11 months ago
- Official repository for the UAE paper, unified-GRPO, and unified-Bench ☆165 · Sep 12, 2025 · Updated 9 months ago
- ☆79 · May 4, 2025 · Updated last year
- [MM 2025] The official implementation code for "VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injectio… ☆38 · Apr 4, 2026 · Updated 3 months ago
- ☆13 · Jul 14, 2024 · Updated last year