A real-time video understanding foundation model built on Llama-3.2-Vision, featuring comprehensively extended video processing and multimodal reasoning capabilities.
☆138Apr 13, 2026Updated 2 weeks ago
Alternatives and similar repositories for MOSS-Video-Preview
Users who are interested in MOSS-Video-Preview are comparing it to the libraries listed below.
- National Taiwan University: Machine Learning, Hung-yi Lee (李宏毅)☆13May 6, 2023Updated 2 years ago
- Skills and tools for automatically writing and optimizing CUDA kernels☆83Apr 24, 2026Updated last week
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated last year
- This project converts Anthropic's @anthropic-ai/sdk into an OpenAI-style API interface, providing seamless compatibility for Claude-Code …☆35Sep 4, 2025Updated 7 months ago
- ☆16Dec 24, 2024Updated last year
- MADRL project solving chess environment using PPO with two different methods: 2 agents/networks and a single agent/network.☆21Apr 1, 2023Updated 3 years ago
- Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM | EMNLP 2025 Findings☆18Oct 17, 2025Updated 6 months ago
- Reproduction of the paper "Pixel-Anchor: A Fast Oriented Scene Text Detector with Combined Networks"☆26Nov 26, 2018Updated 7 years ago
- InstAttention: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference☆17Mar 30, 2025Updated last year
- Introduction to Data Science and Engineering - 2023 Autumn☆26Jan 2, 2024Updated 2 years ago
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted…☆24Jun 13, 2025Updated 10 months ago
- Paper Reading of IMCC groups.☆17Oct 22, 2025Updated 6 months ago
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering☆18Oct 31, 2024Updated last year
- ☆17Apr 15, 2025Updated last year
- [NeurIPS 2024] Continuous Temporal Domain Generalization☆53Mar 10, 2025Updated last year
- [CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering☆22Sep 21, 2024Updated last year
- ☆42Oct 11, 2025Updated 6 months ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆24Jan 6, 2026Updated 3 months ago
- Official implementation of "TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization" (Findings of ACL …☆21Jul 25, 2025Updated 9 months ago
- Adapted from THU-Beamer-Theme (https://github.com/Trinkle23897/THU-Beamer-Theme) with assorted cuts and tweaks☆37May 27, 2022Updated 3 years ago
- ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression (DAC'25)☆27Feb 26, 2026Updated 2 months ago
- SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models☆231Updated this week
- A management system for recruitment of Unique Studio☆14Sep 18, 2022Updated 3 years ago
- [NeurIPS 2025] 𝓡𝓣𝓥-𝓑𝓮𝓷𝓬𝓱: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video.☆33Jan 15, 2026Updated 3 months ago
- InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models☆105Apr 20, 2026Updated last week
- First-place solution for multimodal fake news detection☆42Oct 25, 2019Updated 6 years ago
- [ICLR2025] Code and data for paper: Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasonin…☆42Mar 10, 2025Updated last year
- A practical astrodynamics for research and engineering applications☆72Updated this week
- This is a drop-in Keras layer for ELMo embeddings.☆47Dec 29, 2018Updated 7 years ago
- Sixth-place code for the 2019 Daguan Cup (达观杯)☆44Feb 15, 2023Updated 3 years ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆37Jul 11, 2024Updated last year
- Recurrent Convolutional Neural Networks in Keras☆50Mar 18, 2018Updated 8 years ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆50Jan 8, 2025Updated last year
- ☆71Sep 21, 2021Updated 4 years ago
- Automatically update arXiv papers about SOT & VLT, Multi-modal Learning, LLM and Video Understanding using Github Actions.☆44Updated this week
- Baseline for the internet news sentiment analysis competition task☆42Sep 18, 2019Updated 6 years ago
- ☆41Oct 16, 2025Updated 6 months ago
- Scene Text Detection with Learned Anchor☆67Jan 10, 2020Updated 6 years ago
- A cross-platform high-performance provably-safe sandboxing Wasm-to-native compiler☆44Feb 23, 2026Updated 2 months ago