Hon-Wong / ByteVideoLLMLinks
[ICCV 2025] Dynamic-VLM
☆26Updated last year
Alternatives and similar repositories for ByteVideoLLM
Users that are interested in ByteVideoLLM are comparing it to the libraries listed below
Sorting:
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆64Updated 5 months ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆20Updated last year
- [ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models