☆18Jul 10, 2024Updated last year
Alternatives and similar repositories for VITATECS
Users that are interested in VITATECS are comparing it to the libraries listed below
Sorting:
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- ☆16Apr 11, 2022Updated 3 years ago
- Evaluation of generated videos on the FETV benchmark☆10Apr 6, 2025Updated 11 months ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- [Findings of ACL 2023] Communication Efficient Federated Learning for Multilingual Machine Translation with Adapter☆12Sep 4, 2023Updated 2 years ago
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆16Sep 27, 2023Updated 2 years ago
- Code for CascadeBERT, Findings of EMNLP 2021☆12Mar 30, 2022Updated 3 years ago
- Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022☆14Mar 31, 2023Updated 2 years ago
- Data repository for the VALSE benchmark.☆37Feb 15, 2024Updated 2 years ago
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"☆41Aug 9, 2022Updated 3 years ago
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.☆71Aug 8, 2025Updated 6 months ago
- Visual and Embodied Concepts evaluation benchmark☆21Oct 10, 2023Updated 2 years ago
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…☆21Jan 9, 2024Updated 2 years ago
- [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, …☆129Apr 4, 2025Updated 11 months ago
- (ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale☆49Jun 4, 2025Updated 9 months ago
- [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding☆49Jan 9, 2024Updated 2 years ago
- ☆25Feb 27, 2023Updated 3 years ago
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…☆57Mar 4, 2024Updated 2 years ago
- my commonly-used tools☆64Jan 7, 2025Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Group Meeting Record for Baobao Chang Group in Peking University☆26May 17, 2021Updated 4 years ago
- [ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval☆34May 16, 2023Updated 2 years ago
- Machine translation experiments for WMT18 en-zh track☆32Apr 27, 2021Updated 4 years ago
- Methods and evaluation for aligning language models temporally☆30Mar 2, 2024Updated 2 years ago
- A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!☆137Dec 31, 2023Updated 2 years ago
- 🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.☆153Jan 16, 2026Updated last month
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- ☆32Oct 17, 2022Updated 3 years ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆41Feb 12, 2025Updated last year
- a large scientific paraphrase dataset for longer paraphrase generation☆39Oct 17, 2022Updated 3 years ago
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo☆34Aug 12, 2024Updated last year
- [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"☆151Sep 10, 2024Updated last year
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆33Nov 11, 2025Updated 3 months ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆17Nov 19, 2025Updated 3 months ago
- ☆12Jun 26, 2024Updated last year
- Multi-Agent LLM System for Digital Scam Protection☆12Dec 19, 2024Updated last year
- Official implementation of "Attention-aware semantic communications for collaborative inference” (IEEE IoTJ 2024)☆13Jan 22, 2026Updated last month
- Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence☆279Updated this week