saxenarohit / MovieSum

☆16

Alternatives and similar repositories for MovieSum

Users who are interested in MovieSum are comparing it to the repositories listed below.

xverse-ai / XVERSE-MoE-A36B
XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.
☆38 Updated last year
RhapsodyAILab / MiniCPM-V-Embedding
☆29 Updated last year
shulin16 / MMInA
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
☆47 Updated 9 months ago
SparksJoe / Prism
A Framework for Decoupling and Assessing the Capabilities of VLMs
☆43 Updated last year
vaew / SkyScript-100M
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2
☆129 Updated last year
jdf-prog / LLM-Engines
☆50 Updated 5 months ago
SkyworkAI / MindLink
☆98 Updated 3 months ago
MBZUAI-LLM / web2code
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
☆97 Updated last year
neulab / MultiUI
Code for the paper: Harnessing Webpage UIs for Text-Rich Visual Understanding
☆53 Updated 11 months ago
lucasjinreal / ImageTokenizer
imagetokenizer is a Python package that helps you encode visuals and generate visual token IDs from a codebook; supports both image and video…
☆37 Updated last year
TIGER-AI-Lab / One-Shot-CFT
The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]
☆33 Updated 3 months ago
MetaStone-AI / MetaStone-S1
The open-source code of MetaStone-S1.
☆107 Updated 4 months ago
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33 Updated last year
FudanNLPLAB / MouSi
☆75 Updated last year
18907305772 / FuseAI
FuseAI Project
☆87 Updated 10 months ago
StarRing2022 / R1-Nature
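The simplest reproduction of R1 results on a small model, explaining the most important essence of O1-like models and DeepSeek R1. Think is all you need. Experiments support the view that, for strong reasoning ability, the "think" process content is the core of AGI/ASI.
☆44 Updated 9 months ago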
Alpha-VLLM / WeMix-LLM
☆17 Updated 2 years ago
SHI-Labs / VisPer-LM
[NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation, arXiv 2024
☆66 Updated last month
360CVGroup / 360VL
Our 2nd-gen LMM
☆34 Updated last year
InternLM / AlchemistCoder
☆35 Updated last year
nengelmann / Fuyu-8B---Exploration
Exploration of Adept's multimodal Fuyu-8B model. 🤓 🔍
☆27 Updated 2 years ago
lunyiliu / CoachLM
Code and data for CoachLM, an automatic instruction revision approach for LLM instruction tuning.
☆60 Updated last year
bigai-nlco / TokenSwift
[ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation
☆118 Updated 6 months ago
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆43 Updated 9 months ago
HumanMLLM / HumanOmniV2
☆141 Updated 4 months ago
BytedanceDouyinContent / SAIL-VL2
The SAIL-VL2 series model developed by the BytedanceDouyinContent Group
☆75 Updated 2 months ago
Bui1dMySea / MemLong
☆95 Updated last year
patrick-tssn / Awesome-Colorful-LLM
Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…
☆124 Updated 6 months ago
PKU-YuanGroup / LLaVA-o1
☆56 Updated last year
sterzhang / image-textualization
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)
☆169 Updated last year