Gen-Verse/Paper2Video

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Gen-Verse/Paper2Video)

Gen-Verse / Paper2Video

[ICCV 2025] Preacher: Paper-to-Video Agentic System

☆50

Alternatives and similar repositories for Paper2Video

Users that are interested in Paper2Video are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AhmedImtiazPrio / magnet-polarity
View on GitHub
Official repository for Polarity Sampling, CVPR 2022 ORAL
☆13Jul 25, 2022Updated 3 years ago
Gen-Verse / dLLM-RL
View on GitHub
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
☆511Jan 28, 2026Updated 5 months ago
SHI-Labs / T2I-Copilot
View on GitHub
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)
☆57Oct 6, 2025Updated 9 months ago
Kamaleswaran-Lab / Clin-JEPA
View on GitHub
☆17Jun 15, 2026Updated last month
arnav-gudibande / conceptSHAP
View on GitHub
PyTorch Transformer-based Language Model Implementation of ConceptSHAP
☆15Jun 11, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
haoweiz23 / DistDiff
View on GitHub
[NeurIPS 2024] The official repository of "Distribution-Aware Data Expansion with Diffusion Models".
☆17Dec 15, 2025Updated 7 months ago
ziqipang / MR-Video
View on GitHub
MR. Video: MapReduce is the Principle for Long Video Understanding
☆31Jun 18, 2026Updated last month
Cominclip / OmniVerifier
View on GitHub
[ICLR 2026 Oral & ICML 2026] Generative Universal Verifier as Multimodal Meta-Reasoner
☆64May 29, 2026Updated last month
Raibows / DynamicBatchSampler
View on GitHub
Yet another dynamic batch sampler for variable sequence data in PyTorch.
☆13Dec 9, 2021Updated 4 years ago
CYWang735 / AdaTooler-V
View on GitHub
☆71Feb 27, 2026Updated 4 months ago
YaoXingbo / MagicCity
View on GitHub
ICCV 2025
☆16Mar 26, 2026Updated 3 months ago
I-ESC / Project-Ava
View on GitHub
An implementation of Paper "Empowering Agentic Video Analytics Systems with Video Language Models"
☆31Nov 5, 2025Updated 8 months ago
multimodal-art-projection / P2P
View on GitHub
[ICLR 2026] P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark
☆54Jun 6, 2025Updated last year
GeunminHwang / DiffuseSlide
View on GitHub
Official implementation of DiffuseSlide
☆17Jun 30, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sastpg / RFTT
View on GitHub
RFTT: Reasoning with Reinforced Functional Token Tuning
☆29Feb 12, 2026Updated 5 months ago
mayuelala / GroupEditing
View on GitHub
[CVPR 2026] Group Editing: This repo is the official implementation of "Group Editing: Edit Multiple Images in One Go"
☆28Apr 3, 2026Updated 3 months ago
MAC-AutoML / WFS-SB
View on GitHub
[CVPR 2026] Wavelet-based Frame Selection by Detecting Semantic Boundary for Long Video Understanding
☆32Apr 12, 2026Updated 3 months ago
para-lost / AutoPresent
View on GitHub
Code for the paper "AutoPresent: Designing Structured Visuals From Scratch" (CVPR 2025)
☆174May 26, 2025Updated last year
KlingAIResearch / VANS
View on GitHub
[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO
☆119Feb 28, 2026Updated 4 months ago
weiqingq / CLRKDNet
View on GitHub
☆20Jul 14, 2025Updated last year
Gen-Verse / GenEnv
View on GitHub
GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators
☆62Dec 23, 2025Updated 7 months ago
Gen-Verse / HermesFlow
View on GitHub
[NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation
☆77Sep 19, 2025Updated 10 months ago
ruili33 / TPO
View on GitHub
☆41Sep 9, 2025Updated 10 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
yaolu / ordered-prompt
View on GitHub
☆13Dec 13, 2022Updated 3 years ago
SanBingYouYong / shapecraft
View on GitHub
Official repo for NeurIPS 2025 poster - ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling.
☆27May 6, 2026Updated 2 months ago
Gen-Verse / MMaDA
View on GitHub
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)
☆1,660Feb 14, 2026Updated 5 months ago
SherryXTChen / Instruct-CLIP
View on GitHub
Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning (CVPR 2025)
☆34Jun 10, 2025Updated last year
NJU-LINK / OmniVideoBench
View on GitHub
The Source Code for OmniVideoBench @ICLR 2026
☆77Feb 12, 2026Updated 5 months ago
Xiaobin-Rong / lite-rtse
View on GitHub
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
☆14Nov 19, 2023Updated 2 years ago
yl3800 / EIGV
View on GitHub
☆15Aug 12, 2022Updated 3 years ago
Hoar012 / TDC-Video
View on GitHub
Official implementation of TDC.
☆15Jul 22, 2025Updated last year
CKboss / PyApplyLUT
View on GitHub
apply .cube file on image in python
☆16Oct 2, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
EnVision-Research / A4-Agent
View on GitHub
A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning (ECCV 2026)
☆41Jun 29, 2026Updated 3 weeks ago
zrz1996 / Spam-Email-Classifier-DataSet
View on GitHub
Some simple codes to format the CSDMC2010 SPAM corpus
☆11Sep 18, 2016Updated 9 years ago
rongyaofang / GoT
View on GitHub
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
☆317Sep 28, 2025Updated 9 months ago
AutoLab-SAI-SJTU / AutoPage
View on GitHub
[ACL2026 Findings] Official implementation for Human-Agent Collaborative Paper-to-Page Crafting
☆167Apr 8, 2026Updated 3 months ago
RUCBM / LeaF
View on GitHub
☆14Nov 2, 2025Updated 8 months ago
AVoCaDO-Captioner / AVoCaDO
View on GitHub
https://avocado-captioner.github.io/
☆37Oct 16, 2025Updated 9 months ago
camenduru / ControlNet-Video
View on GitHub
☆13Feb 18, 2023Updated 3 years ago