Automatic Video Generation from Scientific Papers
☆2,199Mar 5, 2026Updated 2 weeks ago
Alternatives and similar repositories for Paper2Video
Users that are interested in Paper2Video are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Video generation via code☆1,601Nov 25, 2025Updated 3 months ago
- Muti-human Interactive Talking Dataset☆69Aug 6, 2025Updated 7 months ago
- ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands☆101Feb 6, 2026Updated last month
- [AAAI 2026] SlideTailor: Personalized Presentation Slide Generation for Scientific Papers☆48Jan 1, 2026Updated 2 months ago
- Nextjs RCE Exploit Kit☆154Feb 13, 2026Updated last month
- [ICCV 2025] Balanced Image Stylization with Style Matching Score☆68Mar 9, 2026Updated 2 weeks ago
- ☆43Aug 5, 2025Updated 7 months ago
- "Paper2Slides: From Paper to Presentation in One Click"☆3,199Mar 15, 2026Updated last week
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆52Oct 14, 2024Updated last year
- Mattermost is an open source platform for secure collaboration across the entire software development lifecycle..☆27Oct 20, 2025Updated 5 months ago
- Exploring Representation-Aligned Latent Space for Better Generation☆18Mar 17, 2026Updated last week
- ☆12Nov 21, 2024Updated last year
- Source code of smaller projects.☆27Feb 1, 2026Updated last month
- The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation".☆28Sep 1, 2025Updated 6 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆59Mar 29, 2025Updated 11 months ago
- EmoCAST: Emotional Talking Portrait via Emotive Text Description☆29Dec 23, 2025Updated 3 months ago
- Terminal-based tool to track and manage build artifacts from multiple programming languages. Built with Ratatui☆85Nov 28, 2025Updated 3 months ago
- Nodes for image juxtaposition for Flux in ComfyUI☆12Apr 22, 2025Updated 11 months ago
- A lightweight frontend for ffmpeg intended specifically for convenient video clipping☆42Sep 23, 2025Updated 6 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator☆114Mar 21, 2025Updated last year
- ☆27Jan 28, 2026Updated last month
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory☆61Jan 13, 2026Updated 2 months ago
- [CVPR '26] SceneTok: A Compressed, Diffusable Token Space for 3D Scenes☆129Updated this week
- [SIGGRAPH Asia 2024 Conference Track] Boosting 3D Object Generation through PBR Materials☆15Dec 26, 2024Updated last year
- ☆19Dec 13, 2025Updated 3 months ago
- DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles☆32Mar 8, 2026Updated 2 weeks ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 9 months ago
- [CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image…☆78Feb 26, 2026Updated 3 weeks ago
- Pytorch implementation of "SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery"☆53Updated this week
- GPT4 based personalized ArXiv paper assistant bot☆12Mar 1, 2024Updated 2 years ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated last month
- "DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"☆14,945Mar 3, 2026Updated 3 weeks ago
- Decoupled Memory Selection for Multi-target Video Segmentation of SAM3☆40Jan 16, 2026Updated 2 months ago
- Python library for building and sharing dataframe-agnostic, sklearn-style transformers and ml models for data science competitions.☆28Mar 10, 2026Updated last week
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,895Jan 8, 2026Updated 2 months ago
- Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"☆346Mar 16, 2026Updated last week
- ☆15Feb 23, 2026Updated last month
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,755Jan 20, 2026Updated 2 months ago