[EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation
☆129Nov 22, 2025Updated 3 months ago
Alternatives and similar repositories for PresentAgent
Users that are interested in PresentAgent are comparing it to the libraries listed below
Sorting:
- A powerful desktop app turning ppt to video with AI voiceover and subtitles☆25Aug 23, 2025Updated 6 months ago
- AI Voice Agents: Exploring the Next Generation of Human-Machine Interaction! 🎙️🤖🎧☆10Aug 30, 2024Updated last year
- ☆29Nov 10, 2025Updated 3 months ago
- Cheatsheet of the Mojo programming language☆10May 18, 2023Updated 2 years ago
- This is a Python tool for converting USD models to MuJoCo MJCF format.☆29Jul 28, 2025Updated 7 months ago
- A Software-as-a-Service app with AI features and payments & credits system built using Next.js 14, Clerk, MongoDB, Cloudinary AI, and Str…☆17Jan 30, 2026Updated last month
- Unsupervised anomaly detection for auditing datasets and impact of categorical encodings☆24Aug 20, 2024Updated last year
- ☆33Jul 15, 2025Updated 7 months ago
- Xiwu: A Large Lanauge Model for High Energy Physics☆21Jan 20, 2025Updated last year
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆31Jan 13, 2026Updated last month
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆23Nov 8, 2024Updated last year
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆82Jan 16, 2026Updated last month
- Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.☆37Jul 3, 2025Updated 8 months ago
- ☆47Aug 5, 2025Updated 6 months ago
- ☆51Jul 31, 2025Updated 7 months ago
- ☆34Feb 6, 2026Updated 3 weeks ago
- ☆55Feb 2, 2026Updated last month
- 3D Gaussian Splatting for underwater scene reconstruction via physcial-based appearance-medium decoupling☆23Feb 13, 2026Updated 2 weeks ago
- ☆29Dec 16, 2025Updated 2 months ago
- Platform API Project seed☆12Nov 8, 2023Updated 2 years ago
- ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback☆121Sep 20, 2025Updated 5 months ago
- ☆41Jun 9, 2025Updated 8 months ago
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memory☆655Jan 22, 2026Updated last month
- Isaac for Healthcare reference workflows☆96Jan 6, 2026Updated last month
- 这是一次学校大作业,希望和大家分享,一起进步。此项目分驱动部分,遥控部分,视觉部分以及Web控制部分。是基于ESP32与Jetson Nano做的一个小项目。其中运用到了蓝牙串口片与片之间的通信,IP私域下的多机通信,以及ESP32中便携的Web功能进行通信。具体各部分内容…☆12Nov 5, 2024Updated last year
- This is a list used to collect the available (open-source / closed-source) projects that comply with Google Agent2Agent.☆13Apr 24, 2025Updated 10 months ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- Web interface for building and managing your own agentic record label.☆10Updated this week
- 稚晖君电子Esp32脱机版☆11Jan 15, 2025Updated last year
- 二维码活码管理系统☆10Jun 17, 2020Updated 5 years ago
- Workflow Runtime Engine based on CNCF Workflow Specification for Agentic Workflows☆22Updated this week
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆23Dec 10, 2025Updated 2 months ago
- Arduino library for Gavesha® Robomatics Gear Motor.☆10Feb 15, 2025Updated last year
- MiniGPT-Pancreas: Multimodal Large language Model for Pancreas Cancer Classification and Detection☆11Sep 19, 2025Updated 5 months ago
- The official Stream feeds library for Android☆22Feb 16, 2026Updated 2 weeks ago
- ☆72Jan 29, 2026Updated last month
- 🕹 Pikachu-volleyball game-based multi-agent RL environment using PettingZoo☆11Sep 29, 2024Updated last year
- ☆14Aug 10, 2025Updated 6 months ago
- musicai☆12Apr 20, 2024Updated last year