tjunttila / pdf2video
A tool for making videos from PDF presentations.
☆26Updated 4 years ago
Alternatives and similar repositories for pdf2video
Users who are interested in pdf2video are comparing it to the libraries listed below.
- ☆20Updated last year
- Generate video stories with AI ✨☆33Updated 10 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated 3 months ago
- AI Lip Syncing application, deployed on Streamlit☆43Updated last year
- Auto-Video maker handling many AIs☆11Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆36Updated 2 years ago
- ☆83Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆45Updated 3 months ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- A JS web client for connecting to Pipecat bots with voice and vision☆45Updated 6 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- web based editor for subtitles and transcripts☆138Updated 11 months ago
- ☆13Updated last year
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆80Updated 10 months ago
- ☆28Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).☆59Updated this week
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆15Updated last month
- ☆47Updated last year
- VideoDB Python SDK☆75Updated this week
- ☆17Updated last year
- GroqChat: Local ChatGPT-like environment in your browser using the Llama 3.1 series of open models on the Groq inference engine.☆86Updated 11 months ago
- AIPE (AI Pipeline Engine) is a flexible and powerful tool for creating and executing complex AI workflows☆21Updated 11 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- Summarize YouTube Videos and Generate Timestamps Efficiently using LLM [Google Gemini Pro, OpenAI ChatGPT]☆77Updated last month
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆18Updated last month
- OpenAI API and Whisper based Video Translation☆73Updated 7 months ago
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆25Updated last year
- Archived 🚧|🌻Building ChatBot with LLMs.🌻 | Using async requests. | Multi-LLM adaptability | General-purpose LLM agent framework | Multi-persona, fully type-annotated☆40Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated 9 months ago