martintomov / gpt4v-video-voiceover
Video Voiceover with gpt-4o-mini
☆32Updated 11 months ago
Alternatives and similar repositories for gpt4v-video-voiceover
Users who are interested in gpt4v-video-voiceover compare it to the libraries listed below.
- Talking head video AI generator☆79Updated last year
- ☆89Updated last year
- ☆34Updated last year
- Harness the power of NVIDIA technologies and LangChain to create dynamic avatars from live speech, integrating RIVA ASR and TTS with Audi…☆88Updated last year
- Agent with vision ability via llava & autogen☆73Updated last year
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆39Updated 2 years ago
- Simple example to showcase how to use llamaparser to parse PDF files☆91Updated 11 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆136Updated last year
- Get started using Deepgram's Live Transcription with this Flask demo app☆40Updated last month
- AvaChat is a realtime AI chat demo with animated talking heads - it uses Large Language Models via API (OpenAI and Claude) as text inpu…☆110Updated 4 months ago
- Skills for AutoGen Studio agents: Python code skills to copy and paste into AutoGen Studio☆66Updated last year
- Open Sourced NoteBookLM☆59Updated 11 months ago
- ☆31Updated last year
- Langchain tools to search/extract/transcribe text transcripts of Youtube videos. Some of this has been integrated into LangChain main bra…☆74Updated 2 years ago
- An intelligent AI assistant that can do anything!☆54Updated last year
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- ChatTTS + Ollama☆84Updated last year
- This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).☆43Updated 5 months ago
- Simli WebRTC AI Agent demo☆23Updated 9 months ago
- This Discord chatbot is incredibly versatile, offering a wide range of customization options.☆95Updated last year
- A reproduction of the Gemini demo using GPT-vision.☆127Updated last year
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆102Updated 10 months ago
- ☆79Updated last year
- ☆97Updated last year
- Simple Chainlit UI for running llms locally using Ollama and LangChain☆119Updated last year
- Started out as Dynamic Function Calling for OAI. After reviewing a released research paper (LATM), this has become an implementation of s…☆98Updated last year
- Multimodal Chat with Gemini API☆48Updated last year
- Generate video stories with AI ✨☆32Updated last year
- ☆46Updated last year
- ✨ Experience the enchantment of Story Blocks: an open-source project merging AI text generation and image synthesis to create captivating…☆63Updated last year