freddyaboulton / gradio-webrtc
☆42Updated last week
Related projects
Alternatives and complementary repositories for gradio-webrtc
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 ☆52Updated last month
- Community ComfyUI workflows running on fal.ai☆54Updated 2 months ago
- ASR + diarization model server with speculative decoding☆49Updated 5 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆16Updated last month
- ☆45Updated 8 months ago
- Gradio app to track objects in video and add visual effects☆16Updated last month
- ☆23Updated 5 months ago
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.☆41Updated last week
- ☆11Updated 3 weeks ago
- ☆93Updated 2 months ago
- Cog wrapper for collabora/WhisperSpeech☆24Updated 8 months ago
- StoryDiffusion serverless worker☆13Updated 5 months ago
- ☆55Updated 10 months ago
- A quality zero-shot lipsync pipeline built with MuseTalk, LivePortrait, and CodeFormer.☆27Updated last month
- ☆22Updated 3 weeks ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆61Updated last week
- A Gradio component that can be used to annotate images with bounding boxes.☆31Updated 2 weeks ago
- ☆47Updated this week
- This repository stores the source code for the Mistral Hackathon 2024 in Paris☆16Updated 2 months ago
- 🍳 AyaMCooking is a Voice-to-Voice Multi-lingual RAG Agent that makes a perfect sous chef for your kitchen, in up to 10 Languages 🤌🧑🍳☆17Updated 2 weeks ago
- ☆28Updated 10 months ago
- Running the F5-TTS by ONNX Runtime☆27Updated last week
- ☆30Updated 10 months ago
- Industry-leading face manipulation platform☆53Updated last week
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆63Updated last month
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆14Updated 10 months ago
- Incredibly descriptive audiovisual summaries for videos☆39Updated 3 months ago
- ☆66Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆77Updated 5 months ago
- tiny_fnc_engine is a minimal Python library that provides a flexible engine for calling functions extracted from an LLM.☆37Updated 2 months ago