lucataco / cog-whisperspeechLinks
Cog wrapper for collabora/WhisperSpeech
☆25Updated last year
Alternatives and similar repositories for cog-whisperspeech
Users that are interested in cog-whisperspeech are comparing it to the libraries listed below
Sorting:
- Gradio UI for a Cog API☆69Updated last year
- Retrieve the source code for any model made available on replicate.com!☆34Updated last year
- Seamless Voice Interactions with LLMs☆12Updated last year
- ☆26Updated last year
- ☆19Updated 11 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆63Updated last year
- ☆16Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- ☆17Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- ☆12Updated last year
- ☆55Updated 2 weeks ago
- Finetune any model on HF in less than 30 seconds☆57Updated 2 weeks ago
- ☆40Updated last year
- Community ComfyUI workflows running on fal.ai☆58Updated 11 months ago
- Cog wrapper for FalconsAi / nsfw_image_detection☆16Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Apps that run on modal.com☆12Updated last month
- All the world is a play, we are but actors in it.☆50Updated 2 weeks ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 8 months ago
- ☆29Updated last year
- ☆31Updated last year
- Gradio app to track objects in video and add visual effects☆17Updated last week
- VideoDB Python SDK☆78Updated this week
- Style-Transfer: Apply the style of an image to another image☆53Updated last year
- This repository is an implementation of converting sketches into lively videos using Google's Veo 3 model.☆48Updated last month
- ☆21Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆14Updated 2 weeks ago
- ☆115Updated 7 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆18Updated last month