VikramxD / PicPilotLinks
Generate Stunning Images and Craft Visual Stories for your Brand
☆19Updated last year
Alternatives and similar repositories for PicPilot
Users that are interested in PicPilot are comparing it to the libraries listed below
Sorting:
- ☆17Updated last year
- Optimizing diffusion for production-ready speeds☆34Updated 3 weeks ago
- Gradio UI for a Cog API☆70Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆66Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated 2 years ago
- [WIP] AI Try-On plugin for Chrome☆28Updated last year
- ☆17Updated last year
- ☆21Updated 2 years ago
- ☆19Updated last year
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆38Updated 2 weeks ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Updated 3 months ago
- Gradio app to track objects in video and add visual effects☆17Updated 6 months ago
- ☆12Updated 2 years ago
- ☆51Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated 2 years ago
- Community ComfyUI workflows running on fal.ai☆57Updated last year
- ☆40Updated last year
- Retrieve the source code for any model made available on replicate.com!☆36Updated 2 years ago
- This repository is an implementation of converting sketches into lively videos using Google's Veo 3 model.☆76Updated 7 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated last year
- ☆23Updated last year
- ☆29Updated 2 years ago
- Build Web Datasets with Ease☆33Updated last year
- Daily.co + Pipecat + Tavus AI Avatar Agent☆15Updated 9 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- ☆18Updated 4 months ago
- Visual RAG using less than 300 lines of code.☆29Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- ☆41Updated last year
- ☆27Updated last year