nrl-ai / CustomChar
Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.
β111Updated last year
Alternatives and similar repositories for CustomChar:
Users that are interested in CustomChar are comparing it to the libraries listed below
- π₯ Your private task assistant with GPT π₯ - Ask questions about your documents.β158Updated 7 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ56Updated 6 months ago
- Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into Your Favorite Celebrity's or Your Customized Voice. Our Cuttingβ¦β52Updated last month
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)β66Updated last year
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features inclβ¦β17Updated 11 months ago
- Efficient approach to speaker diarization using voice characteristics extractionβ94Updated last year
- Generate video stories with AI β¨β32Updated 8 months ago
- A curated list of awesome stable diffusion resources πβ56Updated 3 weeks ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.β26Updated 11 months ago
- Video to video translation via few shot voice cloning & audio-based lip syncβ25Updated 10 months ago
- Input a YouTube video link or upload a video file and get a video with subtitles.β119Updated 8 months ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.β47Updated 3 months ago
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 β¦β158Updated 8 months ago
- ONNX-compatible Fast SeamlessM4TβMassively Multilingual & Multimodal Machine Translationβ43Updated last year
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITSβ52Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ94Updated last year
- Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differβ¦β19Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.β35Updated 2 years ago
- GUI to sync video mouth movements to match audio, utilizing wav2lip-hq. Completed as part of a technical interview.β11Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.β47Updated this week
- an improved version of Real-time-voice-cloningβ50Updated last year
- β138Updated 5 months ago
- Running the F5-TTS by ONNX Runtimeβ147Updated this week
- Simulates talk with an AI that can express emotionsβ68Updated 9 months ago
- Misc. tools/scripts that I made to use for tortoiseβ21Updated 8 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.β70Updated 10 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime portβ¦β19Updated 8 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.β60Updated 7 months ago
- β79Updated last year
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ159Updated 9 months ago