KevKibe / African-Whisper
Framework for seamless fine-tuning of the Whisper model on a multilingual dataset and deployment to production.
★18 · Updated this week
Related projects
Alternatives and complementary repositories for African-Whisper
- ★105 · Updated last month
- ⚡️ Framework for fast persistent storage of multiple document embeddings and metadata into Pinecone for source-traceable, production-level… ★13 · Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ★45 · Updated 2 weeks ago
- Indic-Conformer models for ASR ★15 · Updated 4 months ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ★47 · Updated last year
- Audio search using Azure Cognitive Search ★21 · Updated last year
- AI Assistant, an end-to-end LLMOps project, fine-tuned to provide support for coding and design questions based on the latest trends in t… ★16 · Updated 10 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages ★90 · Updated last month
- a simple system for 2-way interruptible voice interactions between human and LLM ★17 · Updated 9 months ago
- ★152 · Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ★68 · Updated 6 months ago
- A minimalistic automatic speech recognition Streamlit-based webapp powered by OpenAI's Whisper "State of the Art" models ★65 · Updated 2 years ago
- 'Grad-TTS' with Multilingual Cleaners ★10 · Updated 7 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog). ★43 · Updated 3 months ago
- ★10 · Updated 2 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs ★36 · Updated 3 weeks ago
- ★35 · Updated 4 years ago
- Text To Speech Multilingual Support (20+ languages) ★35 · Updated last year
- Speaker diarization service ★19 · Updated this week
- Repository for fine-tuning Gemma models using Unsloth for Indic languages ★82 · Updated 8 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ★24 · Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU ★30 · Updated last year
- LLM finetuned for generating symbolic music ★33 · Updated 2 months ago
- GGUF Quantization of any LLM. ★31 · Updated 8 months ago
- Repository contains code to fine-tune the Whisper ASR model ★23 · Updated last year
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech ★91 · Updated last year
- 🍳 AyaMCooking is a Voice-to-Voice Multilingual RAG Agent that makes a perfect sous chef for your kitchen, in up to 10 languages 🤖🧑‍🍳 ★19 · Updated 3 weeks ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data. ★17 · Updated 3 years ago
- AI Voice Assistant: talk to an AI agent that handles event scheduling, managing contacts, accessing your knowledge base and web searching… ★13 · Updated 3 months ago
- Write tweets with AI Agents (CrewAI) and LLMs (Llama 3, GPT-4o) ★20 · Updated 5 months ago