Ravi-Teja-konda / AudioInsightsGenerator
Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions!
☆15Updated last year
Related projects:
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆34Updated last week
- Cog wrapper for collabora/WhisperSpeech☆23Updated 6 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆41Updated last month
- Multichannel Looper/Feedback System for Riffusion☆12Updated last year
- A repo with scripts to test and play around with Facebook's recent llama models! 🤗☆29Updated last year
- Experimental sampler to make LLMs more creative☆29Updated last year
- ☆13Updated last year
- A minimalistic, Streamlit-based automatic speech recognition web app powered by OpenAI's state-of-the-art Whisper models☆65Updated last year
- Open source Python program for automating gain staging. Part 1 of a series for automating audio processing tasks; the end goal is to create a…☆30Updated 11 months ago
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆16Updated 2 months ago
- Accelerate Whisper tasks, such as transcription, through multiprocessing-based parallelization☆24Updated last year
- Examples of apps built with Nendo, the AI Audio Tool Suite☆55Updated 6 months ago
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆24Updated 11 months ago
- Run embedding models using ONNX☆23Updated 7 months ago
- ☆23Updated 8 months ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆16Updated last year
- ☆22Updated 2 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆29Updated 3 weeks ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆17Updated 6 months ago
- With a few words and a click of a button, quickly get an engaging, high quality video. (And optionally save and share it!)☆17Updated 2 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆29Updated 2 months ago
- Fork of AudioLDM as a TuneFlow plugin☆38Updated last year
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, …☆19Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆81Updated 4 months ago
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆13Updated last month
- Explore the use of DSPy for extracting features from PDFs 🔎☆24Updated 6 months ago
- Tracking the state of the art and recent results (bibliography) on sound tasks.☆28Updated last year
- Assign color hues to a collection of text fragments based on embeddings☆20Updated 3 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆51Updated 5 months ago
- A chatbot UI for RAG, multimodal, and text completion. (Supports Transformers, llama.cpp, MLX, vLLM)☆15Updated 5 months ago