Ravi-Teja-konda / AudioInsightsGenerator
Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions!
☆21 Updated last year
Alternatives and similar repositories for AudioInsightsGenerator:
Users who are interested in AudioInsightsGenerator are comparing it to the repositories listed below.
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient… ☆48 Updated 3 weeks ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music ☆24 Updated last year
- A full-text search for YouTube subtitles and video metadata with a command line interface. ☆30 Updated 3 weeks ago
- A python library to find differences between audio and transcriptions ☆16 Updated last year
- Ask shortgpt for instant and concise answers ☆13 Updated last year
- ☆14 Updated last year
- TTS Client for Coqui TTS server ☆13 Updated 2 years ago
- On-device speaker recognition engine powered by deep learning ☆32 Updated last week
- Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a… ☆37 Updated last year
- Chatbot web-applications with LLM, OpenAI API Assistants, LangChain, vector databases, and other AI stuff ☆24 Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆55 Updated 10 months ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io ☆32 Updated 2 weeks ago
- Prepare spectrograms from audio for training a Riffusion model ☆14 Updated last year
- ☆12 Updated last year
- GPT-4 powered code tool with no token limits. Works on repos or files. Can cleanup, optimize, comment, convert languages and more ☆11 Updated last year
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API… ☆15 Updated 4 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog). ☆43 Updated 6 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆19 Updated 4 months ago
- text-to-audio-latent-diffusion ☆37 Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 Updated last year
- With a few words and a click of a button, quickly get an engaging, high quality video. (And optionally save and share it!) ☆17 Updated this week
- Multichannel Looper/Feedback System for Riffusion ☆12 Updated last year
- ☆16 Updated last month
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆14 Updated 3 weeks ago
- Use mark to run lots of prompts on lots of data ☆18 Updated last year
- Claudetools is a Python library that enables function calling with the Claude 3 family of language models from Anthropic. ☆38 Updated last month
- Fork of AudioLDM as a TuneFlow plugin ☆39 Updated last year
- [DEPRECATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume… ☆15 Updated last year
- Monkey Island fine-tune of Stable Diffusion ☆10 Updated 2 years ago
- ai-validator is a powerful library that helps to extract and validate structured data from the output text of language models. ☆16 Updated last year