Ravi-Teja-konda / AudioInsightsGenerator
Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions!
☆21Updated last year
Alternatives and similar repositories for AudioInsightsGenerator:
Users that are interested in AudioInsightsGenerator are comparing it to the libraries listed below
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- ☆14Updated last year
- A full-text search for YouTube subtitles and video metadata with a command line interface.☆31Updated last month
- Text-to-Music Generation with Rectified Flow Transformer☆8Updated 6 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆18Updated 5 months ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆24Updated last year
- Text prompt steered synthetic audio generators☆46Updated last year
- Fork of AudioLDM as a TuneFlow plugin☆39Updated 2 years ago
- [DEPRECIATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume…☆15Updated last year
- AudioLDM text to audio colab☆19Updated last year
- ☆17Updated 2 months ago
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- Ai generated music video with Riffusion and Gradio☆20Updated 2 years ago
- Prepare spectrograms from audio for training a Riffusion model☆15Updated 2 years ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆19Updated 5 months ago
- songGPT is an experimental open-source project that explores the potential of Language Models, specifically ChatGPT, in generating origin…☆45Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆16Updated 5 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆44Updated 7 months ago
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- Get up and running with Llama 2, Mistral, Gemma, and other large language models.☆15Updated last year
- 🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video ga…☆40Updated 2 years ago
- With a few words and a click of a button, quickly get an engaging, high quality video. (And optionally save and share it!)☆17Updated last month
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆19Updated 5 months ago
- A python library to find differences between audio and transcriptions☆17Updated last year
- A fast MP3 decoder for python, using minimp3☆28Updated 2 years ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 7 months ago
- Monkey Island fine-tune of Stable Diffusion☆10Updated 2 years ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated 5 months ago