Ravi-Teja-konda / AudioInsightsGenerator
Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions!
☆24 Updated 5 months ago
Alternatives and similar repositories for AudioInsightsGenerator
Users who are interested in AudioInsightsGenerator are comparing it to the repositories listed below.
- Implements ML audio separation algorithm on audio from YouTube or Spotify resulting in "stems" for download (e.g. vocals, drums, bass) in… ☆34 Updated this week
- A full-text search for YouTube subtitles and video metadata with a GUI and command line interface. ☆39 Updated 3 weeks ago
- Experimental sampler to make LLMs more creative ☆31 Updated 2 years ago
- Examples of apps built with Nendo, the AI Audio Tool Suite ☆55 Updated last year
- Cog wrapper for collabora/WhisperSpeech ☆25 Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆60 Updated last year
- Code for OpenAI Whisper Web App Demo ☆93 Updated 3 years ago
- Sing an idea ➡️ AI music sample🔥🎶 ☆119 Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations ☆47 Updated 8 months ago
- Create your own RVC v2 dataset from a youtube video ☆30 Updated last year
- The purpose of this repository is to discuss on Audio transformers ☆13 Updated 3 months ago
- The Open Source AI Musical Toolkit ☆46 Updated 3 weeks ago
- Bypass browser bot detection in langchain tools ☆15 Updated 7 months ago
- A curated list of awesome OpenAI's Whisper ☆99 Updated 2 years ago
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM) ☆19 Updated last year
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce… ☆18 Updated last week
- Create Unmute voice embeddings ☆21 Updated 3 weeks ago
- ☆19 Updated last year
- Incredibly descriptive audiovisual summaries for videos ☆40 Updated last year
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel… ☆87 Updated 11 months ago
- Audio datasets, easier. ☆86 Updated 2 years ago
- LLM finetuned for generating symbolic music ☆42 Updated last year
- Fork of AudioLDM as a TuneFlow plugin ☆41 Updated 2 years ago
- Large-Language-Model to Machine Interface project. ☆19 Updated 2 years ago
- Local LLaMAs/Models in VSCode ☆54 Updated 2 years ago
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription. ☆64 Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆99 Updated last year
- Ultra-minimal autoregressive diffusion model for image generation ☆21 Updated last year
- Based on kylemcdonald/i2i-realtime. The warping server for GenDJ real time webcam AI warping ☆31 Updated 8 months ago
- Chat to Compose Video ☆197 Updated last year