Meatfucker / metatron2Links
A Multimodal Discord bot with machine learning functions, including LLM chat, Image generation, and Speech Generation capabilities
β12Updated 2 years ago
Alternatives and similar repositories for metatron2
Users that are interested in metatron2 are comparing it to the libraries listed below
Sorting:
- β27Updated 2 years ago
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ20Updated 2 years ago
- Generate images from an initial frame and textβ37Updated 2 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedbackβ¦β10Updated 3 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β123Updated 6 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.ioβ16Updated last year
- Program that enables seamless interaction with your documents through an advanced vector database and the power of Large Language Model (β¦β18Updated 2 years ago
- AudioLDM text to audio colabβ19Updated 2 years ago
- fine-tuning MusicGen without prompts to generate music with a specific styleβ67Updated 2 years ago
- Make-A-Video Latent Diffusion Modelβ19Updated 2 years ago
- β40Updated last year
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)β25Updated 3 years ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)β30Updated 7 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β32Updated 2 years ago
- Oobabooga extension for Bark TTSβ119Updated 2 years ago
- openai guided diffusion tweaksβ52Updated 3 years ago
- β12Updated 2 years ago
- Decked-out gradio client for audio diffusion, mainly stable-audio-tools.β38Updated 2 months ago
- Examples of apps built with Nendo, the AI Audio Tool Suiteβ55Updated last year
- π Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model barkβ69Updated 6 months ago
- text-to-audio-latent-diffusionβ37Updated 2 years ago
- Resonance: Audio-Image Interconversion for AI Diffusion Modelsβ39Updated last year
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fixβ38Updated 3 years ago
- β24Updated 2 years ago
- β107Updated 2 years ago
- GradioUI for TortoiseTTS voice generationβ34Updated 2 years ago
- β18Updated last year
- Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuningβ13Updated 9 months ago
- β14Updated last year
- β11Updated last year