kardSIM / audio2imgView external linksLinks
Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model
☆13Sep 25, 2024Updated last year
Alternatives and similar repositories for audio2img
Users that are interested in audio2img are comparing it to the libraries listed below
Sorting:
- Vehicle speed estimation using YOLOv9 for object detection and DeepSORT for tracking☆16Sep 13, 2024Updated last year
- Simple LLM inference server☆20Jun 13, 2024Updated last year
- CoreXY conversion for the Folgertech FT-5 printer☆15Feb 20, 2024Updated last year
- Experimental sampler to make LLMs more creative☆31Aug 2, 2023Updated 2 years ago
- A modified Ziggurat Algorithm for efficiently generating exponentially- and normally-distributed PseudoRandom Numbers (PRNs).☆12May 21, 2025Updated 8 months ago
- Optimization solvers in pure Python: LP, MILP, SAT, constraint programming, graph and metaheuristics. No dependencies. Solvor all your op…☆25Feb 1, 2026Updated last week
- Roadmap to become a Linux-Fine☆10Jul 13, 2024Updated last year
- ComfyUI workflows to create smooth transitions between video clips using Wan VACE. Works with video from any model or other source-LTX-2,…☆29Feb 6, 2026Updated last week
- A PyTorch implementation of the shearlet transform.☆13Oct 9, 2025Updated 4 months ago
- ☆15Mar 11, 2025Updated 11 months ago
- An extension for oobabooga´s Text Generation WebUI☆11May 29, 2023Updated 2 years ago
- A notebook containing implementations of different graph deep node embeddings along with benchmark graph neural network models in tensorf…☆13Jul 17, 2021Updated 4 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated last month
- An agentic runtime that enables secure, extensible and configurable AI automation from any model☆17Jan 19, 2026Updated 3 weeks ago
- This is the repo with the code to conduct a comparative analysis of different audio representation models.☆12Aug 31, 2023Updated 2 years ago
- Your FREE AWS Journey Starts Here! (O.V.E.R)☆10Feb 6, 2026Updated last week
- ☆34Oct 29, 2025Updated 3 months ago
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆23Dec 15, 2025Updated last month
- This tool kit provides a quickstart for working with OpenSearch and ML models, especially LLMs for vector embeddings to power sementic an…☆17Jan 29, 2026Updated 2 weeks ago
- Official code for "IT³: Idempotent Test-Time Training" (ICML 2025)☆14Jun 25, 2025Updated 7 months ago
- For world model code developing and releasing.☆29Feb 6, 2026Updated last week
- ☆17Apr 22, 2024Updated last year
- ☆14Dec 16, 2022Updated 3 years ago
- A simple SDXL fine-tuning toolkit based on the DreamBooth branch of AutoTrain Advanced from 🤗, inspired by the way ai-toolkit approaches…☆18Sep 30, 2024Updated last year
- miaoshouai-assistant for webui-forge☆15Aug 15, 2024Updated last year
- ☆12Dec 23, 2024Updated last year
- ☆14Sep 19, 2024Updated last year
- ☆10May 14, 2024Updated last year
- ☆12Oct 23, 2022Updated 3 years ago
- xformers prebuild wheels for various video cards, suitable for both paperspace and google colab☆12Apr 7, 2023Updated 2 years ago
- An implementation of LLMzip using GPT-2☆13Aug 7, 2023Updated 2 years ago
- Agentic BYOK Browser-Based Website Builder☆25Feb 6, 2026Updated last week
- Re-taking voice conversations to the moon 🚀☆12Nov 9, 2022Updated 3 years ago
- RLHF for Video Diffusion Models☆23Jul 30, 2025Updated 6 months ago
- A script for merging a LLM model and a LoRA☆13Jun 22, 2023Updated 2 years ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆40Jan 29, 2026Updated 2 weeks ago
- Distributed NeuroSynapse Engine leveraging Predictive Modeling and Streaming Analytics to drive Intelligent Data Insights Explorer.☆31Jan 7, 2026Updated last month
- Unlocking SaikouHub's Synergy with Edge Computing, Real-Time Analytics, and Adaptive AI-Driven Orchestration Core.☆36Jan 14, 2026Updated 3 weeks ago
- Controlnet module for Wan2.1☆30Aug 4, 2025Updated 6 months ago