A simple, easy-to-understand library for diffusion models using Flax and Jax. Includes detailed notebooks on DDPM, DDIM, and EDM with simplified mathematical explanations. Made as part of my journey for learning and experimenting with generative AI.
☆41May 6, 2025Updated 9 months ago
Alternatives and similar repositories for FlaxDiff
Users that are interested in FlaxDiff are comparing it to the libraries listed below
Sorting:
- Implementation of Diffusion Transformers and Rectified Flow in Jax☆27Jul 9, 2024Updated last year
- ☆15May 11, 2025Updated 9 months ago
- Solving Inverse Problems with Diffusion Optimal Control [NeurIPS 2024]☆19Dec 21, 2024Updated last year
- JAX bindings for Flash Attention v2☆103Feb 19, 2026Updated last week
- Code repository for Particle Denoising Diffusion Sampler☆15Apr 2, 2024Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Official Pytorch repository for "Diffusion Posterior Sampling for Linear Inverse Problem Solving: A Filtering Perspective", where FPS (Fi…☆50Aug 6, 2024Updated last year
- Implementation of the ByteDance MagicMix paper☆19Nov 4, 2022Updated 3 years ago
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Aug 30, 2022Updated 3 years ago
- MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music☆26Jan 7, 2026Updated last month
- Alchera AI Competition 2nd Solution (body part segmentation)☆23Dec 7, 2021Updated 4 years ago
- JAX Implementations of Descript Audio Codec and EnCodec☆33Mar 30, 2025Updated 11 months ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated 11 months ago
- faster inference☆28Jan 20, 2025Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 9 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 10 months ago
- [NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities☆69Dec 21, 2025Updated 2 months ago
- ☆13Oct 5, 2025Updated 4 months ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆34Feb 21, 2025Updated last year
- ☆36Sep 6, 2025Updated 5 months ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 4 months ago
- PyTorch implementation of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.☆27Aug 10, 2021Updated 4 years ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆22Feb 13, 2026Updated 2 weeks ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆42Mar 7, 2025Updated 11 months ago
- Split-Lohmann Multifocal Displays☆40Jun 27, 2024Updated last year
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆35May 25, 2023Updated 2 years ago
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 5 months ago
- Code for the paper "Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription"☆40May 5, 2024Updated last year
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆22Jan 19, 2026Updated last month
- 서울시 열섬현상 완화를 위한 녹지 및 바람길 입지 선정☆18Dec 29, 2019Updated 6 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 7 months ago
- Skeleton for scalable and flexible Jax RL implementations☆96Jul 1, 2023Updated 2 years ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax☆159Nov 1, 2022Updated 3 years ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- my profile readme☆14Updated this week
- ☆12Jul 8, 2024Updated last year