Overworldai/owl-vaes

Weird autoencoder experiments

☆24

Alternatives and similar repositories for owl-vaes

Users that are interested in owl-vaes are comparing it to the libraries listed below.

primepake / dac_vae
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38 · Aug 30, 2025 · Updated 10 months ago
etzinis / optimal_condition_training
Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…
☆14 · Feb 15, 2023 · Updated 3 years ago
Gengzigang / TokenSet
Official PyTorch implementation of TokenSet.
☆129 · Mar 21, 2025 · Updated last year
primepake / learnable-speech
This repo is text to speech with learnable audio encoder without alignment with transcript reference
☆54 · Sep 20, 2025 · Updated 10 months ago
Yaofang-Liu / Mochi-Full-Finetuner
Code for full fine-tuning of the Mochi model with FSDP (and CP)
☆29 · Jul 15, 2025 · Updated last year
microsoft / AudioEntailment
Audio Entailment: Deductive Reasoning for Audio Understanding
☆17 · Dec 10, 2024 · Updated last year
kvfrans / matrix-whitening
Code for "What really matters in matrix-whitening optimizers?"
☆25 · Oct 31, 2025 · Updated 8 months ago
lucaslingle / mu_transformer
Official implementation of 'A Large-Scale Exploration of mu-Transfer' (CoRR 2024)
☆31 · Jun 5, 2025 · Updated last year
Wataru-Nakata / latentlm-tts
☆29 · Jul 3, 2026 · Updated 2 weeks ago
fal-ai-community / alphabet-dataset
Synthetic Alphabet Dataset
☆19 · Mar 27, 2025 · Updated last year
MaxxP0 / WorldModel
WorldModel is a MaskGIT model trained on 8x8x8 Minecraft voxel volumes. Beyond generating blocks from scratch, it excels in filling space…
☆14 · Sep 12, 2023 · Updated 2 years ago
naver-ai / RapFlow-TTS
☆56 · Jul 16, 2025 · Updated last year
yitongdeng-projects / infinite_resolution_integral_noise_warping_code
Official Implementation of Infinite-Resolution Integral Noise Warping for Diffusion Models [ICLR 2025]
☆16 · Mar 15, 2025 · Updated last year
Arongil / lipschitz-transformers
Don't just regulate gradients like in Muon, regulate the weights too
☆32 · Jul 30, 2025 · Updated 11 months ago
Sreyan88 / CompA
Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
☆23 · Jul 10, 2024 · Updated 2 years ago
yinboc / dito
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
☆168 · Jan 31, 2025 · Updated last year
GFNOrg / diffusion-finetuning
☆43 · Jul 26, 2024 · Updated last year
jlevy / simple-modern-uv-template
GITHUB TEMPLATE — Click "Use this template" above or see link below for docs:
☆15 · Jul 13, 2026 · Updated last week
chrisdonahue / ddc_onset
Music onset detector from Dance Dance Convolution packaged as a lightweight PyTorch module
☆43 · Sep 22, 2023 · Updated 2 years ago
RyannDaGreat / rp
This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires python≥3.5
☆13 · Jul 13, 2026 · Updated last week
j0seo / lookahead-anchoring
☆15 · Oct 27, 2025 · Updated 8 months ago
apple-yinhan / Noise-robust-SED
☆14 · Jan 2, 2025 · Updated last year
apple / ml-flextok
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
☆321 · Jun 2, 2025 · Updated last year
ZhouHuang23 / SBA-Net
Dataset, code and results repository for SBA-Net.
☆14 · Sep 23, 2022 · Updated 3 years ago
mkshing / scedit-pytorch
Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"
☆85 · Dec 26, 2023 · Updated 2 years ago
mstou / White-Noise-CanSat2018
DrillSat 2018
☆16 · Oct 7, 2018 · Updated 7 years ago
gstoica27 / DeltaFM
[ICCV 2025] Official Implementation of Contrastive Flow Matching
☆183 · Jun 25, 2025 · Updated last year
primepake / F5-TTS-meanflow-multilingual
Meanflow and multilingual for F5-TTS model
☆16 · Aug 23, 2025 · Updated 10 months ago
cloneofsimo / min-max-in-dit
☆27 · May 3, 2024 · Updated 2 years ago
hhguo / WaveRNN
Based on https://github.com/fatchord/WaveRNN
☆24 · May 3, 2020 · Updated 6 years ago
eclipse-t2i / eclipse-inference
[CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"
☆65 · May 1, 2024 · Updated 2 years ago
zjwang21 / mix-phoneme-bert
An unofficial PyTorch implementation of Mix-Phoneme-Bert
☆40 · Jul 10, 2023 · Updated 3 years ago
UCSB-NLP-Chang / diffusion_resampling
Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024]
☆34 · Dec 12, 2023 · Updated 2 years ago
zafarrafii / CQHC-Python
Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.
☆29 · Sep 13, 2025 · Updated 10 months ago
MCG-NJU / PixNerd
[ICLR 2026] PixNerd: Pixel Neural Field Diffusion
☆182 · Dec 10, 2025 · Updated 7 months ago
SwayStar123 / reimei
☆28 · Oct 7, 2025 · Updated 9 months ago
google-deepmind / objaverse_annotations
☆15 · Dec 16, 2023 · Updated 2 years ago
lfranke / vr_splatting
☆26 · Jul 3, 2025 · Updated last year
martin-marek / batch-size
📄 Small Batch Size Training for Language Models
☆82 · Mar 18, 2026 · Updated 4 months ago