sony/san

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sony/san)

sony / san

PyTorch implementation of slicing adversarial network (SAN)

☆98

Alternatives and similar repositories for san

Users that are interested in san are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sony / bigvsan
View on GitHub
Pytorch implementation of BigVSAN
☆203Dec 9, 2025Updated 7 months ago
EzioBy / glead
View on GitHub
[CVPR 2023] GLeaD: Improving GANs with A Generator-Leading Task
☆32Jun 5, 2023Updated 3 years ago
sarulab-speech / ml-audiocaps
View on GitHub
Multi-lingual AudioCaps
☆14Nov 20, 2023Updated 2 years ago
reppy4620 / vocoders
View on GitHub
My vocoder experiments
☆31Jul 26, 2025Updated last year
kobeshegu / FreGAN_NeurIPS2022
View on GitHub
[NeurIPS2022] FreGAN: Exploiting Frequency Components for Training GANs under Limited Data
☆57Oct 17, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
sarulab-speech / Coco-Nut
View on GitHub
Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus
☆21Jun 12, 2024Updated 2 years ago
catlab-team / fantasticstyles
View on GitHub
Repository for Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs
☆28Mar 17, 2022Updated 4 years ago
PlayVoice / BigVGAN
View on GitHub
BigVGAN with Neural Source-Filter
☆58Sep 21, 2023Updated 2 years ago
davispolito / Phase-Vocoder
View on GitHub
☆13Apr 10, 2020Updated 6 years ago
york135 / MIRMLPop
View on GitHub
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35Apr 22, 2024Updated 2 years ago
Aria-K-Alethia / laughter-synthesis
View on GitHub
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77Jul 16, 2023Updated 3 years ago
merlresearch / reverberation-as-supervision
View on GitHub
Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation
☆15Aug 1, 2024Updated last year
P1ping / TokAN-Legacy
View on GitHub
☆27Jun 22, 2026Updated last month
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mdx-workshop / mdx-submissions21
View on GitHub
Music Demixing Challenge Submission Repo
☆16Sep 8, 2023Updated 2 years ago
jacobleehei / genforce-streamlit
View on GitHub
☆16Oct 20, 2021Updated 4 years ago
W-Wu / ERC-SLT22
View on GitHub
Code for "Distribution-based Emotion Recognition in Conversation"
☆18Feb 6, 2023Updated 3 years ago
google / df-conformer
View on GitHub
Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.
☆36Jun 23, 2026Updated last month
soumimaiti / speechlmscore_tool
View on GitHub
☆34Nov 24, 2024Updated last year
archinetai / cqt-pytorch
View on GitHub
An invertible and differentiable implementation of the Constant-Q Transform (CQT).
☆73Dec 9, 2022Updated 3 years ago
YangAi520 / NSPP
View on GitHub
☆55Mar 2, 2023Updated 3 years ago
colinlaganier / FederatedDiffusionModels
View on GitHub
Federated Learning of Diffusion Models
☆12Aug 30, 2023Updated 2 years ago
Tayjsl97 / RL-Chord
View on GitHub
This is the official implementation of RL-Chord (TNNLS).
☆13Jan 2, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
thuhcsi / SnakeGAN
View on GitHub
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Apr 25, 2023Updated 3 years ago
fgnt / mms_msg
View on GitHub
Multipurpose Multi Speaker Mixture Signal Generator
☆46Feb 6, 2025Updated last year
supertone-inc / super-monotonic-align
View on GitHub
☆173Sep 19, 2024Updated last year
yamathcy / music-deeplearning-japanese
View on GitHub
深層学習×音楽情報処理勉強会@筑波大学・人と音の情報学研究室
☆19Jul 9, 2023Updated 3 years ago
sjhan91 / Loop_VQVAE_Official
View on GitHub
The implementation of "Symbolic Music Loop Generation with Neural Discrete Representations"
☆34Aug 24, 2022Updated 3 years ago
PrincetonLIPS / MaM
View on GitHub
Official code for Generative Marginalization Models [ICML 2024] [SPGIM 2023 Workshop Oral]
☆23Aug 19, 2024Updated last year
gudgud96 / basic-pitch-torch
View on GitHub
PyTorch version of Spotify's Basic Pitch
☆54Apr 19, 2024Updated 2 years ago
exercise-book-yq / Supercodec
View on GitHub
☆51Mar 5, 2026Updated 4 months ago
Andong-Li-speech / BridgeVoC
View on GitHub
This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".
☆67Nov 5, 2025Updated 8 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
mchijmma / modeling-nonlinear
View on GitHub
Modeling of nonlinear audio effects with end-to-end deep neural networks - website:
☆17May 11, 2020Updated 6 years ago
jxhuang0508 / GenCo
View on GitHub
<GenCo: Generative Co-training for Generative Adversarial Networks with Limited Data> in AAAI 2022
☆16Dec 20, 2021Updated 4 years ago
iceli1007 / FakeCLR
View on GitHub
[ECCV 2022] FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs
☆22Nov 16, 2022Updated 3 years ago
lucidrains / rvq-vae-gpt
View on GitHub
My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation
☆90Oct 11, 2024Updated last year
AI-S2-Lab / FluentEditor
View on GitHub
[InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆62Oct 23, 2024Updated last year
raven38 / OSSGAN
View on GitHub
Official implementation of OSSGAN [CVPR 2022]
☆21May 2, 2022Updated 4 years ago
mehranagh20 / AdaIMLE
View on GitHub
Official PyTorch implementation of the ICML 2023 paper "Adaptive IMLE for Few-shot Pretraining-free Generative Modelling "
☆16Feb 13, 2025Updated last year