Official implementation of StreamFlow: Streamlined Multi-Frame Optical Flow Estimation for Video Sequences, NeurIPS '24
☆40Mar 10, 2025Updated 11 months ago
Alternatives and similar repositories for StreamFlow
Users who are interested in StreamFlow are comparing it to the repositories listed below
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Feb 7, 2026Updated 3 weeks ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- [CVPR 2024] MemFlow: Optical Flow Estimation and Prediction with Memory☆224Oct 6, 2025Updated 4 months ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 2 years ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆35Oct 26, 2025Updated 4 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 7 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Feb 18, 2026Updated last week
- Official implementation of the paper: "NeoBabel: A Multilingual Open Tower for Visual Generation"☆23Aug 4, 2025Updated 6 months ago
- DVC-P: Deep Video Compression with Perceptual Optimizations☆15Oct 18, 2021Updated 4 years ago
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform☆18May 17, 2024Updated last year
- [CVPR 2025] ZeroMSF: Zero-shot Monocular Scene Flow Estimation in the Wild☆36Sep 16, 2025Updated 5 months ago
- Re-implementation of SLAM-ASR paper's experiment, using Phi-2 and Hubert☆21Jun 14, 2024Updated last year
- GMFSS_union function for VapourSynth☆22Apr 8, 2023Updated 2 years ago
- Video Frame Interpolation Via Videoflow☆19Oct 4, 2023Updated 2 years ago
- Fast Defocus Map Estimation☆18Oct 14, 2016Updated 9 years ago
- [ICME 2024] Official Repository for The Paper, PianoBART: Symbolic Piano Music Understanding and Generating with Large-Scale Pre-Training☆22Aug 17, 2025Updated 6 months ago
- Official code for Deep Bayesian Video Frame Interpolation (ECCV2022)☆18May 29, 2023Updated 2 years ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- Multiband version of HIFIGAN☆19Nov 6, 2020Updated 5 years ago
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- [IROS'24] V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting☆25Aug 8, 2025Updated 6 months ago
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- Unofficial implementation of wavenext vocoder☆59Aug 28, 2024Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 10 months ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆35Oct 23, 2025Updated 4 months ago
- Voice conversion with just linear regression.☆35Sep 25, 2025Updated 5 months ago
- Singing voice conversion based on FreeVC☆21Dec 16, 2022Updated 3 years ago