☆10Jan 5, 2020Updated 6 years ago
Alternatives and similar repositories for voca-pytorch
Users that are interested in voca-pytorch are comparing it to the libraries listed below
Sorting:
- Frequency tracking in time-frequency representations☆13Jan 19, 2021Updated 5 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year
- Time frequency ridge detection based on relevant ridge portions☆11Aug 17, 2023Updated 2 years ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- ☆13Aug 7, 2025Updated 7 months ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated last month
- A Pytorch-Lightning Implementation of Transformer Network☆11Oct 22, 2020Updated 5 years ago
- Extension to operate Matlab in VS Code☆10Sep 15, 2022Updated 3 years ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆13Sep 6, 2024Updated last year
- Pytorch code for paper: MVF-Net: Multi-View 3D Face Morphable Model Regression☆10May 28, 2019Updated 6 years ago
- Python package for P2 (Path Planning), a masked diffusion model sampling method for sequence generation (protein, text, etc.).☆23Aug 19, 2025Updated 6 months ago
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆16Oct 4, 2025Updated 5 months ago
- LLaVA-Next for STVG☆18Dec 5, 2025Updated 3 months ago
- Mamba SSM architecture that supports training on variable-length sequences☆12Sep 1, 2025Updated 6 months ago
- [ICTC'24] - "Voice-Based Age and Gender Recognition: A Comparative Study of LSTM, RezoNet and Hybrid CNNs-BiLSTM Architecture" by Nhut Mi…☆10Jan 16, 2025Updated last year
- ☆12Jun 21, 2023Updated 2 years ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆12Aug 28, 2023Updated 2 years ago
- ☆11Nov 5, 2025Updated 4 months ago
- An exploration of LLM steering☆24Jun 15, 2024Updated last year
- Examples of how to use API of MVSep service☆29Jun 21, 2025Updated 8 months ago
- 2023 Spring SNU Computer Vision Project☆14Jun 13, 2023Updated 2 years ago
- Reproducible research code for the experiments presented in our article "Kara1k: a karaoke dataset for cover song identification and sing…☆10Jan 9, 2018Updated 8 years ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- ☆13Sep 26, 2023Updated 2 years ago
- A system that combines object detection and stereo depth calculations☆10Dec 14, 2018Updated 7 years ago
- ☆10Nov 16, 2021Updated 4 years ago
- Human age estimation using deep neural networks (Keras)☆14Aug 10, 2023Updated 2 years ago
- Python library for searching lyrics on Musixmatch, Genius and letras.mus.br.☆10Oct 10, 2024Updated last year
- JPEG-LM: LLMs as Image Generators with Canonical Codec Representations☆14Sep 29, 2024Updated last year
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- ☆14Jun 27, 2023Updated 2 years ago
- Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"☆15Oct 25, 2024Updated last year
- ☆12Dec 29, 2023Updated 2 years ago
- Official code of paper IntrinsicNGP☆15Sep 25, 2023Updated 2 years ago
- Hash Encoding, Point Cloud Reconstruction, Multi-view Reconstruction, CVM2023, (CVMJ)☆17Mar 12, 2024Updated last year
- 2D/3D physics engine for games written in Rust☆12Mar 7, 2022Updated 4 years ago
- CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction☆18Oct 20, 2025Updated 4 months ago
- ☆12Mar 18, 2024Updated last year