Implementations of Recent Papers in Computer Vision
☆38 · Sep 20, 2022 · Updated 3 years ago
Alternatives and similar repositories for ComVEX
Users that are interested in ComVEX are comparing it to the libraries listed below
- ☆10 · Apr 8, 2024 · Updated last year
- A punctuation transcription model that automatically adds punctuation marks to one or more unpunctuated sentences. ☆15 · Aug 6, 2020 · Updated 5 years ago
- Official implementation of OSSGAN [CVPR 2022] ☆21 · May 2, 2022 · Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation ☆12 · Jan 27, 2023 · Updated 3 years ago
- Code release for Grad-CAM Guided Attention Module for Fine-grained Visual Classification (MLSP 2022) ☆13 · Aug 25, 2021 · Updated 4 years ago
- Real-time MelGAN on CPU ☆13 · Dec 3, 2019 · Updated 6 years ago
- ☆25 · Apr 24, 2019 · Updated 6 years ago
- CoaT: Co-Scale Conv-Attentional Image Transformers ☆16 · Apr 20, 2021 · Updated 4 years ago
- Paper list on radiology report generation, plus some medical image captioning ☆11 · Oct 5, 2021 · Updated 4 years ago
- Deep neural architecture research framework ☆12 · Mar 24, 2023 · Updated 2 years ago
- Code & demo for the animation of still facial landmarks from an initial pose. ☆15 · Jan 19, 2023 · Updated 3 years ago
- ☆23 · Jun 13, 2023 · Updated 2 years ago
- PyTorch-based speaker embedding model ☆16 · Apr 13, 2024 · Updated last year
- Follows NVIDIA's version, simplified and with data-parallel support. ☆13 · Sep 26, 2019 · Updated 6 years ago
- Code for "Distribution-based Emotion Recognition in Conversation" ☆19 · Feb 6, 2023 · Updated 3 years ago
- Code release for "Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification" ☆16 · Nov 8, 2021 · Updated 4 years ago
- Lightweight Transformer for Multi-modal Tasks ☆16 · Dec 9, 2022 · Updated 3 years ago
- ☆21 · Jul 2, 2022 · Updated 3 years ago
- Reproducible research and reusable acyclic workflows in Python. Execute code on HPC systems as if you executed them on your personal comp… ☆18 · Jan 11, 2022 · Updated 4 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 · Dec 10, 2020 · Updated 5 years ago
- ☆18 · Jan 17, 2022 · Updated 4 years ago
- ☆15 · May 8, 2021 · Updated 4 years ago
- ☆14 · Feb 24, 2021 · Updated 5 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer ☆44 · May 9, 2023 · Updated 2 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models' ☆20 · Jul 24, 2024 · Updated last year
- Fast Inference in Denoising Diffusion Models via MMD Finetuning ☆18 · Dec 4, 2023 · Updated 2 years ago
- TPSE-GST Tacotron2 ☆14 · May 1, 2019 · Updated 6 years ago
- Parallel waveform generation with DiffusionGAN ☆17 · Mar 26, 2022 · Updated 3 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE" ☆44 · Apr 10, 2023 · Updated 2 years ago
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods ☆20 · Jan 12, 2023 · Updated 3 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch ☆22 · Jan 18, 2023 · Updated 3 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks ☆20 · Feb 6, 2021 · Updated 5 years ago
- ☆17 · Aug 27, 2025 · Updated 6 months ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information" ☆45 · Jul 10, 2019 · Updated 6 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder ☆45 · Apr 18, 2023 · Updated 2 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph" ☆54 · Oct 19, 2022 · Updated 3 years ago
- Temporary anonymous version ☆22 · Mar 20, 2024 · Updated last year
- Open-source code for research work published on arXiv: https://arxiv.org/abs/2106.02689 ☆54 · Feb 14, 2022 · Updated 4 years ago
- Speech synthesis using LPC ☆23 · Jun 5, 2021 · Updated 4 years ago