The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!
☆119Oct 24, 2025Updated 4 months ago
Alternatives and similar repositories for NewEraAI-Papers
Users that are interested in NewEraAI-Papers are comparing it to the libraries listed below
Sorting:
- FG 2024 Papers: Explore a comprehensive collection of research papers presented at one of the premier conferences on automatic face and g…☆16May 18, 2024Updated last year
- ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore t…☆517May 5, 2025Updated 10 months ago
- Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better u…☆26Apr 19, 2024Updated last year
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆687Dec 25, 2024Updated last year
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆11Dec 19, 2025Updated 2 months ago
- This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".☆11Dec 2, 2024Updated last year
- Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings acc…☆24Jan 31, 2025Updated last year
- ☆10May 15, 2021Updated 4 years ago
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 6 months ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Jun 11, 2024Updated last year
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Oct 11, 2022Updated 3 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆45May 18, 2023Updated 2 years ago
- ISMIR 2023 Papers: A complete collection of influential and exciting research papers from the ISMIR 2023 conference.☆106Dec 2, 2023Updated 2 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- This repository contains the official implementation of "A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular Data" (under revie…☆17Jul 10, 2024Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆46May 16, 2025Updated 9 months ago
- PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-po…☆17Dec 11, 2022Updated 3 years ago
- Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"☆23Feb 20, 2025Updated last year
- ☆28Jul 31, 2025Updated 7 months ago
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…☆23Apr 23, 2024Updated last year
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆108Jan 17, 2025Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion☆86Aug 3, 2023Updated 2 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆21Dec 20, 2023Updated 2 years ago
- Algorithms for Intelligent Assessment of Human Personality Traits based on His Multimodal Data for ranking potential candidates to perfo…☆58Dec 5, 2025Updated 3 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 4 years ago
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Aug 10, 2023Updated 2 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- Script to perform statistical significance test between ASR hypotheses.☆22Aug 13, 2017Updated 8 years ago
- Repository for the LLM course☆30Jan 4, 2026Updated 2 months ago
- A curated list of speaker-embedding speaker-verification, speaker-identification resources.☆52Aug 12, 2021Updated 4 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- ☆60Oct 22, 2025Updated 4 months ago
- Solos: A Dataset for Audio-Visual Music Analysis☆24Feb 17, 2023Updated 3 years ago