The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!
☆118Oct 24, 2025Updated 5 months ago
Alternatives and similar repositories for NewEraAI-Papers
Users that are interested in NewEraAI-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FG 2024 Papers: Explore a comprehensive collection of research papers presented at one of the premier conferences on automatic face and g…☆15May 18, 2024Updated last year
- ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore t…☆523May 5, 2025Updated 10 months ago
- Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better u…☆26Apr 19, 2024Updated last year
- WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in co…☆99Sep 1, 2024Updated last year
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆686Dec 25, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ISMIR 2023 Papers: A complete collection of influential and exciting research papers from the ISMIR 2023 conference.☆106Dec 2, 2023Updated 2 years ago
- ☆10May 15, 2021Updated 4 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Oct 11, 2022Updated 3 years ago
- Algorithms for Intelligent Assessment of Human Personality Traits based on His Multimodal Data for ranking potential candidates to perfo…☆60Dec 5, 2025Updated 3 months ago
- [COLING 2024] SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity☆13May 8, 2024Updated last year
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆12Dec 19, 2025Updated 3 months ago
- This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".☆11Dec 2, 2024Updated last year
- CMU multilingual speech repository☆30Apr 15, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Script to perform statistical significance test between ASR hypotheses.☆23Aug 13, 2017Updated 8 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Jun 11, 2024Updated last year
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings acc…☆24Jan 31, 2025Updated last year
- AI grand challenge 2020 Repo (Speech Recognition Track)☆22Oct 17, 2022Updated 3 years ago
- Official PyTorch implementation for "Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech …☆33May 11, 2025Updated 10 months ago
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…☆23Apr 23, 2024Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 5 months ago
- ACM MM 2022 - PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding☆11Aug 12, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-po…☆17Dec 11, 2022Updated 3 years ago
- This repository contains the official implementation of "A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular Data" (under revie…☆17Jul 10, 2024Updated last year
- ☆64May 23, 2022Updated 3 years ago
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 7 months ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆21Dec 20, 2023Updated 2 years ago
- Solos: A Dataset for Audio-Visual Music Analysis☆24Feb 17, 2023Updated 3 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆45May 18, 2023Updated 2 years ago
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆13Nov 28, 2024Updated last year
- DPStyler: Dynamic PromptStyler for Source-Free Domain Generalization☆17Jul 26, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Unified Bayesian Estimator of EEG Reference at Infinity: rREST (Regularized Reference Electrode Standardization Technique)☆12Jun 27, 2022Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Official implementation for FlowSep☆70Jan 2, 2025Updated last year
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆124Sep 13, 2024Updated last year
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆198Dec 13, 2024Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion☆86Aug 3, 2023Updated 2 years ago
- This toolbox extends frequency domain quantitative electroencephalography (qEEG) methods pursuing higher sensitivity to detect Brain Deve…☆13Jul 18, 2025Updated 8 months ago