arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily
☆15 · Jan 6, 2025 · Updated last year
Alternatives and similar repositories for nlp_arxiv_daily
Users who are interested in nlp_arxiv_daily are comparing it to the libraries listed below.
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment. ☆25 · Nov 25, 2025 · Updated 3 months ago
- Official implementation of WildFX Dataset Generating pipeline. ☆15 · Oct 21, 2025 · Updated 4 months ago
- Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better u… ☆26 · Apr 19, 2024 · Updated last year
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks ☆22 · Jan 19, 2026 · Updated last month
- A pytorch implementation of D3Net. ☆11 · Aug 8, 2021 · Updated 4 years ago
- A python algorithm to change the pitch of the voice in real time ☆13 · Dec 13, 2020 · Updated 5 years ago
- Music onset detector from Dance Dance Convolution packaged as a lightweight PyTorch module ☆43 · Sep 22, 2023 · Updated 2 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols ☆17 · Nov 19, 2025 · Updated 3 months ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation ☆45 · Mar 27, 2024 · Updated last year
- Automatic subordinate clause extractor ☆11 · Jul 7, 2022 · Updated 3 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming ☆15 · Aug 20, 2024 · Updated last year
- Official repo for ICCV 2025 paper "Is Less More? Exploring Token Condensation as Training-free Test-time Adaptation" ☆17 · Sep 3, 2025 · Updated 6 months ago
- Russian phonetical transcription ☆11 · Nov 19, 2025 · Updated 3 months ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models ☆27 · Feb 13, 2026 · Updated 3 weeks ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte… ☆43 · Mar 3, 2025 · Updated last year
- ☆78 · Sep 25, 2025 · Updated 5 months ago
- semantic tokenizer for speech and music ☆21 · Jul 6, 2025 · Updated 8 months ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline. ☆17 · May 9, 2025 · Updated 9 months ago
- ☆11 · Aug 13, 2024 · Updated last year
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge ☆21 · Jul 25, 2022 · Updated 3 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula… ☆11 · Oct 18, 2022 · Updated 3 years ago
- DISCO: Comprehensive and Explainable Disinformation Detection, CIKM 2022 ☆10 · May 5, 2023 · Updated 2 years ago
- [COLING22] Text-to-Text Extraction and Verbalization of Biomedical Event Graphs ☆10 · Nov 5, 2022 · Updated 3 years ago
- A lightweight muji-moe chatbot created by Reecho.ai. ☆13 · Oct 1, 2024 · Updated last year
- Today's News Online (TNO) is a news aggregation system that takes in news sources of varying types and provides a single location for cli… ☆17 · Updated this week
- Sound2Synth Plug-Ins ☆13 · Jul 28, 2022 · Updated 3 years ago
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an… ☆11 · Apr 30, 2024 · Updated last year
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection" ☆12 · Oct 20, 2024 · Updated last year
- ☆11 · Nov 7, 2024 · Updated last year
- Open-source, knowledge-grounded conversational assistant ☆14 · Jun 30, 2025 · Updated 8 months ago
- ☆12 · Jan 7, 2020 · Updated 6 years ago
- Repo for the Unified Verbs Index Project ☆12 · Feb 3, 2026 · Updated last month
- ☆14 · Jun 11, 2025 · Updated 8 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆12 · Mar 11, 2025 · Updated 11 months ago
- Details of the datasets for Few-shot class-incremental audio classification ☆11 · Dec 6, 2023 · Updated 2 years ago
- ☆13 · Nov 22, 2022 · Updated 3 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ☆28 · Nov 7, 2025 · Updated 3 months ago
- Basic library for spatial audio SOFA files ☆12 · Sep 29, 2020 · Updated 5 years ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th… ☆11 · Feb 23, 2024 · Updated 2 years ago