alibaba-damo-academy / SpokenNLP
A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.
☆124 · May 27, 2025 · Updated 8 months ago
Alternatives and similar repositories for SpokenNLP
Users that are interested in SpokenNLP are comparing it to the libraries listed below
- Official code for ICLR 2022 paper: "PoNet: Pooling Network for Efficient Token Mixing in Long Sequences". ☆33 · May 23, 2023 · Updated 2 years ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George… ☆20 · Oct 11, 2024 · Updated last year
- Implementation of the paper: Text Segmentation as a Supervised Learning Task ☆265 · Oct 2, 2019 · Updated 6 years ago
- ☆33 · May 16, 2023 · Updated 2 years ago
- A library of speech gadgets. ☆14 · Oct 15, 2022 · Updated 3 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX. ☆15 · Dec 23, 2024 · Updated last year
- ☆15 · Aug 10, 2022 · Updated 3 years ago
- ☆23 · Jul 13, 2021 · Updated 4 years ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech 2024 ☆16 · Nov 19, 2024 · Updated last year
- A pipeline architecture for temporal segmentation of video lectures. ☆11 · Sep 8, 2020 · Updated 5 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆15 · May 16, 2025 · Updated 9 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 · Apr 13, 2022 · Updated 3 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification ☆30 · Mar 24, 2023 · Updated 2 years ago
- Code for the paper at ACL 2018 ☆30 · Sep 30, 2018 · Updated 7 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 · Oct 8, 2023 · Updated 2 years ago
- Official code of ElasticAST (Interspeech 2024 paper) ☆34 · Jul 30, 2024 · Updated last year
- Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy. ☆20 · Jan 5, 2026 · Updated last month
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆68 · Jan 5, 2026 · Updated last month
- noise reduction ☆17 · Jul 3, 2024 · Updated last year
- ☆16 · Dec 23, 2021 · Updated 4 years ago
- ☆14 · Mar 15, 2022 · Updated 3 years ago
- ☆106 · Jul 6, 2021 · Updated 4 years ago
- one script for xls-r/xlsr/whisper fine-tuning ☆42 · Jun 29, 2023 · Updated 2 years ago
- ChineseBert for Chinese spelling correction ☆43 · Mar 14, 2023 · Updated 2 years ago
- Hierarchical entity typing via multi-level learning to rank ☆12 · Oct 13, 2020 · Updated 5 years ago
- Crowdsourced and Automatic Speech Prominence Estimation ☆24 · Apr 12, 2024 · Updated last year
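- CCKS financial event subject extraction ☆74 · Oct 21, 2020 · Updated 5 years ago
- Second-place solution for the Intelligent Human-Computer Interaction Natural Language Understanding track of the 2021 CCF BDCI National Information Retrieval Challenge Cup (CCIR-Cup) ☆24 · Oct 27, 2021 · Updated 4 years ago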
- Common Code Workflow tutorial on Theano ☆16 · Oct 29, 2015 · Updated 10 years ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5 ☆19 · Nov 29, 2022 · Updated 3 years ago
- The source code of paper "An Effective System for Multi-format Information Extraction". ☆18 · Aug 14, 2021 · Updated 4 years ago
- MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music ☆24 · Jan 7, 2026 · Updated last month
- Generative Modeling with Bayesian Sample Inference ☆24 · May 17, 2025 · Updated 8 months ago
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil… ☆40 · Jun 17, 2025 · Updated 7 months ago
- FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener… ☆441 · Jan 25, 2024 · Updated 2 years ago
- Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension ☆171 · Apr 20, 2022 · Updated 3 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆104 · Mar 30, 2025 · Updated 10 months ago
- [NeurIPS 2023] CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews ☆25 · Jul 29, 2024 · Updated last year
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum… ☆51 · Mar 2, 2023 · Updated 2 years ago