OPPO-Mente-Lab / fst-time-nluView external linksLinks
Extracting time features from text using a Finite State Transducer (FST) in Python
☆53Dec 1, 2025Updated 2 months ago
Alternatives and similar repositories for fst-time-nlu
Users that are interested in fst-time-nlu are comparing it to the libraries listed below
Sorting:
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 7 months ago
- noise reduction☆17Jul 3, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 9 months ago
- faster inference☆28Jan 20, 2025Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 10 months ago
- ☆36Sep 6, 2025Updated 5 months ago
- [AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny 300M model!☆86Jan 29, 2026Updated 2 weeks ago
- unofficial Split Mean Flow Implementation from bytedance☆66Aug 12, 2025Updated 6 months ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆14Jan 6, 2025Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆21Jan 19, 2026Updated 3 weeks ago
- Text-To-Speech for NotebookLM☆37Jul 20, 2025Updated 6 months ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆62Sep 5, 2025Updated 5 months ago
- SWIPE Algorithm implementation in Python☆11Dec 24, 2016Updated 9 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- 语音识别数字0-9☆13Jul 16, 2019Updated 6 years ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆43Oct 28, 2024Updated last year
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆63Dec 26, 2025Updated last month
- simple demo to visualize the details of one of the basic foundations of deep learning: convolution☆12Feb 22, 2019Updated 6 years ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 9 months ago
- BMJ's Audo Programming☆10Jul 23, 2021Updated 4 years ago
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 2 months ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- ☆12Apr 30, 2024Updated last year
- ☆11Nov 7, 2024Updated last year
- bert蒸馏实践,包含BiLSTM蒸馏BERT和TinyBert☆13Apr 23, 2022Updated 3 years ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated 8 months ago
- Basic library for spatial audio SOFA files☆12Sep 29, 2020Updated 5 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- Sound2Synth Plug-Ins☆13Jul 28, 2022Updated 3 years ago
- This is the official repository of Emotion-Driven Melody Harmonization via Melodic Variation and Functional Representation.☆12Sep 25, 2024Updated last year
- A lightweight muji-moe chatbot created by Reecho.ai.☆12Oct 1, 2024Updated last year
- Improving Symbolic Music Generation with Inference-Time Alignment☆20Aug 2, 2025Updated 6 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- ☆97Oct 16, 2025Updated 4 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 9 months ago
- Supplementary material for the ISMIR 2020 paper: “Deconstruct, Analyse, Reconstruct: how to improve tempo, beat, and downbeat estimation”…☆11Mar 2, 2021Updated 4 years ago