It is a tool for TTS that automatically recognizes and segments mixed multilingual text content (97 languages).
☆21 · Feb 20, 2024 · Updated 2 years ago
Alternatives and similar repositories for LangSegment
Users that are interested in LangSegment are comparing it to the libraries listed below.
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice. ☆245 · Feb 25, 2026 · Updated last month
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW ☆15 · Dec 10, 2024 · Updated last year
- FlowMirror-HydraVox: A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens… ☆50 · Feb 17, 2026 · Updated last month
- 2024 Latest laughter detection & segmentation model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe… ☆62 · Sep 1, 2024 · Updated last year
- This repository implements a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in … ☆57 · Aug 9, 2025 · Updated 7 months ago
- Simple Kaldi recipe for forced alignment ☆11 · Jul 16, 2023 · Updated 2 years ago
- Reimplementation of Miipher ☆29 · Aug 16, 2023 · Updated 2 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P… ☆11 · Jul 7, 2022 · Updated 3 years ago
- [ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video" ☆27 · Sep 9, 2025 · Updated 6 months ago
- ☆21 · Apr 24, 2025 · Updated 11 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization ☆110 · May 20, 2025 · Updated 10 months ago
- Cantonese Video Transcribe Service ☆22 · Jul 25, 2025 · Updated 8 months ago
- Generating Sentences from Disentangled Syntactic and Semantic Spaces ☆11 · Jun 24, 2019 · Updated 6 years ago
- ☆36 · Jan 6, 2026 · Updated 2 months ago
- ☆43 · Feb 8, 2025 · Updated last year
- Code for EMNLP2021 paper “Transductive Learning for Unsupervised Text Style Transfer” ☆12 · Sep 19, 2021 · Updated 4 years ago
- PyTorch implementation of the paper: A Global-local Attention Framework for Weakly Labelled Audio Tagging. ☆13 · Feb 6, 2021 · Updated 5 years ago
- ☆13 · Jan 2, 2025 · Updated last year
- WPF canvas that lets the user drag elements on the canvas. Helps user by snapping the elements close to each other. ☆10 · Apr 18, 2024 · Updated last year
- Adapting a ConvNeXt model to audio classification on AudioSet ☆27 · Feb 19, 2025 · Updated last year
- Dataset, code and results repository for SBA-Net. ☆14 · Sep 23, 2022 · Updated 3 years ago
- flow mirror models from JZX AI Labs ☆43 · Sep 30, 2024 · Updated last year
- Soft clipping for UI components and effects under the Unity GUI framework. If you run into problems while using it, contact 12666146@qq.com by email. ☆11 · Sep 9, 2016 · Updated 9 years ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer. ☆44 · Mar 15, 2024 · Updated 2 years ago
- ☆14 · Oct 30, 2021 · Updated 4 years ago
- Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings acc… ☆24 · Jan 31, 2025 · Updated last year
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt… ☆17 · May 24, 2020 · Updated 5 years ago
- PHP development demos, utility classes, and miscellaneous odds and ends ☆11 · Sep 24, 2021 · Updated 4 years ago
- ☆111 · Mar 9, 2026 · Updated 2 weeks ago
- Uses Go's cgo feature together with the C libraries required for PHP extension development to compile and link a PHP extension for use by PHP scripts ☆11 · Jun 19, 2022 · Updated 3 years ago
- Official Account (公众号) token service: centrally manages token and jsticket, automatically fetches new ones from the official site on expiry, stores them in a database and local cache, and exposes get and refresh interfaces ☆15 · Dec 20, 2021 · Updated 4 years ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Korean C… ☆17 · Jan 9, 2024 · Updated 2 years ago
- OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models ☆64 · Feb 1, 2026 · Updated last month
- Masked ConditionaL Neural Networks ☆15 · Jul 6, 2023 · Updated 2 years ago
- Losses and decoders for end-to-end ASR and OCR ☆34 · Oct 30, 2020 · Updated 5 years ago
- Weird autoencoder experiments ☆24 · Jan 26, 2026 · Updated last month
- poorman's ar-dit tts ☆45 · Dec 31, 2025 · Updated 2 months ago
- A gopush (https://github.com/Terry-Mao/gopush-cluster/) client for iOS and MacOSX ☆21 · Jul 26, 2017 · Updated 8 years ago
- Gold-award entry in the Tencent Minigame competition (class of 2019): 夏趣 ☆10 · Feb 1, 2022 · Updated 4 years ago