DoodleBears / split-lang
✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux
☆53Updated 2 months ago
Alternatives and similar repositories for split-lang:
Users that are interested in split-lang are comparing it to the libraries listed below
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language☆194Updated last month
- The inference code of RVC-Boss/GPT-SoVITS that can be developer-friendly.☆15Updated 7 months ago
- A streamlined, user-friendly JSON streaming preprocessor, crafted in Python.☆99Updated 7 months ago
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆17Updated 2 weeks ago
- Turn any OCR models into online inference API endpoint 🚀 🌖☆55Updated last month
- A lightweight end-to-end text-to-speech model☆113Updated 2 months ago
- 我从动漫中学习到的知识和人生感悟☆16Updated 2 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆160Updated 2 weeks ago
- Archived 🚧|🌻Building ChatBot with LLMs.🌻 | Using async requests. | 具有多 LLM 适应性 | 通用大语言模型代理端框架 |多人称全类型注解☆40Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- ☆13Updated 2 years ago
- A transformer-based multimodal model for music.☆28Updated 8 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆90Updated 7 months ago
- ☆19Updated last year
- A simple svs labeling tool☆13Updated 8 months ago
- Workflow Defined Engine☆24Updated 3 weeks ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated last month
- parallel fetch☆128Updated last week
- An English-to-Cantonese machine translation model☆49Updated last month
- Evaluation for AI apps and agent☆41Updated last year
- Download full or partial git-lfs repos without temporarily using 2x disk space☆31Updated last year
- Chinese tokens in tiktoken tokenizers.☆32Updated 11 months ago
- Turn PostgreSQL into your search engine in a Pythonic way.☆41Updated 3 weeks ago
- ☆32Updated last year
- Tool to allow parsing large JSON files without laoding into memory. Developed in Rust with adapters in other programming langauges for ea…☆28Updated last year
- 🌻 VITS ONNX TTS server designed for fast inference 🔥☆127Updated 3 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆34Updated 6 months ago
- A unified interface for multiple Text-to-Speech (TTS) providers.☆268Updated 4 months ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆26Updated last year