LlmKira / fast-langdetect
⚡️ 80x faster language detection with Fasttext | Split text by language for TTS
☆127Updated last month
Related projects ⓘ
Alternatives and complementary repositories for fast-langdetect
- Evaluation for AI apps and agent☆35Updated 10 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆73Updated 7 months ago
- A streamlined, user-friendly JSON streaming preprocessor, crafted in Python.☆75Updated 2 months ago
- ☆50Updated 3 months ago
- Conversational Retrieval Evaluation Dataset☆91Updated last month
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆212Updated 2 months ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆25Updated 10 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆42Updated 3 months ago
- ☆32Updated 9 months ago
- 🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。☆64Updated this week
- LLM steganography with minimum-entropy coupling - Hiding encrypted messages in natural language.☆73Updated 2 months ago
- ☆34Updated 2 weeks ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆152Updated 3 weeks ago
- 🔧 Repair JSON!Solution for JSON Anomalies from LLMs.☆192Updated 4 months ago
- Speech Diarization for scrum automation☆97Updated last year
- ☆264Updated last week
- MacOS Agent: A Simplified Assistant for Your Mac☆54Updated 3 months ago