LlmKira / fast-langdetect
⚡️ 80x faster language detection with Fasttext | Split text by language for TTS
☆120Updated last month
Related projects ⓘ
Alternatives and complementary repositories for fast-langdetect
- A streamlined, user-friendly JSON streaming preprocessor, crafted in Python.☆73Updated last month
- Evaluation for AI apps and agent☆35Updated 9 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆71Updated 6 months ago
- Conversational Retrieval Evaluation Dataset☆91Updated last month
- ☆50Updated 2 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆42Updated 2 months ago
- ☆249Updated 3 months ago
- ☆32Updated 9 months ago
- ☆98Updated last week
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆210Updated 2 months ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆147Updated last week
- Extract structured text from pdfs quickly☆334Updated 2 weeks ago
- LLM steganography with minimum-entropy coupling - Hiding encrypted messages in natural language.☆72Updated 2 months ago
- Speech Diarization for scrum automation☆97Updated last year
- Chat with any website on your local machine☆71Updated 4 months ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆25Updated 10 months ago
- Prompt 工程师利器,可同时比较多个 Prompts 在多个 LLM 模型上的效果☆97Updated last year
- A prompting library☆122Updated last month
- 用文本编辑器剪视频☆36Updated last year
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and langua☆35Updated last week
- 🔧 Repair JSON!Solution for JSON Anomalies from LLMs.☆179Updated 3 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆23Updated 4 months ago
- Turn any OCR models into online inference API endpoint 🚀 🌖☆50Updated last year
- TF-ID: Table/Figure IDentifier for academic papers☆220Updated 3 months ago
- MacOS Agent: A Simplified Assistant for Your Mac☆53Updated 3 months ago
- ☆298Updated 2 weeks ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated last month
- Lightweight, performant, deep table extraction☆315Updated last week
- Open source inference code for Rev's model☆329Updated last week