xstongxue / XS-VLM-OCRLinks
XS-VLM-OCR:大模型时代的OCR工具🚀
☆65Updated 2 weeks ago
Alternatives and similar repositories for XS-VLM-OCR
Users that are interested in XS-VLM-OCR are comparing it to the libraries listed below
Sorting:
- A specialized workbench for developers to engineer high-performance AI interactions, featuring a System Prompt Architect and a Conversati…☆64Updated 4 months ago
- 一个跑在 Cloudflare Workers 上的图片生成 API 代理,帮你智能优化提示词、代理图片,安全又好用!🚀☆83Updated 7 months ago
- Gemini polling proxy service (gemini轮询代理服务)☆60Updated 3 months ago
- 一款基于 PySide6 和 ElevenLabs API 的桌面应用,能将音视频或JSON转录稿智能地转换为高质量SRT字幕。特别为中、日、韩、英等语言优化了排版规则。☆119Updated 5 months ago
- MarkMuse is an innovative tool developed using Python that elegantly converts PDF files to Markdown format. By utilizing Mistral AI's OCR…☆32Updated 7 months ago
- 腾讯元宝逆向Chat2API。☆82Updated 7 months ago
- Midjourney prompt generator☆174Updated 3 weeks ago
- HE-Music is a multi-platform online music player based on SPlayer.☆77Updated this week
- 该仓库是一个基于Mistral API的文档识别工具,支持处理PDF和图片文件(如JPG、JPEG、PNG)。它提供图形用户界面和命令行界面,能够自动保存处理结果为Markdown格式,并支持配置文件管理和批量处理文件☆88Updated 9 months ago
- The Ultimate Course Scheduling Solution.☆37Updated 8 months ago
- 一款开源、优雅、高效的 GitHub Stars 管理工具,于万千星辰中,点亮你的每一份收藏。 An Open-source, Elegant, and Efficient GitHub Stars Management Tool. Illuminating Every T…☆86Updated 4 months ago
- ☆77Updated last year
- 使用硅基流动相关模型,将您的音频转换为文字☆55Updated 4 months ago
- Http图片列表程序☆55Updated 4 months ago
- 🧡 Folo is the AI Reader☆75Updated last month
- AI识别的邮件聚合客户端 | AI Powered Email Aggregation Client☆345Updated last month
- Chrome extension that can convert web pages to PDF, supports reading mode, editing, lazy loading of pictures. -- 可以将网页转换为 PDF的Chrome扩展,支持…☆61Updated 11 months ago
- PushToTalk 是一个高性能的桌面语音输入工具。它不仅仅是一个语音转文字工具,更集成了大语言模型(LLM)能力。你可以按住 Ctrl+Win 说话,松开后应用会自动将你的语音转为文字,并根据你的设定进行润色、翻译或整理成邮件,最后自动粘贴到当前光标位置。支持豆包/千问☆55Updated this week
- A browser extension that helps you quickly understand GitHub repository code by automatically adding DeepWiki and GitDiagram buttons.☆24Updated 8 months ago
- 一个简约无广,专注新闻的聚合体,完美适配Web端,手机端,《今日时事》为您实时聚合各大平台最新资讯,按时间序列 展示热点新闻动态,包含头条、百度、知乎、哔哩哔哩、豆瓣、微博、贴吧、汽车之家、虎扑、Github、抖音、懂车帝等各种消息,给您提供极致的专注阅读的信息流体验!☆252Updated 2 months ago
- ☆94Updated 8 months ago
- 一个简洁且优秀的描述是:这是一款在任何网页上实现无缝语音转文字的 Chrome 扩展,使用先进的 ASR API。☆40Updated 2 months ago
- 重置你的设备码,更新和设置token☆94Updated 8 months ago
- This server acts as a central hub for Model Context Protocol (MCP) resource servers.☆178Updated 5 months ago
- 将你的项目一键部署到huggingface spaces☆73Updated 5 months ago
- E3 Sub Car, Base GraphAPI☆40Updated last month
- 接口已恢复 | 基于 https://chat.qwenlm.ai/ 的OCR。测试Token:可见readme 。已支持Docker一键部署,切换分支可见☆258Updated 7 months ago
- ZtoApi 是一个高性能的 OpenAI 兼容 API 代理服务器,专为 Z.ai 的 GLM-4.5 和 GLM-4.5V 模型设计。使用 Deno 原生 HTTP API 实现,支持完整的流式和非流式响应,提供实时监控 Dashboard,让你能够无缝地将 Z.ai …☆118Updated 3 months ago
- 部署于 CloudFlare Pages 的 AI 语音服务,使用 siliconflow 的语音转录模型 SenseVoiceSmall 和 openai 的 gpt-4o-mini-tts☆45Updated 3 months ago
- ☆121Updated 2 weeks ago