Airmomo / SmolDocling-256M-WebUILinks
WebUI for using SmolDocling-256M-preview
☆13Updated 6 months ago
Alternatives and similar repositories for SmolDocling-256M-WebUI
Users that are interested in SmolDocling-256M-WebUI are comparing it to the libraries listed below
Sorting:
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆128Updated 6 months ago
- 基于Linly-Talker数字人改版的教育系统,包含网课总结、数字人对话、Chatbot对话,项目可在autodl部署☆33Updated last year
- GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能☆180Updated 2 months ago
- bisheng-unstructured library☆55Updated 4 months ago
- ☆180Updated 8 months ago
- Sample GLM4V + ChatTTS AI assistant☆85Updated last year
- SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆166Updated 6 months ago
- MinerU API server☆74Updated 9 months ago
- ☆27Updated 4 months ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆206Updated 11 months ago
- offline 2d digitalhuman demo for edge devices (android/ios/etc.)☆82Updated last year
- ✨🦋 illufly - 【幻蝶】基于记忆蒸馏、资料检索的自我进化智能体☆72Updated 4 months ago
- 一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models…☆63Updated 2 weeks ago
- 视频理解:千问视频多模态模型 & Dify☆65Updated last year
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated last year
- Easegen is an open-source digital human course creation platform offering comprehensive solutions from course production and video manage…☆242Updated 5 months ago
- mirror of https://huggingface.co/spaces/enzostvs/deepsite☆77Updated 6 months ago
- 本项目借助飞桨平台,构建起一套创新的多模型协同系统,实现 PDF 文件到 Markdown 文件的高效、精准转换。☆26Updated 6 months ago
- generate ppt with llm☆99Updated last year
- Using GPT to parse PDF☆100Updated last year
- an open high-performance Optical Character Recognition (OCR) toolkit☆295Updated 2 months ago
- 文本语料转训练集工具,txt转dataset☆94Updated last year
- ☆269Updated 9 months ago
- 一个用于F5-TTS的api和webui项目☆64Updated 9 months ago
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆105Updated last year
- Real time faster whisper gradio☆26Updated last month
- lang2openai☆73Updated 11 months ago
- ☆29Updated last year
- 基于ChatGLM2带的openai_api.py修改支持ChatGLM3。☆20Updated last year
- project page for ChatAnyone☆113Updated 6 months ago