kv1830 / fast_pdf_trans
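Fast pdf translate is a PDF translation tool. It uses MinerU to convert the PDF to Markdown, splits the Markdown into chunks, sends the chunks to a large language model for translation, then assembles the translated results and generates the output PDF with pypandoc.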
☆37Updated 8 months ago
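The split, translate, and assemble steps in the description above can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: `split_markdown`, `translate_markdown`, the `max_chars` limit, and the `translate` callable are all hypothetical names, and the MinerU conversion and pypandoc PDF generation steps are left out.

```python
from typing import Callable, List


def split_markdown(text: str, max_chars: int = 2000) -> List[str]:
    """Group blank-line-separated blocks into chunks of at most max_chars.

    Hypothetical helper: a single block longer than max_chars is kept whole
    rather than being cut mid-paragraph.
    """
    chunks: List[str] = []
    current: List[str] = []
    size = 0
    for block in text.split("\n\n"):
        # +2 accounts for the blank line that rejoins blocks within a chunk.
        added = len(block) + (2 if current else 0)
        if current and size + added > max_chars:
            chunks.append("\n\n".join(current))
            current, size = [], 0
            added = len(block)
        current.append(block)
        size += added
    if current:
        chunks.append("\n\n".join(current))
    return chunks


def translate_markdown(
    text: str, translate: Callable[[str], str], max_chars: int = 2000
) -> str:
    """Translate each chunk with the supplied callable and reassemble in order.

    `translate` stands in for a call to a large language model (assumption).
    """
    return "\n\n".join(translate(chunk) for chunk in split_markdown(text, max_chars))
```

Splitting on blank lines keeps paragraphs intact, but it would also split a fenced code block that contains blank lines, so a real splitter would likely need to be aware of Markdown structure.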
Alternatives and similar repositories for fast_pdf_trans
Users that are interested in fast_pdf_trans are comparing it to the libraries listed below
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents!☆219Updated last month
- ☆133Updated 8 months ago
- To try textin document parsing, visit https://cc.co/16YSIy☆22Updated last year
- ☆113Updated last year
- Layout analysis for Chinese and English documents☆256Updated 4 months ago
- A Python Package to Access World-Class Generative Models☆129Updated last year
- ☆27Updated last year
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆48Updated 8 months ago
- Evaluation for AI apps and agents☆43Updated last year
- ☆67Updated last year
- Converted the Jina Tokenizer regex pattern to python.☆26Updated last year
- TianGong-AI-Unstructure☆69Updated 2 months ago
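- 📝 Extracts content from document images and reproduces it one-to-one in Word or Txt output for further use or processing. Planned: support for PDF/image input with output in JSON, Txt, Word, and Markdown formats.☆207Updated last year
- PDF data for Chinese papers, securities documents, and financial reports☆35Updated last year
- ✨🦋 illufly (幻蝶): a self-evolving agent based on memory distillation and document retrieval☆75Updated 2 weeks ago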
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆126Updated 3 months ago
- Automatically tracks trending GitHub repos, updated daily☆49Updated this week
- The newest version of llama3, with source code explained line by line in Chinese☆22Updated last year
- As the name suggests: a hand-built RAG☆130Updated last year
- Supports BM25 + vector☆29Updated 6 months ago
- Python package to parse PDFs with different parsers☆209Updated 2 months ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆47Updated 11 months ago
- bisheng-unstructured library☆56Updated 6 months ago
- General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser☆47Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆68Updated 2 years ago
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆73Updated last year
- YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…☆34Updated 4 months ago
- Accelerates vector generation using an ONNX model☆18Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 6 months ago
- To try textin document parsing, visit https://cc.co/16YSIy☆123Updated 5 months ago