Overview of pipelines related to PDF to Markdown document processing.
☆94Oct 31, 2025Updated 6 months ago
Alternatives and similar repositories for pdf-extraction-agenda
Users that are interested in pdf-extraction-agenda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 使用FastAPI构建发票识别系统后端服务,支持并发。使用ERFNet模型训练发票轮廓检测,进行畸变矫正,OCR识别,模板匹配,支持倾斜发票识别。准确率99.9%。☆13May 8, 2025Updated last year
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated last year
- ☆12Jul 13, 2023Updated 2 years ago
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".☆13Feb 27, 2024Updated 2 years ago
- A curated list of my GitHub stars!☆17May 15, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Fork of RecurrentGPT with modifications☆10Sep 18, 2024Updated last year
- Identify VMess packets in network traffic☆13Nov 21, 2022Updated 3 years ago
- ☆16Nov 9, 2025Updated 6 months ago
- Compute benchmark of table structure recognition.☆28Dec 2, 2025Updated 5 months ago
- https://arxiv.org/abs/2201.06499☆29Apr 9, 2024Updated 2 years ago
- f("A1") = 𓀀; also A1.png☆12May 15, 2026Updated last week
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".☆24Oct 28, 2024Updated last year
- An interactive RAG agent built with LangChain and MongoDB Atlas. Manage your knowledge base, switch embedding models, and tune retrieval …☆42Dec 19, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.☆32May 24, 2025Updated 11 months ago
- Designed an android application using android studio 1.3, java, xml. This application is a digital version of the actual Monopoly game. I…☆11Sep 25, 2021Updated 4 years ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆29Apr 16, 2023Updated 3 years ago
- Script and patches for building TrebleDroid AOSP☆11Aug 28, 2024Updated last year
- ☆34Apr 14, 2025Updated last year
- Library for industrial alignment.☆403May 8, 2026Updated 2 weeks ago
- В этом репозитории содержатся примеры реализации вопрос-ответного бота по документации на базе YandexGPT и других сервисов Yandex Cloud☆33Feb 12, 2024Updated 2 years ago
- RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis☆12Sep 2, 2015Updated 10 years ago
- ☆28Oct 14, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Most basic AI Assistant demo derived from the DeepPavlov Dream AI Assistant.☆14May 22, 2023Updated 3 years ago
- ☆22Mar 31, 2022Updated 4 years ago
- ☆10Aug 30, 2022Updated 3 years ago
- Nodes to trigger workflows from Discord or send interactive messages☆17Nov 3, 2024Updated last year
- ISWC2020 Semantic Web Challenge - Product Classification Top1 Solution☆15Nov 18, 2020Updated 5 years ago
- This is a repo for DCQA QUD parsing implemenation☆11Aug 5, 2025Updated 9 months ago
- Russian coreference resolution competition☆11Mar 24, 2023Updated 3 years ago
- code☆15Jun 21, 2020Updated 5 years ago
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…☆28Mar 2, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Java client library for https://mcp.run - call portable and secure tools for your AI Agents and Apps☆28Oct 31, 2025Updated 6 months ago
- Implementation of logistic regression using numpy☆15Aug 2, 2019Updated 6 years ago
- A neural RST discourse parser with well pre-trained XLNet.☆17Jun 13, 2022Updated 3 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集☆34Dec 21, 2022Updated 3 years ago
- MIPT course☆14May 23, 2021Updated 4 years ago
- Connect other bots as pipes.☆11Nov 6, 2019Updated 6 years ago