Overview of pipelines related to PDF to Markdown document processing.
☆94Oct 31, 2025Updated 4 months ago
Alternatives and similar repositories for pdf-extraction-agenda
Users that are interested in pdf-extraction-agenda are comparing it to the libraries listed below.
- Prompt Engineering for Developers☆16Oct 15, 2025Updated 5 months ago
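- Backend service for an invoice recognition system built with FastAPI, with concurrency support. Uses a trained ERFNet model for invoice contour detection, followed by distortion correction, OCR recognition, and template matching; supports recognition of skewed invoices. Accuracy 99.9%.☆13May 8, 2025Updated 10 months ago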
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated last year
- A smart chunker for efficiently preparing long documents for RAG☆13Updated this week
- ☆13Jul 13, 2023Updated 2 years ago
- Continuous diffusion for layout generation☆54Feb 19, 2025Updated last year
- Fork of RecurrentGPT with modifications☆10Sep 18, 2024Updated last year
- Identify VMess packets in network traffic☆13Nov 21, 2022Updated 3 years ago
- CLIPCleaner: Cleaning Noisy Labels with CLIP (ACM MM2024)☆15Apr 28, 2025Updated 11 months ago
- Compute benchmark of table structure recognition.☆28Dec 2, 2025Updated 3 months ago
- https://arxiv.org/abs/2201.06499☆29Apr 9, 2024Updated last year
- ☆14Oct 14, 2022Updated 3 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.☆30May 24, 2025Updated 10 months ago
- Designed an android application using android studio 1.3, java, xml. This application is a digital version of the actual Monopoly game. I…☆11Sep 25, 2021Updated 4 years ago
- Library for industrial alignment.☆405Sep 24, 2025Updated 6 months ago
- Code for "HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking"☆90Nov 18, 2025Updated 4 months ago
- This repository contains example implementations of a documentation question-answering bot based on YandexGPT and other Yandex Cloud services☆33Feb 12, 2024Updated 2 years ago
- CemuLauncher - A custom Launcher for Cemu with advanced features.☆15Aug 18, 2021Updated 4 years ago
- RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis☆12Sep 2, 2015Updated 10 years ago
- Codebase for character-centric story understanding☆14Jan 20, 2022Updated 4 years ago
- MCP server for retrieving relevant documentation from a knowledge base☆14Mar 3, 2025Updated last year
- ☆28Oct 14, 2024Updated last year
- Most basic AI Assistant demo derived from the DeepPavlov Dream AI Assistant.☆13May 22, 2023Updated 2 years ago
- Code for the paper "Evading Black-box Classifiers Without Breaking Eggs" [SaTML 2024]☆21Apr 15, 2024Updated last year
- mcp scan☆22Jan 24, 2025Updated last year
- HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization☆17May 29, 2025Updated 10 months ago
- ☆10Aug 30, 2022Updated 3 years ago
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated 2 years ago
- Tools and Functions for Open Webui☆18Jul 4, 2024Updated last year
- ISWC2020 Semantic Web Challenge - Product Classification Top1 Solution☆15Nov 18, 2020Updated 5 years ago
- ITMO course☆12Mar 26, 2024Updated 2 years ago
- MoZi (墨子), the first intellectual property large language model trained with full-parameter training☆26Aug 20, 2024Updated last year
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…☆27Mar 2, 2025Updated last year
- code☆15Jun 21, 2020Updated 5 years ago
- On Finetuning Tabular Foundation Models Paper Code☆35Sep 3, 2025Updated 6 months ago
- ☆14Dec 21, 2024Updated last year
- Nano-BERT is a straightforward, lightweight and comprehensible custom implementation of BERT, inspired by the foundational "Attention is …☆20Oct 19, 2023Updated 2 years ago
- ☆11Dec 8, 2022Updated 3 years ago