Overview of pipelines related to PDF to Markdown document processing.
☆94Oct 31, 2025Updated 6 months ago
Alternatives and similar repositories for pdf-extraction-agenda
Users that are interested in pdf-extraction-agenda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Prompt Engineering for Developers☆16Oct 15, 2025Updated 6 months ago
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated last year
- This is a smart chunker for efficient preparing of long document for RAG☆13Mar 24, 2026Updated last month
- ☆12Jul 13, 2023Updated 2 years ago
- RSS Launchpad web extension: quickly add new RSS/Atom subscriptions from websites☆20May 18, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Apr 4, 2024Updated 2 years ago
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".☆13Feb 27, 2024Updated 2 years ago
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".☆24Oct 28, 2024Updated last year
- An interactive RAG agent built with LangChain and MongoDB Atlas. Manage your knowledge base, switch embedding models, and tune retrieval …☆42Dec 19, 2025Updated 4 months ago
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.☆32May 24, 2025Updated 11 months ago
- Code for "HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking"☆96Nov 18, 2025Updated 5 months ago
- Codebase for character-centric story understanding☆14Jan 20, 2022Updated 4 years ago
- ☆28Oct 14, 2024Updated last year
- HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization☆17May 29, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated 2 years ago
- ArXiv daily dump and viewer using GitHub Actions - luvata.github.io/arxive☆14Updated this week
- Code for our paper accepted at EMNLP 2023 (Findings)☆14Jan 5, 2024Updated 2 years ago
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…☆27Mar 2, 2025Updated last year
- 项目的issue会存放我的所有blog☆19Sep 12, 2025Updated 7 months ago
- On Finetuning Tabular Foundation Models Paper Code☆37Sep 3, 2025Updated 7 months ago
- Документация JSON API 1.2☆18Updated this week
- End-to-end integration of HuggingFace's models for sequence labeling.☆11Oct 4, 2020Updated 5 years ago
- Pure javascript XML SAX parser for Node.js☆19Mar 24, 2016Updated 10 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Implementation of logistic regression using numpy☆15Aug 2, 2019Updated 6 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集☆34Dec 21, 2022Updated 3 years ago
- ☆14May 11, 2021Updated 4 years ago
- ☆17Jul 24, 2023Updated 2 years ago
- ☆19Feb 3, 2026Updated 2 months ago
- Send emails in a comfortable way via models.☆89Jun 25, 2015Updated 10 years ago
- rst-workbench enables the hassle-free installation of RST parsers. It lets you visually compare their results in your browser.☆19Apr 12, 2024Updated 2 years ago
- chatGPT integrated into Telegram using official OpenAI API☆16Mar 2, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An unofficial MCP interface to interact with the PapersWithCode API☆22Jun 7, 2025Updated 10 months ago
- russian TTS☆83Feb 11, 2025Updated last year
- The minimalist JavaScript loader☆39Nov 11, 2013Updated 12 years ago
- 1st place solution for GramEval-2020☆14Jan 13, 2023Updated 3 years ago
- VQA-Med 2020☆16Jan 27, 2023Updated 3 years ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated last year
- Extends Tape with new assertions☆15Feb 14, 2016Updated 10 years ago