Overview of pipelines related to PDF to Markdown document processing.
☆98Oct 31, 2025Updated 7 months ago
Alternatives and similar repositories for pdf-extraction-agenda
Users that are interested in pdf-extraction-agenda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated last year
- ☆12Jul 13, 2023Updated 2 years ago
- ☆13Apr 4, 2024Updated 2 years ago
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".☆13Feb 27, 2024Updated 2 years ago
- ☆16Nov 9, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Compute benchmark of table structure recognition.☆29Dec 2, 2025Updated 6 months ago
- The Code for the EMNLP 2023 main conference paper "Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition…☆13Dec 10, 2023Updated 2 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".☆24Oct 28, 2024Updated last year
- An interactive RAG agent built with LangChain and MongoDB Atlas. Manage your knowledge base, switch embedding models, and tune retrieval …☆42Dec 19, 2025Updated 5 months ago
- EACL 2021☆11May 4, 2021Updated 5 years ago
- Script and patches for building TrebleDroid AOSP☆11Aug 28, 2024Updated last year
- Code for "HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking"☆98Nov 18, 2025Updated 6 months ago
- RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis☆12Sep 2, 2015Updated 10 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Codebase for character-centric story understanding☆14Jan 20, 2022Updated 4 years ago
- Make downloading scientific data much easier☆13Mar 3, 2026Updated 3 months ago
- ☆28Oct 14, 2024Updated last year
- Most basic AI Assistant demo derived from the DeepPavlov Dream AI Assistant.☆14May 22, 2023Updated 3 years ago
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction"☆18Dec 21, 2020Updated 5 years ago
- mcp scan☆22Jan 24, 2025Updated last year
- HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization☆18May 29, 2025Updated last year
- ☆13Sep 28, 2020Updated 5 years ago
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ArXiv daily dump and viewer using GitHub Actions - luvata.github.io/arxive☆14Updated this week
- ISWC2020 Semantic Web Challenge - Product Classification Top1 Solution☆15Nov 18, 2020Updated 5 years ago
- Code for our paper accepted at EMNLP 2023 (Findings)☆14Jan 5, 2024Updated 2 years ago
- A plugin for the Flow Launcher. When opening or saving files, quickly jump to the directory you already opened in File Explorer. Inspired…☆20Jun 7, 2025Updated last year
- code☆15Jun 21, 2020Updated 5 years ago
- even-realities G1 smart glasses flutter blue plus implementation for flutter side connection☆18Nov 18, 2024Updated last year
- 首个全参数训练的知识产权大模型 MoZi (墨子)☆28Aug 20, 2024Updated last year
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…☆27Mar 2, 2025Updated last year
- A neural RST discourse parser with well pre-trained XLNet.☆17Jun 13, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Dec 8, 2022Updated 3 years ago
- ☆14May 11, 2021Updated 5 years ago
- ☆14Dec 21, 2024Updated last year
- PowerPoint presentation builder from template using Jinja2☆18May 27, 2025Updated last year
- ☆18Jun 18, 2021Updated 4 years ago
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆24Dec 11, 2024Updated last year
- An unofficial MCP interface to interact with the PapersWithCode API☆22Jun 7, 2025Updated last year