A set of tools to create synthetically-generated data from documents
☆39Aug 15, 2025Updated 6 months ago
Alternatives and similar repositories for docling-sdg
Users that are interested in docling-sdg are comparing it to the libraries listed below
Sorting:
- Docling workshops☆40Feb 24, 2026Updated last week
- Making docling agentic through MCP☆426Jan 22, 2026Updated last month
- MCP server for retrieval augmented thinking and problem solving☆20Aug 13, 2025Updated 6 months ago
- ☆22Feb 1, 2025Updated last year
- Simple package to extract text with coordinates from programmatic PDFs☆245Feb 25, 2026Updated last week
- Build document-native LLM applications☆56Sep 11, 2024Updated last year
- An MCP server that provides image recognition 👀 capabilities using Anthropic and OpenAI vision APIs☆35Apr 12, 2025Updated 10 months ago
- Docling core data types and transformations☆230Updated this week
- Evaluation framework for document processing models and services.☆64Feb 12, 2026Updated 3 weeks ago
- [TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Thr…☆34Dec 5, 2025Updated 3 months ago
- plugin to add metadata to the head of a document☆39Jul 17, 2024Updated last year
- Small C# caching and cache-filling library, intended as a replacement for memcached in many cases.☆14Apr 28, 2025Updated 10 months ago
- TOON as DSPy adapter☆25Feb 1, 2026Updated last month
- ☆27Updated this week
- A Model Context Protocol server that provides documentation access capabilities. This server enables LLMs to search and retrieve content …☆18Apr 29, 2025Updated 10 months ago
- ☆17Feb 20, 2026Updated last week
- mcp server for gitingest☆136Mar 21, 2025Updated 11 months ago
- ☆38Jan 19, 2026Updated last month
- ☆17Jun 8, 2025Updated 8 months ago
- A small example of a generic host based .NET core app which can be run as a Windows Service.☆11Aug 15, 2018Updated 7 years ago
- OpenPGP in Python using Sequoia PGP☆18Feb 25, 2026Updated last week
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- IEEE Taxonomy in RDF (with Python tool for converting it from txt to rdf)☆12Dec 22, 2025Updated 2 months ago
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆12Apr 25, 2024Updated last year
- AgRec is an open source Agriculture Recommendations from the Cooperative Extension Services.☆12Jan 8, 2022Updated 4 years ago
- ☆13May 10, 2023Updated 2 years ago
- AI Counselor is your daily companion that tracks your emotions and provides you with a summary along with action points at the end of the…☆11Dec 3, 2023Updated 2 years ago
- ☆11Dec 23, 2023Updated 2 years ago
- RDP Credential Provider☆11Oct 29, 2025Updated 4 months ago
- ☆15Aug 5, 2025Updated 7 months ago
- Automatically generate tests for your website by using LLM models☆17Aug 7, 2023Updated 2 years ago
- Saltstack AWS creates all the non-server components of a AWS hosted datacenter☆11Aug 23, 2018Updated 7 years ago
- CLI for creating a new Payload project☆11Oct 13, 2023Updated 2 years ago
- A project to convert the default and contrib.humanize template filters from Django to JavaScript☆14Updated this week
- CveBinarySheet: A Comprehensive Pre-built Binaries Database Focused on IoT Vulnerability Scenarios☆15Jan 17, 2025Updated last year
- ☆17Feb 26, 2026Updated last week
- Code to reproduce the material covered in Kùzu's YouTube tutorials☆20Oct 10, 2025Updated 4 months ago
- 🤖 AIDevOS: AI-Driven Autonomous DevOps System | Multi-agent collaboration framework with DSPy integration for automated application deve…☆14Mar 3, 2025Updated last year
- ☆29Dec 11, 2025Updated 2 months ago