A set of tools to create synthetically-generated data from documents
☆48Aug 15, 2025Updated 9 months ago
Alternatives and similar repositories for docling-sdg
Users who are interested in docling-sdg are comparing it to the libraries listed below.
- Build document-native LLM applications☆58Sep 11, 2024Updated last year
- ☆204May 8, 2026Updated 2 weeks ago
- Docling core data types and transformations☆255May 19, 2026Updated last week
- MCP server for retrieval augmented thinking and problem solving☆23Aug 13, 2025Updated 9 months ago
- Docling Haystack integration☆29Apr 9, 2026Updated last month
- OCI container images. A Slinky project.☆21May 18, 2026Updated last week
- Examples using the Deep Search functionalities☆87Jan 29, 2025Updated last year
- Shared virtualization management library☆32May 19, 2026Updated last week
- Automatically generated Sysmon parser for Azure Sentinel☆18Jan 6, 2026Updated 4 months ago
- ☆29Dec 11, 2025Updated 5 months ago
- Adapt MLLMs to Domains via Post-Training (EMNLP 2025 Findings)☆14Nov 11, 2025Updated 6 months ago
- Extracts shellcode from a NASM-compiled Mach-O binary☆17Jan 28, 2021Updated 5 years ago
- Recovered samples, extracted Wasm/binaries, decoded payloads & analysis scripts from the Coruna iOS/macOS exploit kit (b27.icu). 28 JS mo…☆55May 9, 2026Updated 2 weeks ago
- A Docusaurus plugin that generates a concatenated markdown file from your documentation under /llms.txt☆32Nov 15, 2024Updated last year
- HPK allows running Kubernetes applications within HPC by translating deployments to Slurm and Singularity/Apptainer☆29Jan 30, 2026Updated 3 months ago
- An external provider for Llama Stack allowing for the use of RamaLama for inference.☆21Dec 22, 2025Updated 5 months ago
- Framework for deploying configurable AI agents with real-time streaming and tool execution.☆40Sep 18, 2025Updated 8 months ago
- Small C# caching and cache-filling library, intended as a replacement for memcached in many cases.☆15Apr 28, 2025Updated last year
- Transform Claude Code transcript JSONL files into readable terminal and HTML formats.☆76Feb 10, 2026Updated 3 months ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- [NAACL 2025 🔥] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions.☆38Apr 17, 2025Updated last year
- [TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Thr…☆34Dec 5, 2025Updated 5 months ago
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆227Jan 24, 2025Updated last year
- Repository for the KVP10k dataset☆23Sep 18, 2025Updated 8 months ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 3 years ago
- TOON as DSPy adapter☆26Feb 1, 2026Updated 3 months ago
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift☆27Aug 27, 2025Updated 8 months ago
- ☆11Dec 23, 2023Updated 2 years ago
- Get more people involved in open source, and let independent design skills help more developers☆16May 31, 2024Updated last year
- Scalable Kubernetes-native implementation of the Open Data Fabric protocol for global collaborative data processing☆23May 18, 2026Updated last week
- An MCP server that provides image recognition 👀 capabilities using Anthropic and OpenAI vision APIs☆38Apr 12, 2025Updated last year
- Evaluation framework for document processing models and services.☆73May 15, 2026Updated last week
- An LLM based shell assistant that knows your usual shell commands.☆17Jul 18, 2025Updated 10 months ago
- RDP Credential Provider☆12Oct 29, 2025Updated 6 months ago
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Mar 25, 2026Updated 2 months ago
- An OpenAI API proxy service based on Cloudflare Workers, supporting multi-channel management, token management, and usage statistics☆25Apr 26, 2026Updated last month
- Boosting Natural Language Generation from Instructions with Meta-Learning☆11Dec 20, 2022Updated 3 years ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆54Feb 27, 2025Updated last year
- NodeJS - native macOS process list loader☆25Feb 13, 2023Updated 3 years ago