This package enables inference of header hierarchy in the docling PDF parsing pipeline.
☆58Feb 19, 2026Updated 2 weeks ago
Alternatives and similar repositories for docling-hierarchical-pdf
Users that are interested in docling-hierarchical-pdf are comparing it to the libraries listed below
Sorting:
- ☆24Aug 26, 2025Updated 6 months ago
- ☆19Feb 27, 2025Updated last year
- Neural network based lemmatizer for Finnish language☆11Sep 10, 2020Updated 5 years ago
- Create synthetic datasets from scratch using AI-powered generation. Define topics, customize prompts, and generate high-quality reasoning…☆29Mar 1, 2026Updated last week
- Automation for IBM Watson Deployments☆17Sep 17, 2025Updated 5 months ago
- Structured pruning and bias visualization for Large Language Models. Tools for LLM optimization and fairness analysis.☆29Mar 2, 2026Updated last week
- Generates a template deck for architecture diagrams☆22Aug 31, 2025Updated 6 months ago
- Graphlit Platform☆30Feb 20, 2024Updated 2 years ago
- The dataset used to evaluate JobBERT on the task of job title normalization.☆28Sep 10, 2022Updated 3 years ago
- ☆34Apr 30, 2025Updated 10 months ago
- Multi-tenancy assets for IBM clients to build SaaS☆33Jul 22, 2022Updated 3 years ago
- ☆28Nov 6, 2023Updated 2 years ago
- A guidance compatibility layer for llama-cpp-python☆36Sep 11, 2023Updated 2 years ago
- BeHonest: Benchmarking Honesty in Large Language Models☆34Aug 15, 2024Updated last year
- Staging area for a public release of Theorizer☆154Feb 11, 2026Updated 3 weeks ago
- AI Agents, LLM Fine-tuning, Developer Productivity, Governance, IBM watsonx☆49Jan 7, 2026Updated 2 months ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆44Jul 16, 2024Updated last year
- Sample applications that use IBM embeddable AI libraries and linked from https://dsce.ibm.com☆45Jan 21, 2026Updated last month
- Retrieval-augmented generation (RAG) for remote & local LLM use☆44May 24, 2025Updated 9 months ago
- ☆46Jun 11, 2025Updated 8 months ago
- The Agent Lifecycle Toolkit (ALTK) is a library of components to help agent builders improve their agent with minimal integration effort …☆111Feb 6, 2026Updated last month
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆51Oct 31, 2024Updated last year
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆51Aug 24, 2025Updated 6 months ago
- Build document-native LLM applications☆56Sep 11, 2024Updated last year
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆82Feb 8, 2026Updated last month
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Jun 19, 2023Updated 2 years ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 3 months ago
- A Corpus of 475,000 Industrial Occupations☆70Nov 20, 2020Updated 5 years ago
- Use Claude Code on Kanban WebUI☆143Jan 28, 2026Updated last month
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆71Jul 11, 2023Updated 2 years ago
- Easily view and modify JSON datasets for large language models☆87May 16, 2025Updated 9 months ago
- ☆74Sep 27, 2024Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆73Oct 16, 2023Updated 2 years ago
- Python package wrapping llama.cpp for on-device LLM inference☆101Oct 12, 2025Updated 4 months ago
- The AI runtime that turns your framework functions into OpenAI compatible endpoints☆88Feb 27, 2025Updated last year
- This is a repository which uses LangChain LangGraph and DuckduckGo to create a Perplexity Clone☆89Oct 6, 2024Updated last year
- Code for KaLM-Embedding models☆114Jun 30, 2025Updated 8 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Apr 17, 2025Updated 10 months ago
- ☆114Jul 1, 2025Updated 8 months ago