docling-project / docling-evalLinks
Evaluation framework for document processing models and services.
☆21Updated this week
Alternatives and similar repositories for docling-eval
Users that are interested in docling-eval are comparing it to the libraries listed below
Sorting:
- ☆28Updated 5 months ago
- ☆17Updated 3 weeks ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 3 weeks ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Updated last year
- Minimum Description Length probing for neural network representations☆19Updated 4 months ago
- ☆16Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆21Updated 4 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Small python package to measure OCR quality and other related metrics.☆22Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- ☆8Updated 11 months ago
- ☆17Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆58Updated last month
- Training hybrid models for dummies.☆22Updated 5 months ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆17Updated last year
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆30Updated 9 months ago
- Code for SaGe subword tokenizer (EACL 2023)☆25Updated 6 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Python library to use Pleias-RAG models☆57Updated last month
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆15Updated last week
- Efficiently computing & storing token n-grams from large corpora☆23Updated 8 months ago
- Submission to the inverse scaling prize☆23Updated last year
- ☆20Updated 3 months ago
- Rust bindings for CTranslate2☆14Updated last year
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main)☆23Updated last month
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- ☆47Updated 4 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆31Updated this week