hitachi-nlp / appjsonify
A handy PDF-to-JSON conversion tool for academic papers implemented in Python.
☆52Updated 11 months ago
Related projects: ⓘ
- Repository for deepdoctection tutorial notebooks☆36Updated 2 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆67Updated 2 months ago
- End-to-end zero-shot entity and relation extraction☆50Updated last month
- Gzip and nearest neighbors for text classification☆56Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆33Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆65Updated 2 months ago
- Mixtral finetuning☆19Updated 7 months ago
- ☆24Updated 2 months ago
- ☆78Updated 4 months ago
- Code for NeurIPS LLM Efficiency Challenge☆52Updated 5 months ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆17Updated last year
- ☆71Updated 3 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated 8 months ago
- ☆31Updated last year
- A Retrieval Benchmark for Scientific Literature Search☆53Updated 2 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆43Updated 8 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated 8 months ago
- General solution to archetype LLM batch use case