hitachi-nlp / appjsonify
A handy PDF-to-JSON conversion tool for academic papers implemented in Python.
☆70 Updated 2 years ago
Alternatives and similar repositories for appjsonify
Users who are interested in appjsonify are comparing it to the libraries listed below.
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆43 Updated last year
- Scientific Document Insight Q/A ☆31 Updated last month
- Generalist and Lightweight Model for Text Classification ☆163 Updated 3 months ago
- Python API for https://vespa.ai, the open big data serving engine ☆143 Updated last week
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated last year
- [TACL] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retri… ☆31 Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 Updated 11 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆129 Updated last year
- Evaluation framework for document processing models and services. ☆43 Updated this week
- Efficient few-shot learning with cross-encoders. ☆59 Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆80 Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆98 Updated 10 months ago
- Truly flash implementation of DeBERTa disentangled attention mechanism. ☆66 Updated last week
- ☆83 Updated 4 months ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding. ☆42 Updated 6 months ago
- ☆80 Updated last year
- TF-ID: Table/Figure IDentifier for academic papers ☆240 Updated last year
- ☆50 Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their… ☆155 Updated last year
- ☆16 Updated last year
- ☆102 Updated last year
- experiments with inference on llama ☆104 Updated last year
- Mixtral finetuning ☆19 Updated last year
- Repository for deepdoctection tutorial notebooks ☆45 Updated 3 months ago
- PyLate efficient inference engine ☆65 Updated 3 weeks ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆66 Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries ☆214 Updated 8 months ago
- PyTorch implementation for MRL ☆19 Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators" ☆134 Updated last year
- Aligned, Review-Informed Edits of Scientific Papers ☆54 Updated 2 years ago