hitachi-nlp / appjsonify
A handy PDF-to-JSON conversion tool for academic papers implemented in Python.
☆64Updated last year
Alternatives and similar repositories for appjsonify:
Users that are interested in appjsonify are comparing it to the libraries listed below
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆74Updated 3 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆63Updated 5 months ago
- Repository for deepdoctection tutorial notebooks☆40Updated last month
- Using open source LLMs to build synthetic datasets for direct preference optimization☆49Updated 10 months ago
- Mixtral finetuning☆19Updated 11 months ago
- ☆62Updated 5 months ago
- Knowledge Graph Generator app☆30Updated 9 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 3 months ago
- Code for NeurIPS LLM Efficiency Challenge☆54Updated 9 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆119Updated last year
- ☆21Updated 10 months ago
- ☆24Updated last year
- ☆46Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- PyTorch implementation for MRL☆18Updated 10 months ago
- GLiNER model in a FastAPI microservice.☆34Updated last month
- ☆76Updated 7 months ago
- python package to parse pdfs with different parsers☆32Updated last month
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 9 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆59Updated 4 months ago
- ☆57Updated 6 months ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆25Updated last year
- Code/data for MARG (multi-agent review generation)☆36Updated 2 months ago
- ☆27Updated 3 months ago
- ☆47Updated last month
- Generalist and Lightweight Model for Text Classification☆58Updated 2 weeks ago
- Viewer for the structure extracted by Grobid on PDF documents☆44Updated last week
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆63Updated 11 months ago