hitachi-nlp / appjsonify
A handy PDF-to-JSON conversion tool for academic papers implemented in Python.
☆68 Updated last year
Alternatives and similar repositories for appjsonify
Users who are interested in appjsonify are comparing it to the libraries listed below:
- Repository for deepdoctection tutorial notebooks ☆45 Updated last month
- Scientific Document Insight Q/A ☆29 Updated 3 weeks ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 Updated 8 months ago
- Viewer for the structure extracted by Grobid on PDF documents ☆52 Updated 2 months ago
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an… ☆30 Updated 10 months ago
- Efficient few-shot learning with cross-encoders. ☆54 Updated last year
- ☆94 Updated last year
- ☆32 Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆42 Updated last year
- ☆62 Updated 5 months ago
- Aligned, Review-Informed Edits of Scientific Papers ☆53 Updated 2 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆32 Updated 3 months ago
- ☆63 Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks ☆131 Updated 6 months ago
- A Python library to chunk/group your texts based on semantic similarity. ☆97 Updated last year
- ☆16 Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism. ☆62 Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated 9 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆126 Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆90 Updated 7 months ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain… ☆52 Updated 2 years ago
- Code for "Training-free Graph Neural Networks and the Power of Labels as Features" (TMLR 2024) ☆58 Updated 11 months ago
- Gzip and nearest neighbors for text classification ☆57 Updated last year
- Small python package to measure OCR quality and other related metrics. ☆24 Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆52 Updated 9 months ago
- Automates the SLR process and speeds up paper writing using multiple AI agents ☆45 Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries ☆209 Updated 5 months ago
- Mixtral finetuning ☆19 Updated last year
- Knowledge Graph Generator app ☆31 Updated last year
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal … ☆32 Updated 4 years ago