hitachi-nlp / appjsonify
A handy PDF-to-JSON conversion tool for academic papers implemented in Python.
☆59Updated last year
Related projects ⓘ
Alternatives and complementary repositories for appjsonify
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated last month
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆25Updated 2 months ago
- ☆41Updated last month
- General solution to archetype LLM batch use case☆31Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆43Updated 2 months ago
- End-to-end zero-shot entity and relation extraction☆58Updated 3 months ago
- Mixtral finetuning☆19Updated 9 months ago
- ☆19Updated last month
- ☆30Updated 7 months ago
- To automate the SLR process and write paper quickly using multi agents of AI☆29Updated 8 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated 10 months ago
- ☆24Updated last year
- Repository for deepdoctection tutorial notebooks☆39Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- Scientific Document Insight Q/A☆23Updated this week
- ☆46Updated 9 months ago
- ☆82Updated 6 months ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper.☆72Updated 2 months ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain…☆52Updated last year
- ☆47Updated 2 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆33Updated 8 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆47Updated 2 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆44Updated 3 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆98Updated 10 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆23Updated 3 months ago
- Code for "Training-free Graph Neural Networks and the Power of Labels as Features" (TMLR 2024)☆49Updated 3 months ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- PyTorch implementation for MRL☆18Updated 9 months ago
- GLiNER model in a FastAPI microservice.☆30Updated 3 weeks ago