Acemap / pdf_parserLinks
All in one PDF Parser Toolkit
☆16Updated last year
Alternatives and similar repositories for pdf_parser
Users that are interested in pdf_parser are comparing it to the libraries listed below
Sorting:
- PDF parsing toolkit for preparing academic text corpus☆61Updated last year
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data.☆53Updated last year
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆104Updated last year
- Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data☆20Updated last year
- ☆17Updated 2 weeks ago
- ACL 2023 (Findings) - BertNet: Harvesting Knowledge Graphs from Pretrained Language Models☆106Updated last year
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m…☆39Updated last year
- Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024☆197Updated last year
- Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction☆53Updated last year
- Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"☆17Updated last year
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆53Updated 9 months ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆12Updated last year
- Code for our paper "Graph Language Models"☆73Updated 11 months ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆61Updated 2 years ago
- This repo explores how AMR to address tasks difficult for LLMs☆13Updated last year
- A extension of Transformers library to include T5ForSequenceClassification class.☆39Updated 2 years ago
- Code for Benchmarking Language Model Agents for Data-Driven Science☆28Updated 9 months ago
- ☆78Updated 10 months ago
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆74Updated last year
- [ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models☆42Updated 2 months ago
- The github repository for the paper at COLING 2025: Retrieval Augmented Instruction Tuning for Open NER with Large Language Models.☆22Updated last year
- This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation.☆73Updated 2 years ago
- [ACL-24 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"☆54Updated last year
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆40Updated 4 months ago
- Code/data for MARG (multi-agent review generation)☆47Updated 8 months ago
- a curated list of the role of small models in the LLM era☆103Updated 10 months ago
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness☆144Updated 11 months ago
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆20Updated 9 months ago
- We want to try and evaluate LLMs using Knowledge Graphs☆106Updated 2 years ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆31Updated 3 months ago