Acemap / pdf_parserLinks
All in one PDF Parser Toolkit
☆16Updated 2 years ago
Alternatives and similar repositories for pdf_parser
Users that are interested in pdf_parser are comparing it to the libraries listed below
Sorting:
- PDF parsing toolkit for preparing academic text corpus☆61Updated last year
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data.☆53Updated last year
- Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024☆207Updated last year
- [EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science☆34Updated last year
- Code and datasets for paper "GeoGalactica: A Scientific Large Language Model in Geoscience"☆39Updated last year
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆103Updated last year
- Repo for the paper: Towards Few-shot Entity Recognition in Document Images:A Label-aware Sequence-to-Sequence Framework☆14Updated 2 years ago
- Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"☆17Updated last year
- Recommender for suggesting letter writers 👍☆34Updated last year
- ☆31Updated last year
- Virtual Adversarial Training (VAT) techniques in PyTorch☆17Updated 3 years ago
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆53Updated last year
- Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data☆21Updated last year
- A trainable user simulator☆34Updated 7 months ago
- ACL 2023 (Findings) - BertNet: Harvesting Knowledge Graphs from Pretrained Language Models☆107Updated last year
- ☆18Updated 6 months ago
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity☆74Updated last year
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆34Updated 9 months ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Updated last year
- The code used to train and run inference with MMDocIR☆32Updated 8 months ago
- ☆39Updated last year
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation☆28Updated last year
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping.☆38Updated last year
- This repo explores how AMR to address tasks difficult for LLMs☆13Updated 2 years ago
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m…☆41Updated 2 years ago
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆76Updated last year
- The unified platform for data-related resources.☆135Updated 2 years ago
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness☆144Updated last year
- a curated list of the role of small models in the LLM era☆111Updated last year
- EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!☆46Updated last year