davendw49 / sciparserLinks
PDF parsing toolkit for preparing academic text corpus
☆61Updated last year
Alternatives and similar repositories for sciparser
Users that are interested in sciparser are comparing it to the libraries listed below
Sorting:
- Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024☆198Updated last year
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data.☆53Updated last year
- All in one PDF Parser Toolkit☆16Updated last year
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"☆68Updated 3 months ago
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆136Updated last year
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆53Updated 9 months ago
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆103Updated last year
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆200Updated 7 months ago
- [NAACL 2024] End-to-End Beam Retrieval for Multi-Hop Question Answering☆114Updated last year
- [Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering☆193Updated last year
- ☆58Updated 10 months ago
- LLM for Scientific Research Survey☆98Updated 7 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆81Updated last year
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness☆144Updated last year
- The GitHub repository for the paper "Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning" accepte…☆19Updated last year
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".☆130Updated last year
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m…☆39Updated 2 years ago
- Repo for ACL2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models"☆61Updated last year
- ☆36Updated 11 months ago
- A Toolkit for Table-based Question Answering☆113Updated last year
- [WWW 2025] A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System.☆108Updated last month
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆107Updated 10 months ago
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto…☆217Updated 2 months ago
- Code/data for MARG (multi-agent review generation)☆49Updated 9 months ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆244Updated last year
- [ACL-24 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"☆54Updated last year
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆185Updated last year
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆41Updated 4 months ago
- TechGPT 2.0: Technology-Oriented Generative Pretrained Transformer 2.0☆113Updated last year
- ☆83Updated last year