davendw49 / sciparserLinks
PDF parsing toolkit for preparing academic text corpus
☆61Updated last year
Alternatives and similar repositories for sciparser
Users that are interested in sciparser are comparing it to the libraries listed below
Sorting:
- Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024☆207Updated last year
- All in one PDF Parser Toolkit☆16Updated 2 years ago
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data.☆53Updated last year
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆137Updated last year
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto…☆231Updated last month
- Code and datasets for paper "GeoGalactica: A Scientific Large Language Model in Geoscience"☆39Updated last year
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"☆80Updated 8 months ago
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆103Updated last year
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness☆144Updated last year
- [ACL 2023] Plug-and-Play Knowledge Injection for Pre-trained Language Models☆61Updated last year
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆210Updated last year
- ☆147Updated last year
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.☆136Updated 2 years ago
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆53Updated last year
- LLM for Scientific Research Survey☆118Updated last year
- ☆87Updated 2 years ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆86Updated last year
- A Toolkit for Table-based Question Answering☆115Updated 2 years ago
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆409Updated last year
- The report of a fine-tuned GPT model unifying tables, natural language, and commands.☆110Updated 2 years ago
- ☆84Updated last year
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆198Updated last year
- ☆58Updated last year
- Official Implementation of ACL 2021 paper “GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning”.☆73Updated 4 years ago
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m…☆41Updated 2 years ago
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener…☆60Updated last year
- Code for "A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction"☆15Updated last year
- MathEval is a benchmark dedicated to the holistic evaluation on mathematical capacities of LLMs.☆86Updated last year
- Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"☆103Updated 2 years ago
- TianGong-AI-Unstructure☆69Updated 3 months ago