davendw49 / sciparserLinks
PDF parsing toolkit for preparing academic text corpus
☆58Updated 11 months ago
Alternatives and similar repositories for sciparser
Users that are interested in sciparser are comparing it to the libraries listed below
Sorting:
- Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024☆195Updated last year
- All in one PDF Parser Toolkit☆16Updated last year
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data.☆53Updated 11 months ago
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆133Updated last year
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆50Updated last year
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆79Updated last year
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆103Updated last year
- Repo for ACL2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models"☆61Updated last year
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆59Updated last year
- ☆36Updated 9 months ago
- ☆142Updated 11 months ago
- LLM for Scientific Research Survey☆96Updated 5 months ago
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener…☆60Updated 11 months ago
- A Toolkit for Table-based Question Answering☆112Updated last year
- Code and datasets for paper "GeoGalactica: A Scientific Large Language Model in Geoscience"☆32Updated last year
- ☆57Updated 8 months ago
- ☆124Updated last year
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m…☆39Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆54Updated last year
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios