davendw49 / sciparser
PDF parsing toolkit for preparing an academic text corpus
☆55 Updated 9 months ago
Alternatives and similar repositories for sciparser:
Users who are interested in sciparser compare it to the libraries listed below.
- Code and datasets for the paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" (WSDM 2024) ☆190 Updated 10 months ago
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework that fuses papers' illustrations, text, and bibliometric data. ☆53 Updated 9 months ago
- All-in-one PDF parser toolkit ☆16 Updated last year
- [ICLR 2024] The open-source repo of THU-KEG's KoLA benchmark. ☆50 Updated last year
- LLM for Scientific Research Survey ☆81 Updated 3 months ago
- A large-scale language model for the scientific domain, trained on the RedPajama arXiv split ☆133 Updated last year
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024) ☆80 Updated last year
- Repo for the ACL 2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models" ☆60 Updated last year
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data" ☆105 Updated last year
- Code for "A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction" ☆14 Updated last year
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener… ☆60 Updated 9 months ago
- Code and datasets for the paper "GeoGalactica: A Scientific Large Language Model in Geoscience" ☆32 Updated 9 months ago
- Code/data for MARG (multi-agent review generation) ☆42 Updated 5 months ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion ☆53 Updated last year
- [EMNLP 2024] Aligning Large Language Models on Information Extraction ☆46 Updated 5 months ago
- TianGong-AI-Unstructure ☆63 Updated last week
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆48 Updated 10 months ago
- This is the repository for the paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models" ☆24 Updated last year
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey" ☆59 Updated last year
- [NeurIPS 2023] Source code for "Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory" ☆59 Updated last year
- Code for the paper "SoAy: A Service-oriented APIs Applying Framework of Large Language Models" ☆25 Updated 8 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024) ☆48 Updated last year
- This repository contains ScholarQABench data and evaluation pipeline. ☆71 Updated 2 weeks ago