davendw49 / sciparser
PDF parsing toolkit for preparing academic text corpus
☆61 Updated last year
Alternatives and similar repositories for sciparser
Users who are interested in sciparser compare it to the repositories listed below.
- Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024 ☆204 Updated last year
- All in one PDF Parser Toolkit ☆16 Updated 2 years ago
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data. ☆53 Updated last year
- A large-scale language model for scientific domain, trained on redpajama arXiv split ☆136 Updated last year
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness ☆144 Updated last year
- [EMNLP2024] Aligning Large Language Models on Information Extraction ☆53 Updated 11 months ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables". ☆131 Updated last year
- [Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering ☆195 Updated last year
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus ☆204 Updated 9 months ago
- A Toolkit for Table-based Question Answering ☆113 Updated 2 years ago
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data" ☆102 Updated last year
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto… ☆224 Updated 4 months ago
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering" ☆72 Updated 5 months ago
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio… ☆108 Updated last week
- TianGong-AI-Unstructure ☆69 Updated last week
- Code and datasets for paper "GeoGalactica: A Scientific Large Language Model in Geoscience" ☆36 Updated last year
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024) ☆83 Updated last year
- The report of a fine-tuned GPT model unifying tables, natural language, and commands. ☆110 Updated last year
- TechGPT 2.0: Technology-Oriented Generative Pretrained Transformer 2.0 ☆114 Updated last year
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs. ☆405 Updated 9 months ago
- ☆147 Updated last year
- Repo for ACL2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models" ☆61 Updated last year
- MathEval is a benchmark dedicated to the holistic evaluation on mathematical capacities of LLMs. ☆84 Updated 11 months ago
- Zero-shot KGQA method based on curiosity-driven graph exploration. Agarwal et al., "Bring Your Own KG: Self-Supervised Program Synthesis … ☆28 Updated last year
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining. ☆135 Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆139 Updated 5 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models ☆44 Updated last year
- FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering ☆78 Updated 2 years ago
- ☆58 Updated last year
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark. ☆51 Updated 2 years ago