davendw49 / k2
Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024
☆171Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for k2
- PDF parsing toolkit for preparing academic text corpus☆49Updated 4 months ago
- Code and datasets for paper "GeoGalactica: A Scientific Large Language Model in Geoscience"☆19Updated 4 months ago
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data.☆46Updated 4 months ago
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆375Updated 2 weeks ago
- [Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering☆188Updated 5 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆174Updated this week
- MGeo: Multi-Modal Geographic Language Model Pre-Training☆67Updated last year
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆123Updated 8 months ago
- ☆119Updated 9 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆191Updated last month
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆133Updated 3 months ago
- ☆129Updated 4 months ago
- A Toolkit for Table-based Question Answering☆105Updated last year
- All in one PDF Parser Toolkit☆14Updated last year
- TechGPT 2.0: Technology-Oriented Generative Pretrained Transformer 2.0☆101Updated 2 months ago
- UrbanKGent is an urban knowledge graph construction agent.☆27Updated last month
- TianGong-AI-Unstructure☆51Updated this week
- [ACL 2024] OceanGPT: A Large Language Model for Ocean Science Tasks☆33Updated 3 months ago
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios☆150Updated last month
- 智海三乐-教育大模型☆35Updated last year
- ☆83Updated 2 weeks ago
- ☆130Updated 6 months ago
- KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents☆173Updated last month
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆132Updated 5 months ago
- ☆194Updated 7 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆308Updated 2 months ago
- ☆120Updated 7 months ago
- Official Repo of paper "KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction". In the paper, we propose …☆59Updated 3 months ago
- Code for "A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction"☆11Updated 8 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆156Updated 7 months ago