davendw49 / k2
Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024
☆170Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for k2
- PDF parsing toolkit for preparing academic text corpus☆49Updated 4 months ago
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data.☆45Updated 4 months ago
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆97Updated 7 months ago
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆122Updated 8 months ago
- A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Char…☆166Updated 3 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆169Updated this week
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios☆145Updated last month
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆373Updated last week
- [Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering☆187Updated 5 months ago
- All in one PDF Parser Toolkit☆14Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆131Updated 4 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆144Updated 3 weeks ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆190Updated 3 weeks ago
- A Toolkit for Table-based Question Answering☆105Updated last year
- ☆192Updated 6 months ago
- ☆129Updated 4 months ago
- Official Repo of paper "KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction". In the paper, we propose …☆58Updated 3 months ago
- MGeo: Multi-Modal Geographic Language Model Pre-Training☆65Updated last year
- ☆119Updated 9 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆214Updated last year
- ☆91Updated 11 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆124Updated 2 months ago
- TianGong-AI-Unstructure☆51Updated this week
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆155Updated 7 months ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆496Updated 5 months ago
- MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models☆252Updated 5 months ago
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization☆54Updated last week
- 中文原生检索增强生成测评基准☆99Updated 6 months ago
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks☆253Updated 3 months ago
- The report of a fine-tuned GPT model unifying tables, natural language, and commands.☆100Updated 11 months ago