davendw49 / k2
Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024
☆186 · Updated 9 months ago
Alternatives and similar repositories for k2
Users who are interested in k2 are comparing it to the repositories listed below:
- Code and datasets for paper "GeoGalactica: A Scientific Large Language Model in Geoscience" ☆30 · Updated 9 months ago
- PDF parsing toolkit for preparing academic text corpus ☆55 · Updated 8 months ago
- PEACE: Empowering Geologic Map Holistic Understanding with MLLMs [Official, CVPR 2025] ☆14 · Updated last week
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data. ☆50 · Updated 8 months ago
- Accompanying repo for 'GPT4GEO: How a Language Model Sees the World's Geography' project ☆26 · Updated last year
- MGeo: Multi-Modal Geographic Language Model Pre-Training ☆78 · Updated last year
- ☆16 · Updated 6 months ago
- ☆87 · Updated 7 months ago
- A large-scale language model for the scientific domain, trained on the RedPajama arXiv split ☆131 · Updated last year
- ☆142 · Updated 9 months ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale" ☆229 · Updated last month
- [ACL 2024] OceanGPT: A Large Language Model for Ocean Science Tasks ☆41 · Updated last week
- A Toolkit for Table-based Question Answering ☆110 · Updated last year
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus ☆189 · Updated 2 months ago
- [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ☆131 · Updated 8 months ago
- ☆14 · Updated 9 months ago
- 智海三乐 (Zhihai Sanle): a large language model for education ☆46 · Updated last year
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto… ☆194 · Updated last month
- [ACL 2024 Main Conference] Chinese commonsense benchmark for LLMs ☆31 · Updated 8 months ago
- ☆137 · Updated 11 months ago
- TechGPT 2.0: Technology-Oriented Generative Pretrained Transformer 2.0 ☆110 · Updated 7 months ago
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs. ☆398 · Updated 3 months ago
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data" ☆104 · Updated last year
- ☆340 · Updated last month
- Code for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation" ☆168 · Updated 7 months ago
- A native Chinese retrieval-augmented generation evaluation benchmark ☆113 · Updated 11 months ago
- Generate dialog data from documents using LLMs such as ChatGLM2 or ChatGPT ☆154 · Updated last year
- ☆81 · Updated last year
- ☆264 · Updated 8 months ago
- ☆53 · Updated 5 months ago