Pre-trained Language Model for Scientific Text
☆45Feb 22, 2024Updated 2 years ago
Alternatives and similar repositories for Awesome-SciLM
Users that are interested in Awesome-SciLM are comparing it to the libraries listed below
Sorting:
- A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery (EMNLP'24)☆643Jun 21, 2025Updated 8 months ago
- ☆17Jun 3, 2024Updated last year
- Open-sourced dialogue foundation model for Chemistry and molecule science☆101May 6, 2025Updated 10 months ago
- Evaluation Pipeline for medical tasks.☆12Feb 13, 2026Updated 3 weeks ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)☆18Jan 13, 2025Updated last year
- ☆16Sep 4, 2025Updated 6 months ago
- ☆11Jan 3, 2024Updated 2 years ago
- ☆11Jun 4, 2021Updated 4 years ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 10 months ago
- Solution of KDD cup 2021☆11Jun 16, 2021Updated 4 years ago
- A toolkit to automatically crawl the paper list and download paper pdfs of ACL Ahthology.☆10Nov 12, 2025Updated 3 months ago
- To be readable without enhancing english power.☆10Jul 22, 2020Updated 5 years ago
- ☆16May 31, 2024Updated last year
- 该仓库主要记录 NLP 算法工程师相关的 搜索引擎 学习笔记☆13Apr 9, 2022Updated 3 years ago
- [NeurIPS 24] Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation☆18Jan 2, 2026Updated 2 months ago
- script to evaluate pre-trained Japanese word2vec model on Japanese similarity dataset☆12Nov 4, 2024Updated last year
- ☆35Jan 19, 2026Updated last month
- ☆15Mar 15, 2022Updated 3 years ago
- ☆19Aug 5, 2024Updated last year
- Awesome Long-CoT Data☆18Mar 26, 2025Updated 11 months ago
- BERT models pretrained on the CORD-19 Kaggle dataset☆15Jun 8, 2020Updated 5 years ago
- DefSent: Sentence Embeddings using Definition Sentences☆22Aug 5, 2021Updated 4 years ago
- ☆20Feb 26, 2021Updated 5 years ago
- Elixir: Train a Large Language Model on a Small GPU Cluster☆15Jun 8, 2023Updated 2 years ago
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models☆26Jul 13, 2025Updated 7 months ago
- ☆17May 31, 2023Updated 2 years ago
- ☆23Sep 27, 2024Updated last year
- This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.☆19Apr 27, 2025Updated 10 months ago
- ☆22Apr 15, 2022Updated 3 years ago
- Easy-to-use scripts to fine-tune GPT-2-JA with your own texts, to generate sentences, and to tweet them automatically.☆19Aug 26, 2025Updated 6 months ago
- A library for semantic similarity search☆26Jan 31, 2025Updated last year
- [ACL 2024] ReactXT: Understanding Molecular “Reaction-ship” via Reaction-Contextualized Molecule-Text Pretraining. by Zhiyuan Liu*, Yaoru…☆28Sep 3, 2024Updated last year
- [EMNLP 2023] ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction.☆22Jan 28, 2024Updated 2 years ago
- AI4Chem is a code to test the ability of large language models (ChatGPT) to comprehend Chemistry.☆23Aug 5, 2025Updated 7 months ago
- 日本語フェイクニュースデータセット☆20May 2, 2021Updated 4 years ago
- python版日本語意味役割付与システム(ASA)☆22Nov 11, 2022Updated 3 years ago
- An annotation tool for grounding of formulae☆24May 28, 2024Updated last year
- Discovering Universal Geometry in Embeddings with ICA (Published in EMNLP 2023)☆20Jun 17, 2025Updated 8 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆28Nov 18, 2024Updated last year