A large-scale language model for scientific domain, trained on redpajama arXiv split
☆138Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for science-llm
Users that are interested in science-llm are comparing it to the libraries listed below
Sorting:
- LLM for Astronomy[星语4.0]☆312Jul 8, 2025Updated 8 months ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Apr 7, 2024Updated last year
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 2 years ago
- Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集☆164Oct 25, 2023Updated 2 years ago
- Description of the final project in Information Retrieval course 2020.☆10Jan 13, 2021Updated 5 years ago
- 语言模型中文认知能力分析☆236Sep 9, 2023Updated 2 years ago
- Convert HuggingFace code and pretrained models to a PaddlePaddle supported format.☆20Jun 25, 2024Updated last year
- LLM with LuXun (鲁迅) style☆89May 15, 2023Updated 2 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆315Aug 8, 2024Updated last year
- ☆22Nov 5, 2024Updated last year
- An Instruction-tuned Large Language Model for E-commerce☆267Sep 26, 2023Updated 2 years ago
- ☆164Apr 17, 2023Updated 2 years ago
- ☆22Nov 5, 2024Updated last year
- ☆21Nov 5, 2024Updated last year
- 基于ChatGLM-6B的中文问诊模型☆829Oct 19, 2023Updated 2 years ago
- Robot simulator using web technologies, just JavaScript☆10Feb 13, 2020Updated 6 years ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆11Jan 16, 2021Updated 5 years ago
- search-rattailcollagen1 created by GitHub Classroom☆10Jan 17, 2021Updated 5 years ago
- Repository for a project that transforms PyTorch-based code and models into paddlepaddle-based code and models.☆23Nov 7, 2024Updated last year
- ☆22Jun 30, 2024Updated last year
- ☆22Nov 5, 2024Updated last year
- TechGPT: Technology-Oriented Generative Pretrained Transformer☆229Jun 29, 2023Updated 2 years ago
- Pre-trained Language Model for Scientific Text☆45Feb 22, 2024Updated 2 years ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆102Jul 9, 2024Updated last year
- Some hash models implemented with paddle☆23Aug 13, 2024Updated last year
- 本仓库包含7个使用Paddle实现的模型☆23Jul 29, 2024Updated last year
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- ☆10Nov 29, 2024Updated last year
- A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.☆387Dec 12, 2023Updated 2 years ago
- ChatMed: 中文医疗大模型,善于在线回答患者/用户的日常医疗相关问题!☆615Jul 16, 2023Updated 2 years ago
- 根据维基百科历史编辑数据提取纠错语料。☆12Apr 6, 2022Updated 3 years ago
- PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese☆390Jan 23, 2024Updated 2 years ago
- Hacky implementation of ppjoin by Chuan Xia et Al☆19Aug 24, 2014Updated 11 years ago
- ☆14Apr 20, 2025Updated 10 months ago
- MRT: Tracing the Evolution of Scientific Publications (TKDE 2021)☆18Mar 23, 2023Updated 2 years ago
- Parkar and Kim et al.'s paper on Can LLMs Select Important Instructions to Annotate?"☆13Jul 4, 2024Updated last year
- ☆35Jan 19, 2026Updated last month
- English or Chinses GPT2Dialog model from GPT2-chitchat☆12Feb 23, 2020Updated 6 years ago