☆164Apr 17, 2023Updated 2 years ago
Alternatives and similar repositories for RefGPT
Users who are interested in RefGPT are comparing it to the libraries listed below
- ☆98Mar 20, 2024Updated last year
- ☆36Sep 6, 2024Updated last year
- A manually fine-tuned Chinese dialogue dataset and a snippet of ChatGLM fine-tuning code☆1,194May 3, 2025Updated 10 months ago
- ☆21Sep 12, 2023Updated 2 years ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆416Jun 25, 2025Updated 8 months ago
- Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training/instruction-tuning datasets☆3,055Apr 14, 2024Updated last year
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…☆2,798Dec 12, 2023Updated 2 years ago
- TigerBot: A multi-language multi-task LLM☆2,263Dec 28, 2024Updated last year
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆24Nov 6, 2024Updated last year
- YuLan-IR: Information Retrieval Boosted LMs☆220Mar 4, 2024Updated 2 years ago
- Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)☆2,789Mar 13, 2024Updated last year
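- The Panda project is an open-source overseas Chinese large language model project launched in May 2023, dedicated to exploring the entire technology stack in the era of large models and aiming to promote innovation and collaboration in Chinese natural language processing.☆1,036Oct 19, 2023Updated 2 years ago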
- An Open-sourced Knowledgeable Large Language Model Framework.☆1,375Jan 11, 2025Updated last year
- BELLE: Be Everyone's Large Language model Engine (open-source Chinese dialogue large language model)☆8,281Oct 16, 2024Updated last year
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- A tool for manually annotating and ranking response data in the RLHF stage of large-model training.☆256Aug 1, 2023Updated 2 years ago
- [COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset☆659Jun 19, 2023Updated 2 years ago
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.☆64Oct 12, 2023Updated 2 years ago
- The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆263May 9, 2024Updated last year
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model☆120Dec 10, 2024Updated last year
- The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set default ope…☆828May 28, 2024Updated last year
- A Unified Toolkit for Deep Learning-Based Table Extraction☆59Nov 21, 2024Updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆159Jul 25, 2025Updated 7 months ago
- ☆99Dec 5, 2023Updated 2 years ago
- An optimized version of GlobalPointer / NER (named entity recognition)☆122Jan 29, 2022Updated 4 years ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆162Jul 3, 2023Updated 2 years ago
- Analysis of the Chinese cognitive abilities of language models☆236Sep 9, 2023Updated 2 years ago
- GoGPT: A Chinese-English enhanced large model trained on Llama/Llama 2 | Chinese-Llama2☆78Oct 7, 2023Updated 2 years ago
- Chinese safety prompts for evaluating and improving the safety of LLMs.☆1,129Feb 27, 2024Updated 2 years ago
- A Chinese Open-Domain Dialogue System☆326Aug 16, 2023Updated 2 years ago
- Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback☆1,585Nov 24, 2025Updated 3 months ago
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,681Jul 18, 2024Updated last year
- A large-scale language model for the scientific domain, trained on the RedPajama arXiv split☆138Mar 1, 2024Updated 2 years ago
- An introduction to retrieval-augmented large language models☆306Sep 9, 2023Updated 2 years ago
- Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, …☆6,635Oct 24, 2024Updated last year
- [COLM'24] "How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?"☆22Oct 13, 2024Updated last year
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 3 months ago
- pCLUE: 1,000,000+ multi-task prompt-learning dataset☆506Oct 4, 2022Updated 3 years ago
- TechGPT: Technology-Oriented Generative Pretrained Transformer☆229Jun 29, 2023Updated 2 years ago