zinccat / dolly_chinese
Translation of the databricks-dolly-15k dataset to Chinese for commercial use.
☆19Updated 2 years ago
Alternatives and similar repositories for dolly_chinese
Users who are interested in dolly_chinese are comparing it to the repositories listed below
- MultilingualShareGPT, the free multi-language corpus for LLM training☆72Updated 2 years ago
- MOSS 003 WebSearchTool: A simple but reliable implementation☆45Updated 2 years ago
- Latest Evaluation Toolkit (LatestEval). Assessing language models with the latest, uncontaminated materials.☆25Updated 5 months ago
- backend for fastnlp MOSS project☆59Updated last year
- Gaokao Benchmark for AI☆108Updated 3 years ago
- Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD☆23Updated 2 years ago
- A preliminary evaluation of ChatGPT/GPT-4 for machine translation.☆248Updated 3 months ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Updated 2 years ago
- A unified tokenization tool for Images, Chinese and English.☆151Updated 2 years ago
- GAOKAO-Bench-Updates is a supplement to GAOKAO-Bench, a dataset for evaluating large language models.☆32Updated 6 months ago
- ☆59Updated last year
- Large-scale Chinese corpus☆42Updated 5 years ago
- ☆218Updated 2 years ago
- Calculate the probability of a paper being accepted by EMNLP2023 based on score distribution of ACL2023.☆14Updated last year
- Feeling confused about super alignment? Here is a reading list☆43Updated last year
- Data for paper "CC-Riddle: A Question Answering Dataset of Chinese Character Riddles": https://arxiv.org/abs/2206.13778☆18Updated last year
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆104Updated 2 years ago
- Perform crosstalk with Qian Yu☆54Updated last year
- ROUGE for multilingual Summarization☆25Updated 3 years ago
- Evaluating LLMs with Dynamic Data☆93Updated 2 months ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆41Updated 3 weeks ago
- ⚡Research papers about leveraging the capabilities of language models⚡☆52Updated 2 years ago
- Chinese large language model evaluation, phase one☆109Updated last year
- Machine translation data processing tools☆10Updated last year
- ☆12Updated 8 months ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- ☆77Updated last year
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Updated 2 years ago
- Code for paper "RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement"☆33Updated last year
- ☆31Updated 2 years ago