NiuTrans / LMTLinks
Building inclusive, scalable, and high-performance multilingual translation.
☆114Updated last week
Alternatives and similar repositories for LMT
Users that are interested in LMT are comparing it to the libraries listed below
Sorting:
- Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation☆27Updated 5 months ago
- CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)☆258Updated 5 months ago
- We present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls).☆86Updated 4 years ago
- “百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型,具有优越的英语/中文能力,在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced l…☆318Updated last year
- An introduction to basic concepts of Transformers and key techniques of their recent advances.☆51Updated last year
- ☆181Updated 2 years ago
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆84Updated 5 months ago
- 更纯粹、更高压缩率的Tokenizer☆486Updated last year
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆176Updated 11 months ago
- A list of conferences and journals relevant to machine translation☆33Updated 3 years ago
- 基于DPO算法微调语言大模型,简单好上手。☆48Updated last year
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆73Updated last year
- 中文 Instruction tuning datasets☆141Updated last year
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrain☆263Updated 2 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆121Updated 6 months ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆95Updated 9 months ago
- 中文图书语料MD5链接☆218Updated last year
- ☆282Updated last year
- 万卷1.0多模态语料☆569Updated 2 years ago
- Yet Another Chinese Learner Corpus☆78Updated 3 years ago
- Collaborative Training of Large Language Models in an Efficient Way☆417Updated last year
- ☆252Updated last year
- 活字通用大模型☆391Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆136Updated last year
- Python ROUGE Score Implementation for Chinese Language Task (official rouge score)☆111Updated last year
- A Fast Neural Machine Translation System developed in C++.☆144Updated last year
- 用于汇总目前的开源中文对话数据集☆191Updated 2 years ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆253Updated last year
- 文本去重☆77Updated last year
- ☆78Updated 2 years ago