NiuTrans / LMT
Building an inclusive, scalable, and high-performance multilingual translation model
☆119Updated this week
Alternatives and similar repositories for LMT
Users that are interested in LMT are comparing it to the libraries listed below
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆85Updated 6 months ago
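- BayLing (百聆) is a LLaMA-based English/Chinese large language model enhanced with language alignment. It has strong English/Chinese capabilities and reaches 90% of ChatGPT's performance on multilingual and general-task tests. BayLing is an English/Chinese LLM equipped with advanced l…☆317Updated last year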
- Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation☆28Updated 6 months ago
- An introduction to basic concepts of Transformers and key techniques of their recent advances.☆51Updated 2 years ago
- Text deduplication☆77Updated last year
- ☆96Updated 2 years ago
- CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)☆258Updated 6 months ago
- ☆161Updated 5 months ago
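- This project performs fast encoding detection and conversion on large numbers of text files to support data cleaning for the mnbvc corpus project☆69Updated 3 months ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model☆120Updated last year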
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆96Updated 11 months ago
- PICA: a multi-turn empathetic dialogue model☆97Updated 2 years ago
- ☆27Updated 2 years ago
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrain☆265Updated 2 years ago
- ☆78Updated 2 years ago
- WritingBench: A Comprehensive Benchmark for Generative Writing☆155Updated last month
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆86Updated last year
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆123Updated 8 months ago
- A purer tokenizer with a higher compression rate☆488Updated last year
- ☆184Updated 2 years ago
- Fine-tuning large language models with the DPO algorithm; simple and easy to get started.☆48Updated last year
- [ACL 2024 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"☆60Updated last year
- We present a list of languages with their codes, families, regions, etc. We also present a list of multilingual corpora (with URLs).☆87Updated 4 years ago
- LLaMA Factory Document☆163Updated 3 weeks ago
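- A collection of currently available open-source Chinese dialogue datasets☆199Updated 2 years ago
- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to deeply understanding, discussing, and implementing the techniques, principles, and applications related to large models.☆367Updated last year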
- ☆30Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆136Updated last year
- Yet Another Chinese Learner Corpus☆78Updated 4 years ago
- Deep Reasoning Translation (DRT) Project☆240Updated 4 months ago