A Multi-tasking and Multi-stage Chinese Minority Pre-Trained Language Model
☆12Jul 24, 2023Updated 2 years ago
Alternatives and similar repositories for CMPT
Users who are interested in CMPT are comparing it to the repositories listed below
- ☆17Oct 8, 2023Updated 2 years ago
- Tibetan large language models incrementally pre-trained from LLAMA2: Tibetan-LLAMA2-7B & Tibetan-LLAMA2-13B; instruction-tuned Tibetan large models: Tibetan-Alpaca-7B & Tibetan-Alpaca-13B.☆43Jun 8, 2024Updated last year
- Baidu Qianfan Deep Research☆26Updated this week
- TiLamb (Tibetan Large Language Model Base), a Tibetan large language model incrementally pre-trained from LLaMA2-7B☆36Apr 3, 2024Updated last year
- [ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)☆31Jan 17, 2026Updated 2 months ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- ☆26Nov 7, 2022Updated 3 years ago
- ☆16Aug 14, 2022Updated 3 years ago
- final-project-level3-nlp-02 created by GitHub Classroom☆11Dec 31, 2021Updated 4 years ago
- Code for ACL 2022 paper "HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization".☆13May 24, 2022Updated 3 years ago
- code for ACL 2019 paper "cross lingual training for automatic question generation"☆14Jun 30, 2019Updated 6 years ago
- ☆14Aug 30, 2023Updated 2 years ago
- Official code repo for paper: ACROSS: An Alignment-based Framework for Low-Resource Many-to-One Cross-Lingual Summarization☆12Jul 15, 2023Updated 2 years ago
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- Award-winning solution for the iFLYTEK Low-Resource Multilingual Text Translation Challenge☆27Sep 19, 2023Updated 2 years ago
- Code and dataset for 'Contrastive Aligned Joint Learning for Multilingual Summarization'☆13Mar 24, 2022Updated 3 years ago
- ☆18Apr 11, 2021Updated 4 years ago
- ☆13Aug 27, 2021Updated 4 years ago
- MergeNet-filter-ldr2hdr, details in the paper "Reconstructing HDR Image from a Single Filtered LDR Image Base on a Deep HDR Merger Network"☆10Sep 11, 2019Updated 6 years ago
- Finetune t5 and bart on Chinese Grammatical Error Correction data.☆19Aug 24, 2022Updated 3 years ago
- NLP models and codes for BAAI-JD joint project.☆10May 27, 2020Updated 5 years ago
- Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"☆15Oct 25, 2024Updated last year
- X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents (JCDL 2022)☆14Jul 22, 2022Updated 3 years ago
- TIP-LAS: An open source toolkit for Tibetan word segmentation and part-of-speech tagging☆82Nov 11, 2022Updated 3 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Feb 26, 2026Updated 3 weeks ago
- [Machine Learning 2023] NaCL: Noise-Robust Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation☆12Jul 8, 2023Updated 2 years ago
- One implementation of the paper "Controllable Neural Dialogue Summarization with Personal Named Entity Planning" (EMNLP 2022).☆18Nov 9, 2023Updated 2 years ago
- Ancient book recognition☆15May 19, 2021Updated 4 years ago
- Source Code For ACL 2021 Paper "Mention Flags (MF): Constraining Transformer-based Text Generators"☆20Oct 4, 2021Updated 4 years ago
- Research on the Manchu hypothesis of the Voynich manuscript. It's an Oracle database with tables, DML scripts, PL/SQL functions and queries.☆16Jun 11, 2014Updated 11 years ago
- Tibetan to English Machine Translation☆10Dec 24, 2020Updated 5 years ago
- echomimic environment-free installation Windows all-in-one package, ready to use after extraction☆20Aug 26, 2024Updated last year
- The code for the ACL 2022 paper "CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question Generation"☆23Oct 23, 2022Updated 3 years ago
- Pascal2 Harvest project QuEst☆14Sep 15, 2014Updated 11 years ago
- Code for the ACL2022 main conference paper "A Variational Hierarchical Model for Neural Cross-Lingual Summarization"☆18Sep 5, 2022Updated 3 years ago
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Mar 21, 2022Updated 4 years ago
- A Manchu dictionary bot for Telegram.☆12Feb 14, 2021Updated 5 years ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago