An introduction to basic concepts of Transformers and key techniques of their recent advances.
☆52Dec 21, 2023Updated 2 years ago
Alternatives and similar repositories for Introduction-to-Transformers
Users that are interested in Introduction-to-Transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- We present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls).☆88Jun 2, 2021Updated 4 years ago
- NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University …☆163Jul 17, 2024Updated last year
- The implementation of "Shallow-to-Deep Training for Neural Machine Translation"☆10Oct 26, 2020Updated 5 years ago
- A Fast Neural Machine Translation System developed in C++.☆146Mar 7, 2024Updated 2 years ago
- The implementation of "Learning Deep Transformer Models for Machine Translation"☆116Jul 25, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- WanJuan3.0(“万卷·丝路”)一个作为综合性的纯文本语料库,采集了多个国家地区的网络公开信息、文献、专利等资料,数据总规模超1.2TB,Token总数超过300B,处于国际领先水平,首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成,每个子集的数据…☆44Feb 13, 2025Updated last year
- ☆21Feb 13, 2023Updated 3 years ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆119Jun 18, 2025Updated 9 months ago
- MT paper lists (by conference)☆124Dec 7, 2020Updated 5 years ago
- 东北大学校园网关客户端☆181Oct 7, 2024Updated last year
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Feb 6, 2024Updated 2 years ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 4 years ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 5 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆112Mar 31, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆10Mar 22, 2024Updated 2 years ago
- LaTeX Template for Qingdao University Bachelor Degree Thesis☆15Mar 28, 2020Updated 6 years ago
- sougou医学词库爬取☆13Nov 21, 2019Updated 6 years ago
- ☆10Oct 4, 2022Updated 3 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- ☆16Jun 22, 2017Updated 8 years ago
- 东北大学本科毕业设计 论文latex模板 适应2021届新版书写印制规范 针对计算机类专业☆11Apr 21, 2021Updated 4 years ago
- PyTorch code for the IEEE Access paper: SGPT: A Generative Approach for SPARQL Query Generation from Natural Language Questions☆13Sep 15, 2024Updated last year
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Source code for SIGIR 2022 paper.☆16Apr 25, 2022Updated 3 years ago
- Code for "Towards Robust k-Nearest-Neighbor Machine Translation" (EMNLP 2022)☆12Oct 18, 2022Updated 3 years ago
- A pytorch version of Yoon Kim's work(reproduced the Kim's result)☆13Feb 4, 2018Updated 8 years ago
- 《机器翻译:基础与模型》肖桐 朱靖波 著 - Machine Translation: Foundations and Models☆2,791Sep 14, 2024Updated last year
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated last month
- Micromodels -- A framework for accurate, explainable, data efficient, and reusable NLP models.☆14Feb 7, 2023Updated 3 years ago
- Official code for our COLING 2022 paper: In-Context Learning for Empathetic Dialogue Generation☆20Mar 1, 2023Updated 3 years ago
- The official implementation for the paper Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense Knowledge.☆15Aug 14, 2023Updated 2 years ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Jul 26, 2021Updated 4 years ago
- ☆15Feb 28, 2023Updated 3 years ago
- This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).☆21Jul 2, 2024Updated last year
- [ACL 2023] Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation☆14Jul 11, 2023Updated 2 years ago
- 电子科技大学雨课堂刷课☆25Jun 15, 2021Updated 4 years ago
- ☆18Jun 6, 2025Updated 10 months ago
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆28Nov 20, 2025Updated 4 months ago