An introduction to basic concepts of Transformers and key techniques of their recent advances.
☆52Dec 21, 2023Updated 2 years ago
Alternatives and similar repositories for Introduction-to-Transformers
Users that are interested in Introduction-to-Transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A list of conferences and journals relevant to machine translation☆33Mar 17, 2022Updated 4 years ago
- Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation☆30Jun 30, 2025Updated last year
- A benchmark for testing memorization abilities of LMs☆24Oct 15, 2024Updated last year
- [EMNLP 2022] This is the code repo for our EMNLP‘22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…☆13Oct 20, 2022Updated 3 years ago
- We present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls).☆87Jun 2, 2021Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆13Jul 26, 2021Updated 4 years ago
- NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University …☆163Jul 17, 2024Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆91Nov 13, 2024Updated last year
- ☆10Mar 18, 2025Updated last year
- A Fast Neural Machine Translation System developed in C++.☆147Mar 7, 2024Updated 2 years ago
- The implementation of "Learning Deep Transformer Models for Machine Translation"☆116Jul 25, 2024Updated last year
- ☆21Feb 13, 2023Updated 3 years ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆122Jun 18, 2025Updated last year
- MT paper lists (by conference)☆124Dec 7, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 东北大学校园网关客户端☆184Oct 7, 2024Updated last year
- ☆16Nov 28, 2023Updated 2 years ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 5 years ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 7 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆112Mar 31, 2025Updated last year
- The code of COMMA: Modeling Relationship among Motivations, Emotions and Actions in Language-based Human Activities. https://aclanthology…☆12Oct 12, 2022Updated 3 years ago
- ☆10Mar 22, 2024Updated 2 years ago
- ☆27Dec 3, 2023Updated 2 years ago
- ☆10Oct 4, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- The code of paper Affective Decoding for Empathetic Response Generation☆11Oct 12, 2021Updated 4 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- 东北大学本科毕业设计 论文latex模板 适应2021届新版书写印制规范 针对计算机类专业☆11Apr 21, 2021Updated 5 years ago
- PyTorch code for the IEEE Access paper: SGPT: A Generative Approach for SPARQL Query Generation from Natural Language Questions☆13Sep 15, 2024Updated last year
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Code for "Towards Robust k-Nearest-Neighbor Machine Translation" (EMNLP 2022)☆12Oct 18, 2022Updated 3 years ago
- ☆18Mar 27, 2020Updated 6 years ago
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages☆17Apr 14, 2026Updated 2 months ago
- Software Agents Pacman project, implementing AI for Pacman in Python and PDDL☆12Oct 12, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Micromodels -- A framework for accurate, explainable, data efficient, and reusable NLP models.☆14Feb 7, 2023Updated 3 years ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- ☆14Jul 26, 2021Updated 4 years ago
- ☆14Oct 28, 2022Updated 3 years ago
- ☆15Feb 28, 2023Updated 3 years ago
- This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).☆21Jul 2, 2024Updated last year
- 医疗系统知识图谱问答☆12Feb 3, 2022Updated 4 years ago