An introduction to basic concepts of Transformers and key techniques of their recent advances.
☆52Dec 21, 2023Updated 2 years ago
Alternatives and similar repositories for Introduction-to-Transformers
Users that are interested in Introduction-to-Transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A list of conferences and journals relevant to machine translation☆33Mar 17, 2022Updated 4 years ago
- Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation☆28Jun 30, 2025Updated 10 months ago
- A benchmark for testing memorization abilities of LMs☆24Oct 15, 2024Updated last year
- [EMNLP 2022] This is the code repo for our EMNLP‘22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…☆13Oct 20, 2022Updated 3 years ago
- A repository used to organize content related to Large Speech(Audio) Model, including paper, data, applications, tools and so on.☆28Nov 8, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Jul 26, 2021Updated 4 years ago
- NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University …☆163Jul 17, 2024Updated last year
- The implementation of "Shallow-to-Deep Training for Neural Machine Translation"☆10Oct 26, 2020Updated 5 years ago
- The implementation of "Learning Deep Transformer Models for Machine Translation"☆116Jul 25, 2024Updated last year
- ☆21Feb 13, 2023Updated 3 years ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆121Jun 18, 2025Updated 10 months ago
- A PyTorch native platform for training generative AI models☆17Apr 21, 2026Updated last week
- 东北大学校园网关客户端☆182Oct 7, 2024Updated last year
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Feb 6, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 4 years ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆113Mar 31, 2025Updated last year
- ☆10Sep 29, 2017Updated 8 years ago
- ☆10Mar 22, 2024Updated 2 years ago
- ☆26Dec 3, 2023Updated 2 years ago
- sougou医学词库爬取☆13Nov 21, 2019Updated 6 years ago
- ☆10Oct 4, 2022Updated 3 years ago
- The code of paper Affective Decoding for Empathetic Response Generation☆11Oct 12, 2021Updated 4 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 东北大学本科毕业设计 论文latex模板 适应2021届新版书写印制规范 针对计算机类专业☆11Apr 21, 2021Updated 5 years ago
- Code for "Towards Robust k-Nearest-Neighbor Machine Translation" (EMNLP 2022)☆12Oct 18, 2022Updated 3 years ago
- Evaluation utilities based on SymPy.☆22Dec 12, 2024Updated last year
- 《机器翻译:基础与模型》肖桐 朱靖波 著 - Machine Translation: Foundations and Models☆2,792Sep 14, 2024Updated last year
- Official code for our COLING 2022 paper: In-Context Learning for Empathetic Dialogue Generation☆20Mar 1, 2023Updated 3 years ago
- The official implementation for the paper Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense Knowledge.☆15Aug 14, 2023Updated 2 years ago
- ☆14Jul 26, 2021Updated 4 years ago
- ☆14Oct 28, 2022Updated 3 years ago
- This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).☆21Jul 2, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 医疗系统知识图谱问答☆12Feb 3, 2022Updated 4 years ago
- [ACL 2023] Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation☆14Jul 11, 2023Updated 2 years ago
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆28Nov 20, 2025Updated 5 months ago
- A Java-based SPARQL query generator☆12Apr 13, 2024Updated 2 years ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆19Oct 24, 2024Updated last year
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.☆45Nov 2, 2022Updated 3 years ago
- ☆12Mar 12, 2024Updated 2 years ago