An introduction to basic concepts of Transformers and key techniques of their recent advances.
☆52Dec 21, 2023Updated 2 years ago
Alternatives and similar repositories for Introduction-to-Transformers
Users that are interested in Introduction-to-Transformers are comparing it to the libraries listed below.
- A list of conferences and journals relevant to machine translation☆33Mar 17, 2022Updated 4 years ago
- A benchmark for testing memorization abilities of LMs☆24Oct 15, 2024Updated last year
- [EMNLP 2022] This is the code repo for our EMNLP'22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…☆13Oct 20, 2022Updated 3 years ago
- We present a list of languages with their codes, families, regions, etc. We also present a list of multilingual corpora (with URLs).☆87Jun 2, 2021Updated 4 years ago
- NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University …☆163Jul 17, 2024Updated last year
- The implementation of "Shallow-to-Deep Training for Neural Machine Translation"☆10Oct 26, 2020Updated 5 years ago
- ☆10Mar 18, 2025Updated last year
- The implementation of "Learning Deep Transformer Models for Machine Translation"☆116Jul 25, 2024Updated last year
- ☆21Feb 13, 2023Updated 3 years ago
- ☆86Mar 24, 2023Updated 3 years ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆123Jun 18, 2025Updated 11 months ago
- MT paper lists (by conference)☆124Dec 7, 2020Updated 5 years ago
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Feb 6, 2024Updated 2 years ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 4 years ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆113Mar 31, 2025Updated last year
- A record of my paper reading on machine translation and related work.☆36Nov 19, 2021Updated 4 years ago
- ☆10Mar 22, 2024Updated 2 years ago
- Scraper for the sougou medical lexicon☆13Nov 21, 2019Updated 6 years ago
- The code for the paper "Affective Decoding for Empathetic Response Generation"☆11Oct 12, 2021Updated 4 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Source code for SIGIR 2022 paper.☆16Apr 25, 2022Updated 4 years ago
- ☆18Mar 27, 2020Updated 6 years ago
- Evaluation utilities based on SymPy.☆22Dec 12, 2024Updated last year
- Multilingual Compositional Wikidata Questions (MCWQ)☆20Jun 12, 2023Updated 2 years ago
- A PyTorch version of Yoon Kim's work (reproduces Kim's results)☆13Feb 4, 2018Updated 8 years ago
- Official code for our COLING 2022 paper: In-Context Learning for Empathetic Dialogue Generation☆20Mar 1, 2023Updated 3 years ago
- The official implementation for the paper Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense Knowledge.☆15Aug 14, 2023Updated 2 years ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- ☆14Jul 26, 2021Updated 4 years ago
- ☆15Feb 28, 2023Updated 3 years ago
- This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).☆21Jul 2, 2024Updated last year
- Knowledge-graph question answering for a medical system☆12Feb 3, 2022Updated 4 years ago
- [ACL 2023] Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation☆14Jul 11, 2023Updated 2 years ago
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆29Nov 20, 2025Updated 6 months ago
- Study of protein N-terminal acetylation modification sites based on CNN-BiLSTM-Attention model☆13Dec 24, 2023Updated 2 years ago
- ☆12Mar 12, 2024Updated 2 years ago
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, etc.☆45Nov 2, 2022Updated 3 years ago
- Official site: https://learn.lianglianglee.com/ The articles there are very well written, so I forked a copy to keep as a backup.☆13Sep 6, 2022Updated 3 years ago