An introduction to basic concepts of Transformers and key techniques of their recent advances.
☆52Dec 21, 2023Updated 2 years ago
Alternatives and similar repositories for Introduction-to-Transformers
Users that are interested in Introduction-to-Transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A list of conferences and journals relevant to machine translation☆33Mar 17, 2022Updated 4 years ago
- [EMNLP 2022] This is the code repo for our EMNLP‘22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…☆13Oct 20, 2022Updated 3 years ago
- We present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls).☆87Jun 2, 2021Updated 5 years ago
- A repository used to organize content related to Large Speech(Audio) Model, including paper, data, applications, tools and so on.☆28Nov 8, 2025Updated 7 months ago
- NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University …☆163Jul 17, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 关于编译原理的作业☆10Apr 30, 2020Updated 6 years ago
- A Fast Neural Machine Translation System developed in C++.☆146Mar 7, 2024Updated 2 years ago
- ☆21Feb 13, 2023Updated 3 years ago
- ☆89Mar 24, 2023Updated 3 years ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆122Jun 18, 2025Updated 11 months ago
- MT paper lists (by conference)☆124Dec 7, 2020Updated 5 years ago
- A PyTorch native platform for training generative AI models☆17Apr 21, 2026Updated last month
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Feb 6, 2024Updated 2 years ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 7 months ago
- The code of COMMA: Modeling Relationship among Motivations, Emotions and Actions in Language-based Human Activities. https://aclanthology…☆12Oct 12, 2022Updated 3 years ago
- ☆10Sep 29, 2017Updated 8 years ago
- ☆10Mar 22, 2024Updated 2 years ago
- ☆26Dec 3, 2023Updated 2 years ago
- sougou医学词库爬取☆13Nov 21, 2019Updated 6 years ago
- The code of paper Affective Decoding for Empathetic Response Generation☆11Oct 12, 2021Updated 4 years ago
- 东北大学本科毕业设计 论文latex模板 适应2021届新版书写印制规范 针对计算机类专业☆11Apr 21, 2021Updated 5 years ago
- PyTorch code for the IEEE Access paper: SGPT: A Generative Approach for SPARQL Query Generation from Natural Language Questions☆13Sep 15, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Source code for SIGIR 2022 paper.☆16Apr 25, 2022Updated 4 years ago
- ☆15May 29, 2021Updated 5 years ago
- Code for "Towards Robust k-Nearest-Neighbor Machine Translation" (EMNLP 2022)☆12Oct 18, 2022Updated 3 years ago
- ☆18Mar 27, 2020Updated 6 years ago
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages☆17Apr 14, 2026Updated last month
- Software Agents Pacman project, implementing AI for Pacman in Python and PDDL☆12Oct 12, 2015Updated 10 years ago
- The official implementation for the paper Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense Knowledge.☆15Aug 14, 2023Updated 2 years ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- ☆15Feb 28, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ACL 2023] Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation☆14Jul 11, 2023Updated 2 years ago
- ☆19Jun 6, 2025Updated last year
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆29Nov 20, 2025Updated 6 months ago
- A Java-based SPARQL query generator☆12Apr 13, 2024Updated 2 years ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆19Oct 24, 2024Updated last year
- Study of protein N-terminal acetylation modification sites based on CNN-BiLSTM-Attention model☆13Dec 24, 2023Updated 2 years ago
- Praat scripting入门☆15Apr 8, 2025Updated last year