🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
☆17 · Jun 5, 2025 · Updated last year
Alternatives and similar repositories for transformers
Users who are interested in transformers are comparing it to the libraries listed below.
- ☆20 · Jul 24, 2024 · Updated last year
- The repository for the course "Astroinformatics" offered at Institute of Astronomy, National Central University, from Sep/2022 to Jan/202… · ☆10 · Jun 4, 2024 · Updated 2 years ago
- Difference-based Contrastive Learning for Korean Sentence Embeddings · ☆23 · Mar 11, 2026 · Updated 3 months ago
- EMNLP 2022: Leveraging Locality in Abstractive Text Summarization · ☆18 · Oct 21, 2024 · Updated last year
- 0-Shot Tokenizer Transplant · ☆14 · May 16, 2025 · Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models · ☆25 · Aug 24, 2024 · Updated last year
- For the rlhf learning environment of Koreans · ☆25 · Sep 25, 2023 · Updated 2 years ago
- Google TPU optimizations for transformers models · ☆137 · Jan 23, 2026 · Updated 4 months ago
- Deep Learning Gravity Optimizer Source Code Repository · ☆14 · Jul 26, 2021 · Updated 4 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning · ☆99 · Apr 26, 2023 · Updated 3 years ago
- Pretraining and finetuning for visual instruction following with Mixture of Experts · ☆15 · Jan 30, 2024 · Updated 2 years ago
- #인권코퍼스 (Human Rights Corpus) · ☆31 · Oct 6, 2023 · Updated 2 years ago
- ☆14 · Jul 21, 2022 · Updated 3 years ago
- Korean datasets available on huggingface · ☆36 · Oct 10, 2024 · Updated last year
- ☆11 · Feb 2, 2023 · Updated 3 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation · ☆28 · Dec 9, 2022 · Updated 3 years ago
- ☆36 · Oct 4, 2023 · Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model · ☆45 · Oct 1, 2025 · Updated 8 months ago
- ☆197 · May 4, 2026 · Updated last month
- JAX implementation of the Llama 2 model · ☆217 · Feb 2, 2024 · Updated 2 years ago
- hllama is a library which aims to provide a set of utility tools for large language models. · ☆10 · Apr 16, 2024 · Updated 2 years ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark. · ☆17 · Feb 15, 2025 · Updated last year
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models · ☆19 · Apr 1, 2025 · Updated last year
- A Jax/Flax implementation for Korean LLM inference on TPU. · ☆12 · Jun 12, 2023 · Updated 3 years ago
- Take neural networks as APIs for human-like AI. · ☆20 · Dec 4, 2019 · Updated 6 years ago
- BERT score for text generation · ☆12 · Jan 15, 2025 · Updated last year
- Instruction Following Eval · ☆17 · Jan 16, 2025 · Updated last year
- Code for ACL 2018 paper "Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference". · ☆17 · Aug 5, 2018 · Updated 7 years ago
- Please visit https://github.com/HKUSTDial/NL2SQL360 to get the official code! · ☆10 · Sep 1, 2024 · Updated last year
- Kor-IR: Korean Information Retrieval Benchmark · ☆87 · Jul 3, 2024 · Updated last year
- Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens" · ☆13 · Jul 23, 2023 · Updated 2 years ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin… · ☆23 · Dec 23, 2023 · Updated 2 years ago
- Testing DeepSpeed integration in 🤗 Accelerate · ☆11 · Jun 28, 2022 · Updated 3 years ago
- Android support library for use Indexed Bitmap(8 bits per pixel). · ☆12 · Jun 22, 2017 · Updated 8 years ago
- ☆10 · Jan 20, 2024 · Updated 2 years ago
- Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022 · ☆16 · Jun 22, 2022 · Updated 3 years ago
- Like word2vec, except for letters of the alphabet. · ☆17 · May 29, 2017 · Updated 9 years ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis · ☆11 · Feb 17, 2023 · Updated 3 years ago
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3" · ☆13 · Mar 26, 2024 · Updated 2 years ago