godweiyang / ls-gpt2-demo
A demo of a GPT2 model trained and run for inference with LightSeq
☆10Updated 2 years ago
Alternatives and similar repositories for ls-gpt2-demo
Users who are interested in ls-gpt2-demo are comparing it to the repositories listed below
- Analytical solutions for logistic regression and single-layer softmax☆12Updated 3 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Updated 2 years ago
- Shuffling files of several hundred GB in Python☆33Updated 3 years ago
- ☆23Updated 2 years ago
- stitch multiple images into one big image☆18Updated 2 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Updated 2 years ago
- Some RNN implementations☆50Updated 2 years ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Updated last year
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"☆24Updated 2 years ago
- Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.☆20Updated 2 years ago
- A small framework that mimics PyTorch using CuPy or NumPy☆37Updated 3 years ago
- ☆22Updated last year
- ☆52Updated 4 years ago
- Virtual Adversarial Training (VAT) techniques in PyTorch☆17Updated 2 years ago
- adafactor optimizer for keras☆20Updated 3 years ago
- KuaiSearch PERKS☆11Updated 3 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆18Updated 2 years ago
- [ICLR 2022] Code for paper "Exploring Extreme Parameter Compression for Pre-trained Language Models"(https://arxiv.org/abs/2205.10036)☆22Updated 2 years ago
- Bi-Directional Attention Flow for Machine Comprehensions☆9Updated 7 years ago
- Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning☆47Updated last year
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Updated 2 years ago
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Updated 2 years ago
- ☆13Updated 4 years ago
- Notes from my introduction to NLP at Fudan University☆37Updated 3 years ago
- Some methods for unsupervised text generation☆48Updated 4 years ago
- code and data for paper "Learning Kernel-Smoothed Machine Translation with Retrieved Examples"☆24Updated 3 years ago
- Manages vllm-nccl dependency☆17Updated last year
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"☆20Updated 3 years ago
- A Translation Task using TurboTransformers☆11Updated 4 years ago
- ☆11Updated 3 years ago