godweiyang / ls-gpt2-demo
A demo of GPT2 model trained and infered with LightSeq
☆10Updated 2 years ago
Related projects: ⓘ
- 逻辑回归和单层softmax的解析解☆12Updated 3 years ago
- Python下shuffle几百G文件☆33Updated 3 years ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆18Updated 3 months ago
- implement bert in pure c++☆30Updated 4 years ago
- A small framework mimics PyTorch using CuPy or NumPy☆27Updated 2 years ago
- KuaiSearch PERKS☆11Updated 2 years ago
- ☆16Updated this week
- adafactor optimizer for keras☆20Updated 3 years ago
- 基于Transformer的单模型、多尺度的VAE模型☆53Updated 3 years ago
- 一些RNN的实现☆47Updated last year
- LGEB: Benchmark of Language Generation Evaluation☆16Updated last year
- lightweighted deep learning inference service framework☆38Updated 3 years ago
- Finetune CPM-1☆24Updated 3 years ago
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆13Updated last week
- stitch multiple images into one big image☆18Updated last year
- ROUGE for multilingual Summarization☆23Updated 2 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆95Updated last year
- OpenLLMDE: An open source data engineering framework for LLMs☆16Updated last year
- InsNet Runs Instance-dependent Neural Networks with Padding-free Dynamic Batching.☆66Updated 2 years ago
- 无监督文本生成的一些方法☆49Updated 3 years ago
- List some datasets in NLP field.☆29Updated 3 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Updated 2 years ago
- ☆23Updated last year
- Notes of my introduction about NLP in Fudan University☆37Updated 3 years ago
- ☆52Updated 3 years ago
- 大规模中文语料☆34Updated 4 years ago
- ☆10Updated this week
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆25Updated last year
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 3 years ago
- 定时爬取arXiv每日论文☆12Updated last year