EastTower16 / LLMDataDistill
distill large scale web page text
☆12Updated last year
Related projects: ⓘ
- ☆50Updated this week
- ☆57Updated this week
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.☆50Updated last year
- Finetune CPM-1☆24Updated 3 years ago
- ☆16Updated this week
- ☆18Updated 3 months ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆18Updated 2 years ago
- Pytorch implementation of models described in "Grounded compositional outputs for adaptive language modeling", EMNLP 2020.☆18Updated 3 years ago
- ☆15Updated last year
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval☆25Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆27Updated this week
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆25Updated last year
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆23Updated 9 months ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Updated 2 years ago
- Code for EMNLP-2018 paper "Variational Autoregressive Decoder for Neural Response Generation"☆16Updated 4 years ago
- Staged Training for Transformer Language Models☆28Updated 2 years ago
- Unifew: Unified Fewshot Learning Model☆18Updated 3 years ago
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations☆30Updated 2 years ago
- ☆92Updated last year
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"☆20Updated 3 years ago
- WebRED is a large and diverse manually annotated dataset for extracting relationships from a variety of text found on the World Wide Web.☆20Updated 3 years ago
- Implementation of NeurIPS 20 paper: Latent Template Induction with Gumbel-CRFs☆56Updated 3 years ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆33Updated 6 months ago
- Implementation of "Modeling Past and Future for Neural Machine Translation"☆15Updated 6 years ago
- ROUGE for multilingual Summarization☆23Updated 2 years ago
- SuperCLUE高考作文机器自动阅卷系统☆12Updated last year
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆32Updated 8 months ago
- PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialog…☆27Updated 2 years ago
- ☆16Updated last year