cybertronai / bflm
☆16Updated 5 years ago
Alternatives and similar repositories for bflm
Users that are interested in bflm are comparing it to the libraries listed below
Sorting:
- Performance Prediction for NLP Tasks☆16Updated 5 years ago
- ☆14Updated last year
- Code repo for "Transformer on a Diet" paper☆31Updated 4 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated 2 years ago
- Helper scripts and notes that were used while porting various nlp models☆46Updated 3 years ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆65Updated 5 months ago
- Standalone pre-training recipe with JAX+Flax☆31Updated 2 years ago
- Techniques used to run BLOOM at inference in parallel☆37Updated 2 years ago
- ☆12Updated 3 years ago
- ☆47Updated 4 years ago
- ☆34Updated last year
- GASP! Dataset - Generating Abstracts of Scientific Papers from Abstracts of Cited Papers☆9Updated 5 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.☆53Updated 2 years ago
- Transformers at any scale☆41Updated last year
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- Finding Generalizable Evidence by Learning to Convince Q&A Models☆25Updated 2 years ago
- ☆96Updated last year
- ☆50Updated last year
- ☆72Updated last year
- Make triton easier☆47Updated 11 months ago
- Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs☆18Updated 3 years ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".☆69Updated last year
- ☆67Updated 2 years ago
- lanmt ebm☆11Updated 4 years ago
- ☆13Updated 4 years ago
- Scratchpad/Chain-of-Thought Prompts☆12Updated 2 years ago
- Staged Training for Transformer Language Models☆32Updated 3 years ago
- ☆20Updated 11 months ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆19Updated 2 years ago
- A diff tool for language models☆42Updated last year