flbbb / locost-summarization
☆27Updated last year
Alternatives and similar repositories for locost-summarization:
Users that are interested in locost-summarization are comparing it to the libraries listed below
- Official implementation of "GPT or BERT: why not both?"☆52Updated last month
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆31Updated 10 months ago
- ☆72Updated 11 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆42Updated last year
- Code for Zero-Shot Tokenizer Transfer☆127Updated 3 months ago
- Embedding Recycling for Language models☆38Updated last year
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆100Updated last month
- ☆21Updated 2 years ago
- Exploration of automated dataset selection approaches at large scales.☆37Updated last month
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated 3 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆45Updated 4 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆72Updated 8 months ago
- ☆27Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆26Updated last year
- MEXMA: Token-level objectives improve sentence representations☆40Updated 3 months ago
- Efficient Transformers with Dynamic Token Pooling☆60Updated last year
- ☆17Updated 6 months ago
- ☆13Updated 6 months ago
- ☆20Updated 2 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆16Updated last year
- ☆19Updated last year
- Few-shot Learning with Auxiliary Data☆27Updated last year
- Evaluation pipeline for the BabyLM Challenge 2023.☆75Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆57Updated 10 months ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆38Updated 3 weeks ago
- LTG-Bert☆32Updated last year
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆30Updated 9 months ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆42Updated 6 months ago
- Transformers at any scale