nlp-uoregon / Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
☆89Updated last year
Related projects: ⓘ
- Multilingual Large Language Models Evaluation Benchmark☆91Updated 3 weeks ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆96Updated 5 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆118Updated 6 months ago
- A Multilingual Replicable Instruction-Following Model☆91Updated last year
- ☆73Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆80Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆62Updated 6 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆91Updated last year
- Finetune mistral-7b-instruct for sentence embeddings☆65Updated 4 months ago
- Code for Zero-Shot Tokenizer Transfer☆109Updated 2 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆63Updated last week
- ☆118Updated 5 months ago
- ☆160Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆69Updated 6 months ago
- ☆94Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆159Updated 11 months ago
- ☆65Updated last year
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆29Updated 6 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆91Updated last year
- contrastive decoding☆174Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆77Updated last month
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆68Updated last month
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆174Updated last week
- ☆105Updated this week
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆109Updated last week
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆102Updated last year
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆177Updated last year
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆85Updated last month
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆118Updated last year
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆29Updated 6 months ago