Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
☆96Aug 18, 2023Updated 2 years ago
Alternatives and similar repositories for Okapi
Users that are interested in Okapi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ⚡ LLaMA-2 model experiment☆12Nov 22, 2023Updated 2 years ago
- Multilingual Large Language Models Evaluation Benchmark☆132Aug 21, 2024Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆29Jul 24, 2023Updated 2 years ago
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)☆150Dec 31, 2024Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26May 14, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Jun 10, 2023Updated 2 years ago
- ☆78May 4, 2024Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆72Mar 6, 2024Updated 2 years ago
- ☆20Apr 8, 2025Updated last year
- ☆17Jul 10, 2022Updated 3 years ago
- Efficient Finetuning for OpenAI GPT-OSS☆24Oct 2, 2025Updated 7 months ago
- ☆18Nov 25, 2022Updated 3 years ago
- ☆26Jan 28, 2024Updated 2 years ago
- Knowledge Graph-augmented NMT☆11Sep 20, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆82Apr 11, 2024Updated 2 years ago
- Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation☆26Sep 30, 2022Updated 3 years ago
- Transformation spoken text to written text☆31May 14, 2024Updated last year
- Pre-training script for BART in JAX/Flax☆38Aug 4, 2022Updated 3 years ago
- restore tone for missing tone sentences☆13Jul 29, 2019Updated 6 years ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆104Jul 22, 2024Updated last year
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆26Jun 3, 2025Updated 10 months ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- VnDT: A Vietnamese Dependency Treebank☆24Nov 6, 2021Updated 4 years ago
- ☆18Feb 25, 2025Updated last year
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆338Dec 18, 2024Updated last year
- ☆16Dec 14, 2022Updated 3 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆13Sep 8, 2022Updated 3 years ago
- Use LoRA technique to improve training Large Language Model☆13Jul 25, 2023Updated 2 years ago
- ACL 2023 Dual-Alignment Pre-training for Cross-lingual Sentence Embedding☆24Aug 21, 2024Updated last year
- Emotion-Aware Dialogue Response Generation by Multi-Task Learning☆13Jan 22, 2022Updated 4 years ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆110Dec 25, 2022Updated 3 years ago
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- ☆11Feb 24, 2022Updated 4 years ago
- ☆11Oct 3, 2022Updated 3 years ago
- Showcasing various NLP Downstream tasks Training with pre-trained Language models using Pytorch Lightning☆13Aug 7, 2022Updated 3 years ago