The test set for Koala
☆45Mar 31, 2023Updated 3 years ago
Alternatives and similar repositories for koala-test-set
Users that are interested in koala-test-set are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The data processing pipeline for the Koala chatbot language model☆118Apr 6, 2023Updated 3 years ago
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- ☆22May 7, 2025Updated last year
- ☆24Dec 2, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)☆30Apr 27, 2022Updated 4 years ago
- ☆17May 19, 2023Updated 3 years ago
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,516Aug 13, 2024Updated last year
- lanmt ebm☆12Jun 19, 2020Updated 5 years ago
- ☆46May 3, 2026Updated last month
- A spoken version of the textual story cloze benchmark☆22Aug 6, 2023Updated 2 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆11Oct 25, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"☆49Oct 21, 2023Updated 2 years ago
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, ect...) on a GPU. It makes use of auto-di…☆10Aug 28, 2018Updated 7 years ago
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- ☆11Apr 21, 2023Updated 3 years ago
- Flask 로 API 를 만들기 위한 튜토리얼☆10Jun 22, 2020Updated 5 years ago
- ☆47Apr 24, 2022Updated 4 years ago
- scripts used for SMT system submitted to WMT 2014☆12Apr 30, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- DEMix Layers for Modular Language Modeling☆54Feb 25, 2026Updated 3 months ago
- Making large AI models cheaper, faster and more accessible☆15Apr 20, 2023Updated 3 years ago
- Wikipedia based dataset to train relationship classifiers and fact extraction models☆25May 25, 2021Updated 5 years ago
- Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation☆44Jan 9, 2024Updated 2 years ago
- [ICML 2026 Oral] Agent-native Mid-training for Software Engineering☆58Jan 31, 2026Updated 4 months ago
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.☆14Jun 3, 2023Updated 3 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆19Oct 4, 2022Updated 3 years ago
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆844Jul 1, 2024Updated last year
- ☆10Feb 12, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Mar 24, 2023Updated 3 years ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆69Dec 9, 2024Updated last year
- ☆64Apr 9, 2024Updated 2 years ago
- GOPHI: an AMR-to-English Verbalizer☆11Feb 5, 2020Updated 6 years ago
- ☆23Oct 15, 2022Updated 3 years ago
- Code for the paper "Attention Temperature Matters in Abstractive Summarization Distillation"(https://arxiv.org/abs/2106.03441)☆13Mar 25, 2022Updated 4 years ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆96Feb 22, 2023Updated 3 years ago