young-geng / koala_data_pipelineLinks

The data processing pipeline for the Koala chatbot language model

☆118

Alternatives and similar repositories for koala_data_pipeline

Users that are interested in koala_data_pipeline are comparing it to the libraries listed below

Sorting:

arnav-gudibande / koala-test-set
The test set for Koala
☆45Updated 2 years ago
togethercomputer / Llama-2-7B-32K-Instruct
☆85Updated 2 years ago
nlpxucan / evol-instruct
☆277Updated 2 years ago
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆219Updated 2 years ago
kaistAI / SelFee
Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"
☆228Updated 2 years ago
bhargaviparanjape / language-programmes
☆173Updated 2 years ago
yxuansu / OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
☆302Updated 2 years ago
togethercomputer / OpenDataHub
☆128Updated 2 years ago
dsdanielpark / open-llm-leaderboard-report
Weekly visualization report of Open LLM model performance based on 4 metrics.
☆86Updated last year
xrsrke / toolformer
Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools
☆144Updated 2 years ago
liutiedong / goat
a Fine-tuned LLaMA that is Good at Arithmetic Tasks
☆178Updated 2 years ago
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆72Updated last year
manyoso / haltt4llm
This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…
☆223Updated 2 years ago
LAION-AI / Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
☆209Updated last year
orhonovich / unnatural-instructions
☆180Updated 2 years ago
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆170Updated last year
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆79Updated last year
allenai / CommonGen-Eval
Evaluating LLMs with CommonGen-Lite
☆93Updated last year
wang-research-lab / agentinstruct
Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"
☆117Updated last month
haoliuhl / chain-of-hindsight
Simple next-token-prediction for RLHF
☆227Updated 2 years ago
salesforce / CodeGen2
CodeGen2 models for program synthesis
☆271Updated 2 years ago
conceptofmind / toolformer
☆379Updated 2 years ago
LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆307Updated last year
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆119Updated 2 years ago
bigcode-project / bigcode-analysis
Repository for analysis and experiments in the BigCode project.
☆127Updated last year
google / sycophancy-intervention
Scripts for generating synthetic finetuning data for reducing sycophancy.
☆117Updated 2 years ago
luohongyin / SAIL
SAIL: Search Augmented Instruction Learning
☆158Updated 4 months ago
kernelmachine / cbtm
Code repository for the c-BTM paper
☆108Updated 2 years ago
Anni-Zou / Meta-CoT
Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
☆99Updated 2 years ago
HazyResearch / TART
TART: A plug-and-play Transformer module for task-agnostic reasoning
☆201Updated 2 years ago