Open-Assistant / oasst-model-evalLinks
Evaluation of the Open-Assistant language models
☆29Updated 5 months ago
Alternatives and similar repositories for oasst-model-eval
Users that are interested in oasst-model-eval are comparing it to the libraries listed below
Sorting:
- ☆416Updated 2 years ago
- batched loras☆349Updated 2 years ago
- A bagel, with everything.☆326Updated last year
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆371Updated 2 years ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆204Updated last year
- ☆457Updated 2 years ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆216Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆358Updated 2 years ago
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…☆224Updated 2 years ago
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…☆163Updated 2 years ago
- Code repository for the c-BTM paper☆108Updated 2 years ago
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆144Updated 2 years ago
- A minimum example of aligning language models with RLHF similar to ChatGPT☆225Updated 2 years ago
- Pre-training code for Amber 7B LLM☆170Updated last year
- ☆95Updated 2 years ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆226Updated 4 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆79Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆116Updated 2 years ago
- Inference code for Persimmon-8B☆412Updated 2 years ago
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ☆101Updated 2 years ago
- React app implementing OpenAI and Google APIs to re-create behavior of the toolformer paper.☆232Updated 2 years ago
- A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick☆294Updated 2 years ago
- Tune MPTs☆84Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆209Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆304Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Updated 2 years ago
- Experiments on speculative sampling with Llama models☆127Updated 2 years ago
- Multi-Domain Expert Learning☆67Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆213Updated last year