IBM / Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
☆1,139Updated last year
Alternatives and similar repositories for Dromedary:
Users that are interested in Dromedary are comparing it to the libraries listed below
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆796Updated 8 months ago
- LOMO: LOw-Memory Optimization☆981Updated 8 months ago
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".☆784Updated 11 months ago
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".☆1,118Updated last year
- [NIPS2023] RRHF & Wombat☆804Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆919Updated 4 months ago
- A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)☆1,113Updated last year
- ☆734Updated 8 months ago
- Code for fine-tuning Platypus fam LLMs using LoRA☆628Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆42Updated last year
- ☆906Updated 9 months ago
- ☆1,028Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆819Updated last year
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.☆542Updated 11 months ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.☆760Updated 4 months ago
- Official repository for LongChat and LongEval☆517Updated 9 months ago
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333☆1,088Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,690Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆541Updated last year
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,391Updated last year
- ☆1,455Updated last year
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…☆1,450Updated last year
- Alpaca dataset from Stanford, cleaned and curated☆1,537Updated last year
- ☆903Updated last year
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,391Updated 2 months ago
- A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research …☆942Updated 2 months ago
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them☆466Updated 8 months ago
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆917Updated 8 months ago
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting☆2,677Updated 7 months ago
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.☆548Updated last year