kaistAI / SelFee
Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"
☆226Updated last year
Alternatives and similar repositories for SelFee:
Users that are interested in SelFee are comparing it to the libraries listed below
- ☆173Updated last year
- FireAct: Toward Language Agent Fine-tuning☆275Updated last year
- ☆270Updated 2 years ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆219Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated 2 years ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆207Updated last year
- Simple next-token-prediction for RLHF☆225Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆177Updated last year
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆206Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆300Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆547Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆260Updated last year
- Unofficial implementation of AlpaGasus☆91Updated last year
- All available datasets for Instruction Tuning of Large Language Models☆250Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆240Updated last year
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]☆370Updated 8 months ago
- ☆121Updated 10 months ago
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆138Updated 2 years ago
- Generative Judge for Evaluating Alignment☆236Updated last year
- ☆309Updated 10 months ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- A framework for human-readable prompt-based method with large language models. Specially designed for researchers. (Deprecated, check out…☆130Updated 2 years ago
- Open Source WizardCoder Dataset☆158Updated last year
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"☆332Updated 11 months ago
- Self-Alignment with Principle-Following Reward Models☆160Updated last year
- ☆314Updated 7 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- Official repository for LongChat and LongEval☆519Updated 11 months ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆209Updated last year
- A codebase for "Language Models can Solve Computer Tasks"☆234Updated last year