kaistAI / SelFee
Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"
☆227 · Updated last year
Alternatives and similar repositories for SelFee
Users who are interested in SelFee are comparing it to the repositories listed below
- ☆269 · Updated 2 years ago
- A Fine-tuned LLaMA that is Good at Arithmetic Tasks ☆177 · Updated last year
- Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback ☆206 · Updated 2 years ago
- FireAct: Toward Language Agent Fine-tuning ☆278 · Updated last year
- The data processing pipeline for the Koala chatbot language model ☆117 · Updated 2 years ago
- ☆172 · Updated last year
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models" ☆207 · Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆208 · Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples ☆211 · Updated last year
- An open-source chatbot built with ExpertPrompting which achieves 96% of ChatGPT's capability. ☆300 · Updated 2 years ago
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024] ☆371 · Updated 9 months ago
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools ☆139 · Updated 2 years ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆263 · Updated last year
- All available datasets for Instruction Tuning of Large Language Models ☆250 · Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ☆548 · Updated last year
- ☆361 · Updated 2 years ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆218 · Updated last year
- SAIL: Search Augmented Instruction Learning ☆157 · Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning ☆241 · Updated last year
- Simple next-token-prediction for RLHF ☆226 · Updated last year
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467 ☆285 · Updated 3 months ago
- Official repository for LongChat and LongEval ☆518 · Updated last year
- ☆309 · Updated 11 months ago
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration" ☆336 · Updated last year
- Unofficial implementation of AlpaGasus ☆91 · Updated last year
- ☆121 · Updated 11 months ago
- ☆156 · Updated last year
- Self-Alignment with Principle-Following Reward Models ☆161 · Updated 3 weeks ago
- Multi-agent Social Simulation + Efficient, Effective, and Stable alternative to RLHF. Code for the paper "Training Socially Aligned Langu… ☆346 · Updated last year
- A codebase for "Language Models can Solve Computer Tasks" ☆233 · Updated last year