explodinggradients / nemesis
Reward Model framework for LLM RLHF
☆56 Updated last year
Related projects:
- Codebase accompanying the Summary of a Haystack paper. ☆65 Updated 2 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆69 Updated 7 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆60 Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆68 Updated last week
- ☆105 Updated this week
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆84 Updated 11 months ago
- ☆42 Updated 2 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆32 Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆64 Updated 2 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆67 Updated 2 months ago
- Evaluation and analysis code for LLM360 ☆75 Updated 3 months ago
- ☆34 Updated last year
- A repository for transformer critique learning and generation ☆84 Updated 9 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset ☆51 Updated this week
- ☆37 Updated last year
- ☆24 Updated last year
- ☆85 Updated 7 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆90 Updated last year
- Based on the tree of thoughts paper ☆45 Updated last year
- ☆52 Updated 7 months ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning ☆40 Updated 9 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" ☆106 Updated last week
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators" ☆107 Updated 11 months ago
- ☆30 Updated 4 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆99 Updated last month
- ☆109 Updated last month
- Track the progress of LLM context utilisation ☆53 Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs ☆112 Updated last month
- ToolBench, an evaluation suite for LLM tool manipulation capabilities. ☆134 Updated 6 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆72 Updated 8 months ago