explodinggradients / nemesis
Reward Model framework for LLM RLHF
☆61Updated last year
Alternatives and similar repositories for nemesis:
Users that are interested in nemesis are comparing it to the libraries listed below
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- ☆24Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 6 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- ☆37Updated last year
- ☆119Updated 6 months ago
- ☆44Updated 4 months ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆94Updated last year
- A repository for transformer critique learning and generation☆89Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆126Updated last year
- Multi-Domain Expert Learning☆67Updated last year
- Open Implementations of LLM Analyses☆102Updated 6 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 5 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆53Updated 4 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆23Updated last year
- Track the progress of LLM context utilisation☆54Updated this week
- ☆48Updated 5 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- PyTorch implementation for MRL☆18Updated last year
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆109Updated last year
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness☆100Updated 2 months ago
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year