explodinggradients / nemesisLinks
Reward Model framework for LLM RLHF
☆61Updated 2 years ago
Alternatives and similar repositories for nemesis
Users that are interested in nemesis are comparing it to the libraries listed below
Sorting:
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Updated 2 years ago
- Open Implementations of LLM Analyses☆107Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 11 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- PyTorch implementation for MRL☆19Updated last year
- ☆85Updated 2 years ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆57Updated 4 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆99Updated 2 years ago
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- ☆86Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- Based on the tree of thoughts paper☆48Updated 2 years ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- ☆129Updated last year
- A repository for transformer critique learning and generation☆89Updated last year
- ☆43Updated last year
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Updated last year
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness☆101Updated 9 months ago
- ☆156Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 9 months ago
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆155Updated 2 years ago
- ☆37Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated 2 years ago
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testing☆52Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆134Updated 2 years ago