explodinggradients / nemesis
Reward Model framework for LLM RLHF
β58Updated last year
Alternatives and similar repositories for nemesis:
Users that are interested in nemesis are comparing it to the libraries listed below
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β66Updated 2 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."β63Updated last year
- Small and Efficient Mathematical Reasoning LLMsβ71Updated 11 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeedβ34Updated last year
- Advanced Reasoning Benchmark Dataset for LLMsβ45Updated last year
- A repository for transformer critique learning and generationβ88Updated last year
- β37Updated last year
- Codebase accompanying the Summary of a Haystack paper.β74Updated 3 months ago
- β48Updated last year
- β24Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 6 months ago
- β116Updated 3 months ago
- β74Updated last year
- Writing Blog Posts with Generative Feedback Loops!β46Updated 10 months ago
- For experiments involving instruct gpt. Currently used for documenting open research questions.β71Updated 2 years ago
- Just a bunch of benchmark logs for different LLMsβ116Updated 5 months ago
- β34Updated last year
- Open Implementations of LLM Analysesβ98Updated 3 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)β29Updated 10 months ago
- PyTorch implementation for MRLβ18Updated 10 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Modelsβ22Updated 11 months ago
- Graph-based method for end-to-end code completion with context awareness on repositoryβ54Updated 4 months ago
- Scalable Meta-Evaluation of LLMs as Evaluatorsβ42Updated 11 months ago
- Retrieval Augmented Generation Generalized Evaluation Datasetβ52Updated last month
- Explore the use of DSPy for extracting features from PDFs πβ37Updated 10 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptionsβ68Updated last year
- β137Updated 9 months ago
- β96Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkIβ91Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuningβ221Updated last year