explodinggradients / nemesis

Reward Model framework for LLM RLHF

☆61

Alternatives and similar repositories for nemesis

Users who are interested in nemesis are comparing it to the libraries listed below.

salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79 Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆72 Updated last year
explodinggradients / Funtuner
Supervised instruction finetuning for LLM with HF trainer and Deepspeed
☆36 Updated 2 years ago
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆107 Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆109 Updated 11 months ago
xingyaoww / LeTI
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆66 Updated 2 years ago
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆78 Updated last year
krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆19 Updated last year
togethercomputer / Llama-2-7B-32K-Instruct
☆85 Updated 2 years ago
neulab / ragged
Retrieval Augmented Generation Generalized Evaluation Dataset
☆57 Updated 4 months ago
Anni-Zou / Meta-CoT
Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
☆99 Updated 2 years ago
reasoning-machines / prompt-lib
A set of utilities for running few-shot prompting experiments on large-language models
☆126 Updated 2 years ago
geronimi73 / phi2-finetune
☆86 Updated last year
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆50 Updated last year
qrdlgit / graph-of-thoughts
Based on the tree of thoughts paper
☆48 Updated 2 years ago
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆79 Updated last year
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75 Updated last year
SALT-NLP / demonstrated-feedback
☆129 Updated last year
CarperAI / autocrit
A repository for transformer critique learning and generation
☆89 Updated last year
patronus-ai / Lynx-hallucination-detection
☆43 Updated last year
uclaml / Rephrase-and-Respond
Official repo of Rephrase-and-Respond: data, code, and evaluation
☆104 Updated last year
salesforce / AuditNLG
AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness
☆101 Updated 9 months ago
jakespringer / echo-embeddings
☆156 Updated last year
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29 Updated 9 months ago
tomekkorbak / pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
☆180 Updated last year
yueyu1030 / AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
☆155 Updated 2 years ago
deep-diver / LLM-Pref-Mark-UI
☆37 Updated 2 years ago
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69 Updated 2 years ago
h2oai / h2o-LLM-eval
Large-language Model Evaluation framework with Elo Leaderboard and A-B testing
☆52 Updated last year
microsoft / llm-data-creation
Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"
☆134 Updated 2 years ago