CarperAI/autocrit

A repository for transformer critique learning and generation

☆88

Alternatives and similar repositories for autocrit

Users that are interested in autocrit are comparing it to the libraries listed below.

Dahoas / reward-modeling
☆98 · May 30, 2023 · Updated 3 years ago
vicgalle / zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions
☆34 · Aug 15, 2023 · Updated 2 years ago
RUCAIBox / FIGA
[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"
☆10 · May 5, 2024 · Updated 2 years ago
EleutherAI / magiCARP
One stop shop for all things carp
☆58 · Sep 9, 2022 · Updated 3 years ago
icip-cas / LiteCoder
Advancing Small and Medium-sized Code Agents.
☆17 · May 29, 2026 · Updated 2 months ago
haoliuhl / chain-of-hindsight
Simple next-token-prediction for RLHF
☆228 · Sep 30, 2023 · Updated 2 years ago
CarperAI / trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,752 · Jan 8, 2024 · Updated 2 years ago
EleutherAI / elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
☆221 · Updated this week
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62 · Jun 21, 2023 · Updated 3 years ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆20 · Jan 28, 2025 · Updated last year
HomebrewML / Olmax
HomebrewNLP in JAX flavour for maintainable TPU-Training
☆50 · Jan 20, 2024 · Updated 2 years ago
stefan-it / gc4lm
GC4LM: A Colossal (Biased) language model for German
☆13 · May 2, 2021 · Updated 5 years ago
isle-dev / MetricEval
MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…
☆12 · Nov 6, 2023 · Updated 2 years ago
viking-sudo-rm / rusty-dawg
Rust library for indexing and quickly searching large pretraining corpora
☆31 · Oct 30, 2025 · Updated 8 months ago
euclaise / supertrainer2000
☆50 · Mar 14, 2024 · Updated 2 years ago
LAION-AI / Anh
Anh - LAION's multilingual assistant datasets and models
☆28 · Apr 5, 2023 · Updated 3 years ago
acmi-lab / pretraining-with-nonsense
Pretraining summarization models using a corpus of nonsense
☆13 · Sep 28, 2021 · Updated 4 years ago
TehVenomm / LM_Transformers_BlockMerge
Image Diffusion block merging technique applied to transformers based Language Models.
☆55 · May 8, 2023 · Updated 3 years ago
arthurpaulino / NumLean
A Lean 4 package for heavy numerical computations
☆20 · Jan 16, 2022 · Updated 4 years ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆45 · Oct 1, 2025 · Updated 9 months ago
Overworldai / owl-wms
Basic world models
☆33 · Oct 30, 2025 · Updated 8 months ago
iwalton3 / mpt-lora-patch
Patch for MPT-7B which allows using and training a LoRA
☆57 · May 20, 2023 · Updated 3 years ago
Damilytutu / SEM-MEM
An Improved LSTM-based Network: Learning Explicit Shape and Motion Evolution Maps for Skeleton-based Human Action Recognition
☆14 · Oct 21, 2017 · Updated 8 years ago
OpenBMB / UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
☆368 · Dec 29, 2023 · Updated 2 years ago
facebookresearch / mmd
ML models often mispredict, and it is hard to tell when and why. We present a data mining based approach to discover whether there is a c…
☆17 · Jun 6, 2022 · Updated 4 years ago
kyegomez / VisionLLaMA
Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
☆15 · Nov 11, 2024 · Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆109 · Sep 26, 2023 · Updated 2 years ago
Alignment-Lab-AI / datagen
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆32 · Sep 22, 2024 · Updated last year
BishalN / Threadgenie
Effortlessly Create Engaging and Informative Threads in Minutes
☆14 · Feb 3, 2023 · Updated 3 years ago
zhaoxlpku / SubgoalXL
☆26 · Aug 23, 2024 · Updated last year
g588928812 / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆11 · Jul 22, 2023 · Updated 3 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆98 · Feb 9, 2023 · Updated 3 years ago
GanjinZero / RRHF
[NIPS2023] RRHF & Wombat
☆805 · Sep 22, 2023 · Updated 2 years ago
The-Swarm-Corporation / AgentParse
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆18 · Oct 13, 2025 · Updated 9 months ago
CarperAI / OpenELM
Evolution Through Large Models
☆743 · Nov 15, 2023 · Updated 2 years ago
RUCAIBox / RLMEC
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
☆39 · Jan 12, 2024 · Updated 2 years ago
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆215 · Aug 8, 2024 · Updated last year
Rallio67 / language-model-agents
Experiments with generating opensource language model assistants
☆97 · May 14, 2023 · Updated 3 years ago
voidism / L2KD
Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123
☆12 · Jul 13, 2021 · Updated 5 years ago