syncdoth/Chain-of-Hindsight-PyTorch

Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using PyTorch and Hugging Face Trainers.

☆11

Alternatives and similar repositories for Chain-of-Hindsight-PyTorch

Users who are interested in Chain-of-Hindsight-PyTorch are comparing it to the libraries listed below.

liziniu / policy_optimization
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)
☆29 · Dec 19, 2023 · Updated 2 years ago
mlwu22 / RED
Implementation code for ACL 2024: Advancing Parameter Efficiency in Fine-tuning via Representation Editing
☆15 · Apr 20, 2024 · Updated 2 years ago
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆20 · May 24, 2024 · Updated 2 years ago
doerjiayi / algorithm
☆11 · Jul 31, 2020 · Updated 5 years ago
TrueNobility303 / F2BA
A tale of works on the complexity of first-order bilevel optimization.
☆25 · Jan 27, 2026 · Updated 6 months ago
vfleaking / PTST
Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"
☆22 · Sep 21, 2025 · Updated 10 months ago
thu-coai / ComplexBench
Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)
☆102 · Feb 20, 2025 · Updated last year
haoyuzhao123 / LeanIneqComp
An inequality benchmark for theorem proving
☆22 · Feb 1, 2026 · Updated 5 months ago
hermish / cvx-graph-algorithms
Implementations of modern convex optimization-based graph algorithms in Python. Available on the Python Package Index (PyPI).
☆16 · Jul 18, 2019 · Updated 7 years ago
LibreCV / blog
Blog of the LibreCV.org
☆10 · May 17, 2021 · Updated 5 years ago
cofe-ai / fast-gector
☆63 · Aug 2, 2023 · Updated 2 years ago
mansheej / icl-task-diversity
Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression"
☆27 · Jun 28, 2023 · Updated 3 years ago
BangLab-UdeM-Mila / NLP4MatSci-ACL23
This repository contains the dataset and code for our ACL'23 publication: "MatSci-NLP: Evaluating Scientific Language Models on Materials…
☆17 · Nov 21, 2023 · Updated 2 years ago
PKU-TANGENT / ConFiguRe
Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"
☆12 · Jul 27, 2023 · Updated 3 years ago
KuaiSearchPERKS / PERKS
KuaiSearch PERKS
☆12 · Nov 16, 2021 · Updated 4 years ago
mpsilfve / ocrpp
OCR post processing and spelling correction.
☆11 · Nov 12, 2018 · Updated 7 years ago
pkunlp-icler / MLS
Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022
☆13 · Apr 13, 2022 · Updated 4 years ago
tedmoskovitz / ConstrainedRL4LMs
A library for constrained RLHF.
☆13 · Feb 19, 2024 · Updated 2 years ago
Eajack / NLP-ML_CS-Cpp_Review
A collection of links to NLP/ML interview resources (mainly gathered from GitHub)
☆11 · Mar 3, 2020 · Updated 6 years ago
M3-IT / YING-VLM
Vision Large Language Models trained on M3IT instruction tuning dataset
☆17 · Aug 16, 2023 · Updated 2 years ago
causalNLP / amr_llm
This repo explores how AMR can help address tasks that are difficult for LLMs
☆13 · Jan 15, 2024 · Updated 2 years ago
momo-journey / CDial-GPT-NEZHA
PyTorch version of Chinese multi-turn CDial based on GPT + NEZHA
☆11 · Oct 22, 2022 · Updated 3 years ago
EricLee8 / MPD_EMVI
Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference
☆10 · Jul 10, 2023 · Updated 3 years ago
chenllliang / ParetoMNMT
Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023
☆17 · Sep 27, 2023 · Updated 2 years ago
Kaffaljidhmah2 / Arxiv-Recommender
☆50 · Oct 24, 2023 · Updated 2 years ago
lifan-yuan / FactMix
Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"
☆15 · Jan 15, 2023 · Updated 3 years ago
StefanHeng / ProgGen
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
☆17 · Mar 29, 2024 · Updated 2 years ago
JiaQiSJTU / FaithEval-FFLM
A zero-shot faithfulness evaluation metric for text summarization
☆11 · Oct 17, 2023 · Updated 2 years ago
facebookresearch / llm-cross-capabilities
Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"
☆43 · Oct 1, 2024 · Updated last year
lpq29743 / HAN-PL
A Pytorch implementation for "Hierarchical Attention Network with Pairwise Loss for Chinese Zero Pronoun Resolution" (AAAI 2020).
☆10 · Dec 10, 2020 · Updated 5 years ago
dqxiu / KAssess
☆14 · Oct 28, 2023 · Updated 2 years ago
tencent-ailab / OASum
☆15 · Oct 20, 2023 · Updated 2 years ago
armingh2000 / FactScoreLite
FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu…
☆14 · Apr 25, 2024 · Updated 2 years ago
machine-reasoning-ufrgs / GNN-GCP
Graph Neural Network architecture to solve the decision version of the graph coloring problem (GCP)
☆25 · Jan 27, 2020 · Updated 6 years ago
Wsky51 / TsinghuaJS
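Notes made while preparing for the programming test problems of the 2020 Tsinghua University computer science graduate admission re-examination
☆11 · Apr 17, 2023 · Updated 3 years ago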
ikergarcia1996 / Sequence-Labeling-LLMs
The code to perform Sequence Labelling with LLMs, including T5, FLAN, LLaMA, Alpaca and more!
☆14 · Nov 5, 2024 · Updated last year
ad-freiburg / tokenization-repair
Correction of spaces with character-based neural language models.
☆13 · Aug 23, 2022 · Updated 3 years ago
bytebuff / dj-poetry-es
A Tang and Song poetry search engine built with Django + ES, with separated front end and back end.
☆36 · Apr 22, 2022 · Updated 4 years ago
victor7246 / gated-Transformer
Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling
☆10 · May 29, 2021 · Updated 5 years ago