Spico197/Humpback

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Spico197/Humpback)

Spico197 / Humpback

🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.

☆138

Alternatives and similar repositories for Humpback

Users that are interested in Humpback are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KOR-Bench / KOR-Bench
View on GitHub
☆19Nov 12, 2024Updated last year
cognitiveailab / GPT-simulator
View on GitHub
☆33Jun 12, 2024Updated 2 years ago
kangreen0210 / LIME
View on GitHub
Accelerating the development of large multimodal models (LMMs) with lmms-eval
☆14Oct 14, 2024Updated last year
yizhongw / self-instruct
View on GitHub
Aligning pretrained language models with instruction data generated by themselves.
☆4,606Mar 27, 2023Updated 3 years ago
tianyi-lab / Cherry_LLM
View on GitHub
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆417Jun 25, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
multimodal-art-projection / COIG-P
View on GitHub
☆42Jul 15, 2025Updated last year
hkust-nlp / deita
View on GitHub
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆600Dec 9, 2024Updated last year
tatsu-lab / alpaca_eval
View on GitHub
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆2,007Aug 9, 2025Updated 11 months ago
Zheng0428 / COIG-Kun
View on GitHub
☆36Sep 6, 2024Updated last year
Re-Align / URIAL
View on GitHub
☆316Jun 9, 2024Updated 2 years ago
ssbuild / aigc_evals
View on GitHub
aigc evals
☆10Dec 2, 2023Updated 2 years ago
Hsuan-Tung / universal_attack_natural_trigger
View on GitHub
Natural Universal Trigger Search (NUTS)
☆21Apr 17, 2021Updated 5 years ago
OpenNLG / OpenBA
View on GitHub
☆95Oct 8, 2023Updated 2 years ago
CMMMU-Benchmark / CMMMU
View on GitHub
☆48Sep 5, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
NUSTM / LLMs-Waver-In-Judgments
View on GitHub
☆12Sep 23, 2024Updated last year
GAIR-NLP / BeHonest
View on GitHub
BeHonest: Benchmarking Honesty in Large Language Models
☆35Aug 15, 2024Updated last year
shuyhere / about-super-alignment
View on GitHub
Feeling confused about super alignment? Here is a reading list
☆43Jan 9, 2024Updated 2 years ago
csitfun / LogiCoT
View on GitHub
the instructions and demonstrations for building a formal logical reasoning capable GLM
☆54Sep 3, 2024Updated last year
Spico197 / MoE-SFT
View on GitHub
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
☆41Sep 29, 2024Updated last year
wwxu21 / CUT
View on GitHub
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58Feb 29, 2024Updated 2 years ago
yuh-zha / Align
View on GitHub
Align, a general text alignment function
☆15Dec 7, 2023Updated 2 years ago
sustcsonglin / disco-pointer
View on GitHub
Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …
☆14Aug 25, 2023Updated 2 years ago
Jacob-Zhou / gecdi
View on GitHub
The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"
☆32Jan 22, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
YunjiaXi / InfoDeepSeek
View on GitHub
Code for InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation
☆19May 29, 2025Updated last year
epang-ucas / Evaluate_LLMs_to_Genes
View on GitHub
☆19May 25, 2024Updated 2 years ago
MatthewCYM / SFLM
View on GitHub
☆17Oct 19, 2021Updated 4 years ago
multi-swe-bench / MagentLess
View on GitHub
☆13Jul 31, 2025Updated 11 months ago
Edward-Sun / easy-to-hard
View on GitHub
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
☆124Sep 9, 2024Updated last year
lezhang7 / MOQAGPT
View on GitHub
[EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs
☆13Dec 28, 2024Updated last year
GanjinZero / RRHF
View on GitHub
[NIPS2023] RRHF & Wombat
☆805Sep 22, 2023Updated 2 years ago
alibaba / Megatron-LLaMA
View on GitHub
Best practice for training LLaMA models in Megatron-LM
☆666Jan 2, 2024Updated 2 years ago
yizhongw / llm-temporal-alignment
View on GitHub
Methods and evaluation for aligning language models temporally
☆31Mar 2, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
open-compass / ANAH
View on GitHub
[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO
☆66Apr 30, 2025Updated last year
RLHFlow / RLHF-Reward-Modeling
View on GitHub
Recipes to train reward model for RLHF.
☆1,534Apr 24, 2025Updated last year
c-box / KnowledgeLifecycle
View on GitHub
Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"
☆58Aug 24, 2023Updated 2 years ago
emorynlp / seq2seq-corenlp
View on GitHub
☆13Feb 7, 2023Updated 3 years ago
zjunlp / EasyInstruct
View on GitHub
[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.
☆407Dec 23, 2024Updated last year
GAIR-NLP / self-improvement-reversal
View on GitHub
☆13Jul 14, 2024Updated 2 years ago
keven980716 / weak-to-strong-deception
View on GitHub
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15Jun 21, 2024Updated 2 years ago