Spico197 / Humback
An unofficial implementation of Self-Alignment with Instruction Backtranslation.
☆132 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for Humback
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆125 · Updated 2 months ago
- ☆120 · Updated 7 months ago
- ☆129 · Updated 4 months ago
- ☆71 · Updated 10 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning ☆218 · Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation ☆67 · Updated last week
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings ☆144 · Updated 5 months ago
- ☆88 · Updated last month
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆62 · Updated 7 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆83 · Updated 4 months ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat ☆106 · Updated last year
- ☆53 · Updated 4 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆145 · Updated 5 months ago
- ☆133 · Updated last year
- 𧬠RegMix: Data Mixture as Regression for Language Model Pre-trainingβ88Updated last month
- Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)" ☆89 · Updated 3 weeks ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024) ☆44 · Updated 7 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆219 · Updated 2 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING ☆87 · Updated 7 months ago
- [SIGIR'24] The official implementation code of MOELoRA. ☆124 · Updated 3 months ago
- Counting-Stars (★) ☆76 · Updated 2 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo… ☆307 · Updated 2 months ago
- ☆91 · Updated 11 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆109 · Updated 5 months ago
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues ☆51 · Updated 3 months ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning. ☆125 · Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆70 · Updated 9 months ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning ☆33 · Updated 9 months ago
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models" ☆46 · Updated last year
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener… ☆58 · Updated 4 months ago