Spico197 / HumpbackLinks
π An unofficial implementation of Self-Alignment with Instruction Backtranslation.
β138Updated 8 months ago
Alternatives and similar repositories for Humpback
Users that are interested in Humpback are comparing it to the libraries listed below
Sorting:
- β143Updated 2 years ago
- Generative Judge for Evaluating Alignmentβ248Updated last year
- Counting-Stars (β )β83Updated last month
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Modelsβ118Updated 7 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuningβ184Updated 6 months ago
- Collection of papers for scalable automated alignment.β93Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)β99Updated 10 months ago
- β147Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuningβ284Updated 2 years ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Modelsβ269Updated last year
- Do Large Language Models Know What They Donβt Know?β102Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.β132Updated 2 years ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)β50Updated last year
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenariosβ73Updated 7 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Modelsβ194Updated last year
- β282Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimationβ90Updated last year
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Explorationβ36Updated last year
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.β52Updated 2 years ago
- A large-scale, fine-grained, diverse preference dataset (and models).β359Updated 2 years ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"β136Updated last year
- https://acl2023-retrieval-lm.github.io/β156Updated 2 years ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`β216Updated 5 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsβ198Updated last month
- Datasets for Instruction Tuning of Large Language Modelsβ260Updated 2 years ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Followingβ134Updated last year
- β294Updated 2 years ago
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialoguesβ136Updated last year
- Data and Code for Program of Thoughts [TMLR 2023]β302Updated last year
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMsβ46Updated last year