thu-coai / CritiqueLLM
★141 · Updated 8 months ago
Alternatives and similar repositories for CritiqueLLM:
Users who are interested in CritiqueLLM are comparing it to the libraries listed below.
- An unofficial implementation of Self-Alignment with Instruction Backtranslation. ★137 · Updated 8 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning ★244 · Updated last year
- ★45 · Updated 9 months ago
- ★134 · Updated 11 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ★66 · Updated 3 months ago
- ★97 · Updated 11 months ago
- ★95 · Updated last year
- ★128 · Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ★146 · Updated 6 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ★123 · Updated 9 months ago
- ★105 · Updated last month
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models ★40 · Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation ★75 · Updated 4 months ago
- SuperCLUE-Agent: A benchmark for evaluating the core capabilities of agents on native Chinese tasks ★83 · Updated last year
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs ★241 · Updated 3 months ago
- ★48 · Updated last year
- ★160 · Updated last year
- Chinese Large Language Model Evaluation, Phase 2 ★70 · Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ★70 · Updated 3 weeks ago
- ★101 · Updated 3 months ago
- Measuring Massive Multitask Chinese Understanding ★87 · Updated 11 months ago
- ★46 · Updated last month
- Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs. ★41 · Updated 8 months ago
- Unofficial implementation of AlpaGasus ★90 · Updated last year
- ★165 · Updated last year
- Code for Scaling Laws of RoPE-based Extrapolation ★70 · Updated last year
- Code implementation of synthetic continued pretraining ★94 · Updated 2 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning ★214 · Updated 2 months ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale" ★226 · Updated last month
- Finetuning LLaMA with RLHF (Reinforcement Learning from Human Feedback) based on DeepSpeed Chat ★114 · Updated last year