tml1026 / RoleCraft
☆21 Updated last year
Alternatives and similar repositories for RoleCraft
Users who are interested in RoleCraft are comparing it to the libraries listed below.
- ☆54 Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆101 Updated 11 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆73 Updated 8 months ago
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation" ☆24 Updated last year
- ☆51 Updated last year
- ☆142 Updated 8 months ago
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis' ☆32 Updated 8 months ago
- ☆147 Updated last year
- ☆163 Updated last year
- The GitHub repository for the paper "Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning" accepte… ☆18 Updated last year
- ☆87 Updated 2 years ago
- MathEval is a benchmark dedicated to the holistic evaluation on mathematical capacities of LLMs. ☆86 Updated last year
- Code implementation of synthetic continued pretraining ☆148 Updated last year
- Unleashing the Power of Cognitive Dynamics on Large Language Models ☆63 Updated last year
- On Memorization of Large Language Models in Logical Reasoning ☆74 Updated 10 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆61 Updated last year
- ☆98 Updated last year
- A Bilingual Role Evaluation Benchmark for Large Language Models ☆43 Updated 2 years ago
- Generative Judge for Evaluating Alignment ☆250 Updated 2 years ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs. ☆46 Updated last year
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning". ☆145 Updated last year
- Personality Alignment of Language Models ☆53 Updated 7 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation ☆90 Updated last year
- Counting-Stars (★) ☆83 Updated 2 months ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models ☆65 Updated last year
- ☆109 Updated 6 months ago
- ☆51 Updated last year
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models" ☆47 Updated 2 years ago
- ☆147 Updated last year
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement) ☆50 Updated 2 years ago