victorsungo / WizardLM
Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder
☆45Updated 11 months ago
Alternatives and similar repositories for WizardLM:
Users that are interested in WizardLM are comparing it to the libraries listed below
- FuseAI Project☆84Updated 2 months ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆100Updated 8 months ago
- An Experiment on Dynamic NTK Scaling RoPE☆62Updated last year
- ☆36Updated 2 years ago
- code for Scaling Laws of RoPE-based Extrapolation☆72Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- ☆81Updated 11 months ago
- Reformatted Alignment☆115Updated 6 months ago
- Unofficial implementation of AlpaGasus☆90Updated last year
- Self-Controlled Memory System for LLMs☆46Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 10 months ago
- ☆44Updated 3 months ago
- ☆59Updated 11 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆130Updated 9 months ago
- The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction☆22Updated 9 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆61Updated last year
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆131Updated last month
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated last year
- The RedStone repository includes code for preparing extensive datasets used in training large language models.☆120Updated last month
- ☆120Updated 9 months ago
- Open Implementations of LLM Analyses☆103Updated 5 months ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆97Updated 8 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆98Updated last year
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆28Updated last year
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆60Updated 6 months ago
- Official implementation of paper "Autonomous Data Selection with Language Models for Mathematical Texts" (As Huggingface Daily Papers: ht…☆80Updated 4 months ago
- Mixture-of-Experts (MoE) Language Model☆185Updated 6 months ago
- ☆117Updated 7 months ago
- ☆36Updated 6 months ago
- ☆94Updated 3 months ago