victorsungo / WizardLMLinks
Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder
☆45Updated last year
Alternatives and similar repositories for WizardLM
Users that are interested in WizardLM are comparing it to the libraries listed below
Sorting:
- FuseAI Project☆87Updated 4 months ago
- Reformatted Alignment☆114Updated 8 months ago
- ☆82Updated last year
- ☆47Updated 5 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆132Updated 11 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆136Updated 10 months ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆100Updated 10 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆97Updated last year
- The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction☆22Updated last year
- Mixture-of-Experts (MoE) Language Model☆188Updated 8 months ago
- ☆94Updated 5 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆249Updated 5 months ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated last year
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated 8 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆61Updated last year
- Code for KaLM-Embedding models☆77Updated 2 months ago
- Data preparation code for Amber 7B LLM☆90Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated 2 years ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆32Updated 2 years ago
- ☆49Updated last year
- ☆76Updated last year
- ☆269Updated 2 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆37Updated last year
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆101Updated 2 weeks ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆100Updated last week
- A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size☆57Updated last month
- ☆56Updated 5 months ago