victorsungo / WizardLM
Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder
☆44 Updated last year
Alternatives and similar repositories for WizardLM
Users who are interested in WizardLM are comparing it to the repositories listed below
- FuseAI Project ☆87 Updated 6 months ago
- Mixture-of-Experts (MoE) Language Model ☆189 Updated 11 months ago
- Langchain implementation of HuggingGPT ☆132 Updated 2 years ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models" ☆97 Updated last year
- ☆94 Updated 8 months ago
- Code and data for CoachLM, an automatic instruction revision approach for LLM instruction tuning. ☆60 Updated last year
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models" ☆102 Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement ☆188 Updated last year
- A Fine-tuned LLaMA that is Good at Arithmetic Tasks ☆178 Updated last year
- Data preparation code for CrystalCoder 7B LLM ☆45 Updated last year
- ☆319 Updated 10 months ago
- LongQLoRA: Extend Context Length of LLMs Efficiently ☆166 Updated last year
- ☆124 Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4" ☆43 Updated 9 months ago
- Official implementation for "OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities" (keep updating) ☆60 Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆41 Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆136 Updated last year
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools ☆140 Updated 2 years ago
- ☆35 Updated 2 years ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents ☆214 Updated last month
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al… ☆111 Updated last year
- Pre-training code for CrystalCoder 7B LLM ☆55 Updated last year
- Official repo of Rephrase-and-Respond: data, code, and evaluation ☆103 Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc. ☆140 Updated last year
- Data preparation code for Amber 7B LLM ☆91 Updated last year
- ☆83 Updated last year
- Open Implementations of LLM Analyses ☆106 Updated 10 months ago
- Official repo for "Make Your LLM Fully Utilize the Context" ☆253 Updated last year
- Reformatted Alignment ☆113 Updated 10 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc. ☆39 Updated 11 months ago