victorsungo / WizardLM
Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder
☆45 Updated last year
Alternatives and similar repositories for WizardLM
Users who are interested in WizardLM are comparing it to the repositories listed below
- FuseAI Project ☆87 Updated 5 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models" ☆98 Updated last year
- code for Scaling Laws of RoPE-based Extrapolation ☆73 Updated last year
- Self-Controlled Memory System for LLMs ☆49 Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler]; ☆38 Updated last year
- ☆94 Updated 6 months ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models" ☆100 Updated 11 months ago
- ☆121 Updated 10 months ago
- ☆59 Updated last year
- Open efforts to implement ChatGPT-like models and beyond. ☆107 Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM ☆45 Updated last year
- Recipes for creating AI Applications with APIs from DashScope (and friends)! ☆56 Updated 8 months ago
- ☆58 Updated 11 months ago
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models” ☆121 Updated 5 months ago
- Unofficial implementation of AlpaGasus ☆91 Updated last year
- Code and data for CoachLM, an automatic instruction revision approach for LLM instruction tuning. ☆61 Updated last year
- ☆40 Updated last year
- Reformatted Alignment ☆113 Updated 9 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆133 Updated last year
- An Experiment on Dynamic NTK Scaling RoPE ☆64 Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆78 Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train… ☆57 Updated last year
- ☆82 Updated last year
- ☆121 Updated last year
- Code for KaLM-Embedding models ☆78 Updated 3 months ago
- ☆56 Updated 6 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper ☆137 Updated 11 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆112 Updated 9 months ago
- ☆33 Updated 2 years ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc. ☆39 Updated 9 months ago