Ledzy / BAdam
[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
☆214 Updated last month
Related projects
Alternatives and complementary repositories for BAdam
- A recipe for online RLHF and online iterative DPO. ☆434 Updated last week
- The official implementation of Self-Play Preference Optimization (SPPO) ☆498 Updated 3 months ago
- [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding ☆230 Updated 2 months ago
- Controllable Text Generation for Large Language Models: A Survey ☆142 Updated 2 months ago
- MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models ☆386 Updated 9 months ago
- Personal Project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv… ☆381 Updated last month
- PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight) ☆265 Updated this week
- Recipes to train reward model for RLHF. ☆903 Updated this week
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale" ☆191 Updated last month
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc. ☆182 Updated last week
- The framework to prune LLMs to any size and any config. ☆99 Updated 8 months ago
- The nanoGPT-style implementation of RWKV Language Model - an RNN with GPT-level LLM performance. ☆193 Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆126 Updated 5 months ago
- Improve Llama-2's proficiency in comprehension, generation, and translation of Chinese. ☆532 Updated 7 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment ☆230 Updated 6 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs ☆217 Updated 6 months ago
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023). ☆275 Updated last year
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊 ☆234 Updated 2 weeks ago
- RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness ☆241 Updated 2 weeks ago
- Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression" ☆95 Updated last month
- MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction ☆81 Updated 3 weeks ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo… ☆307 Updated 2 months ago
- [ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? ☆149 Updated last month
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆714 Updated 2 weeks ago
- LongQLoRA: Extend Context Length of LLMs Efficiently ☆159 Updated last year