Qihoo360 / 360-LLaMA-Factory
adds Sequence Parallelism into LLaMA-Factory
☆261Updated this week
Alternatives and similar repositories for 360-LLaMA-Factory:
Users that are interested in 360-LLaMA-Factory are comparing it to the libraries listed below
- Unified KV Cache Compression Methods for Auto-Regressive Models☆911Updated 2 months ago
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆779Updated 2 weeks ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆158Updated 3 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆244Updated 3 months ago
- The framework to prune LLMs to any size and any config.☆87Updated last year
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models☆128Updated last month
- A recipe for online RLHF and online iterative DPO.☆488Updated 2 months ago
- Align Anything: Training All-modality Model with Feedback☆2,486Updated this week
- The official implementation of Self-Play Preference Optimization (SPPO)☆494Updated last month
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models☆31Updated last month
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations☆215Updated 5 months ago
- Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models☆170Updated 4 months ago
- MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…☆173Updated 3 months ago
- The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://a…☆287Updated 3 months ago
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆694Updated last month
- Multilingual Corpus of Web Fiction☆190Updated 8 months ago
- The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.☆281Updated 2 months ago
- Controllable Text Generation for Large Language Models: A Survey☆160Updated 6 months ago
- An MBTI Exploration of Large Language Models☆459Updated last year
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆158Updated 2 months ago
- improve Llama-2's proficiency in comprehension, generation, and translation of Chinese.☆450Updated 11 months ago
- [NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Models☆98Updated 8 months ago
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.☆197Updated 8 months ago
- Recipes to train reward model for RLHF.☆1,205Updated 3 weeks ago
- 从预训练到强化学习的中文llama2☆86Updated last year
- Code Efficiency Benchmark☆71Updated last month