QwenLM / QwQ
QwQ is the reasoning model series developed by the Qwen team at Alibaba Cloud.
☆402 · Updated this week
Alternatives and similar repositories for QwQ:
Users who are interested in QwQ are comparing it to the repositories listed below.
- adds Sequence Parallelism into LLaMA-Factory ☆437 · Updated this week
- The official implementation of Self-Play Preference Optimization (SPPO) ☆515 · Updated 2 months ago
- PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoki… ☆1,045 · Updated last month
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework ☆176 · Updated last week
- DeepRetrieval - Hacking 🔥Real Search Engines and Text/Data Retrievers with LLM + RL ☆201 · Updated this week
- minimal-cost for training 0.5B R1-Zero ☆673 · Updated this week
- Unified KV Cache Compression Methods for Auto-Regressive Models ☆966 · Updated 2 months ago
- Pioneering Multimodal Reasoning with CoT ☆1,157 · Updated this week
- Codebase for Iterative DPO Using Rule-based Rewards ☆230 · Updated this week
- [NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding ☆142 · Updated 3 weeks ago
- Align Anything: Training All-modality Model with Feedback ☆3,063 · Updated last week
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS ☆1,158 · Updated this week
- "VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos" ☆510 · Updated this week
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models ☆249 · Updated 2 weeks ago
- Recipes to train the self-rewarding reasoning LLMs. ☆207 · Updated 3 weeks ago
- VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024) ☆421 · Updated 7 months ago
- The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods. ☆321 · Updated 3 months ago
- The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM… ☆134 · Updated last month
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts" ☆705 · Updated 2 months ago
- A highly optimized LLM inference acceleration engine for Llama and its variants. ☆881 · Updated last week
- Build multimodal language agents for fast prototype and production ☆2,451 · Updated last week
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations ☆218 · Updated 6 months ago
- Recipes to train reward model for RLHF. ☆1,257 · Updated last month
- "GraphAgent: Agentic Graph Language Assistant" ☆292 · Updated last month
- The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://a… ☆292 · Updated 4 months ago
- Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition" ☆282 · Updated 2 months ago
- MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a… ☆174 · Updated last week
- "Your Fully-Automated Personal AI Assistant, and Open-Source & Cost-Efficient Alternative to OpenAI's Deep Research" ☆838 · Updated last month
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents ☆343 · Updated last month
- A recipe for online RLHF and online iterative DPO. ☆502 · Updated 3 months ago