QwenLM / WorldPM
☆37 · Updated this week
Alternatives and similar repositories for WorldPM
Users who are interested in WorldPM are comparing it to the libraries listed below.
- Data preparation code for CrystalCoder 7B LLM · ☆44 · Updated last year
- ☆47 · Updated 5 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] · ☆37 · Updated last year
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper · ☆33 · Updated 2 months ago
- Nexusflow function call, tool use, and agent benchmarks. · ☆19 · Updated 5 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models · ☆22 · Updated 5 months ago
- ☆16 · Updated 2 months ago
- Parallel Scaling Law for Language Model: Beyond Parameter and Inference Time Scaling · ☆64 · Updated this week
- ☆24 · Updated 3 months ago
- ☆25 · Updated 8 months ago
- ☆30 · Updated 9 months ago
- ☆27 · Updated 2 months ago
- Pivotal Token Search · ☆23 · Updated this week
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization · ☆36 · Updated 2 months ago
- Collection of model-centric MCP servers · ☆14 · Updated last week
- The official code repo and data hub of top_nsigma sampling strategy for LLMs. · ☆24 · Updated 3 months ago
- Evaluation of bm42 sparse indexing algorithm · ☆65 · Updated 10 months ago
- ☆20 · Updated last month
- Lightweight toolkit package to train and fine-tune 1.58bit Language models · ☆44 · Updated this week
- ☆43 · Updated 7 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. · ☆23 · Updated last month
- ☆42 · Updated 2 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems · ☆90 · Updated 2 months ago
- Verifiers for LLM Reinforcement Learning · ☆50 · Updated last month
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) · ☆30 · Updated 2 months ago
- ☆46 · Updated this week
- ☆64 · Updated last month
- ☆28 · Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' · ☆77 · Updated last year
- ☆20 · Updated 11 months ago