Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42Nov 11, 2025Updated 4 months ago
Alternatives and similar repositories for SwS
Users that are interested in SwS are comparing it to the libraries listed below
Sorting:
- ☆60Mar 8, 2026Updated 2 weeks ago
- ☆46Jun 24, 2025Updated 8 months ago
- ☆18Apr 10, 2025Updated 11 months ago
- ☆13Dec 9, 2024Updated last year
- This the implementation of LeCo☆32Jan 20, 2025Updated last year
- ☆18Nov 20, 2024Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 3 months ago
- This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions".☆64Feb 6, 2026Updated last month
- documentation used in my projects☆16Updated this week
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated 11 months ago
- ☆16Sep 4, 2025Updated 6 months ago
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆35Jul 16, 2025Updated 8 months ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- [ICLR 2026] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"☆48Aug 16, 2025Updated 7 months ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆30Sep 12, 2025Updated 6 months ago
- The implement of LLMTreeRec☆14Dec 9, 2024Updated last year
- Documentation at☆14Mar 27, 2025Updated 11 months ago
- Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling☆29Jan 24, 2026Updated last month
- Your efficient and accurate answer verification system for RL training.☆41Jun 23, 2025Updated 8 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks, ICLR 2026☆72Mar 6, 2026Updated 2 weeks ago
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202…☆13Oct 27, 2024Updated last year
- [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models☆11Sep 19, 2025Updated 6 months ago
- A wrapper around libssh2 for .NET☆30Jan 21, 2026Updated 2 months ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated 11 months ago
- Personal Finance Expense Tracker☆20Nov 14, 2025Updated 4 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆53Jan 5, 2026Updated 2 months ago
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data.☆33Jan 21, 2026Updated 2 months ago
- ☆10Oct 11, 2022Updated 3 years ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆24Jun 28, 2025Updated 8 months ago
- Real-time webcam demo with SmolVLM(mlx-community/SmolVLM-Instruct-4bit) and MLX-VLM☆25Jun 12, 2025Updated 9 months ago
- A powerful, interactive Python CLI for converting, manipulating, and inspecting media files using FFmpeg 🎬☆17Updated this week
- implementations of some research papers 😴😴☆21Jul 28, 2025Updated 7 months ago
- Raptor is a modern, fast, and easy-to-use system for building disk images, bootable isos, containers and much more, from a simple, Docker…☆39Feb 10, 2026Updated last month
- A simple, interactive web tool to compare pricing and performance metrics of various AI models.☆18Updated this week
- key/value store for Python based on Cloudflare workers☆33Jun 13, 2025Updated 9 months ago
- [NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking☆24Updated this week
- Simple application for tracking and managing a home schooling program.☆40Sep 13, 2025Updated 6 months ago
- Scans for used translations, compares with your translations file and removes the ones that are not in use.☆17Nov 21, 2025Updated 4 months ago