birdofvegetables / train_grpo_qwen2.5_mathView external linksLinks
☆20Apr 8, 2025Updated 10 months ago
Alternatives and similar repositories for train_grpo_qwen2.5_math
Users that are interested in train_grpo_qwen2.5_math are comparing it to the libraries listed below
Sorting:
- ☆17May 12, 2025Updated 9 months ago
- Region Encoder Network☆18Oct 2, 2025Updated 4 months ago
- Multimodal Model for Memotion Dataset☆12May 17, 2021Updated 4 years ago
- Code and data for paper "Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?". (ACL 2025 Main)☆20Jun 18, 2025Updated 7 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆32Jul 25, 2025Updated 6 months ago
- Predict the stock price with AI models.☆30Mar 16, 2023Updated 2 years ago
- ☆28Mar 5, 2024Updated last year
- ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence☆42Aug 8, 2025Updated 6 months ago
- 练习NLP,分析淘宝评论的项目☆35May 7, 2018Updated 7 years ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆43Dec 7, 2024Updated last year
- 电商评论观点挖掘☆43Jan 29, 2021Updated 5 years ago
- Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025☆100Mar 14, 2025Updated 11 months ago
- Next-generation Simai: Note designer for maimai. The WPF editor part of the Majdata.☆78Dec 21, 2024Updated last year
- [ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos☆113Dec 12, 2025Updated 2 months ago
- 基于OpenVoice和Melotts整合的中文版webui,添加resemble_enhance音频增强功能☆100May 3, 2024Updated last year
- [CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models"☆114Jul 15, 2024Updated last year
- Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"☆120Oct 6, 2025Updated 4 months ago
- [NeurIPS22] "Advancing Model Pruning via Bi-level Optimization" by Yihua Zhang*, Yuguang Yao*, Parikshit Ram, Pu Zhao, Tianlong Chen, Min…☆117Apr 12, 2023Updated 2 years ago
- Offical implementation of "Deep Directly-Trained Spiking Neural Networks for Object Detection" (ICCV2023)☆186Apr 21, 2025Updated 9 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆357Jun 23, 2025Updated 7 months ago
- 研报,行业研报,研究报告,每天定时更新☆304Updated this week
- China-Balanced-License-Plate-Recognition-Dataset-330k:A balanced dataset of 330,000 images featuring various types of Chinese license pla…☆232Mar 31, 2023Updated 2 years ago
- A Simai Player☆272Updated this week
- 拼多多爬虫,抓取拼多多热销商品信息和评论☆221Sep 15, 2018Updated 7 years ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆404Jan 26, 2026Updated 3 weeks ago
- pytorch实现用LSTM做股票价格预测☆303Jun 17, 2020Updated 5 years ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆491Oct 17, 2025Updated 4 months ago
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆408Nov 21, 2025Updated 2 months ago
- 京东爬虫,可抓取京东商品信息和评论☆279Jul 28, 2017Updated 8 years ago
- 拼多多爬虫,爬取所有商品、评论等信息☆298Jun 17, 2022Updated 3 years ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆348Dec 14, 2025Updated 2 months ago
- 使用Tensorflow实现声纹识别☆327Jun 16, 2024Updated last year
- 中文商品评论短文本分类器,可用于情感分析☆368Dec 24, 2021Updated 4 years ago
- yolov8 车牌检测 车牌识别 中文车牌识别 检测 支持12种中文车牌 支持双层车牌☆470Jan 18, 2026Updated 3 weeks ago
- Tauri binding for Python through Pyo3☆1,286Updated this week
- 该项目用于对沪深300股票的预测,包括股票下载,数据清洗,LSTM 模型的训练,测试,以及实时预测☆428Sep 26, 2021Updated 4 years ago
- Python implementation of performance metrics in Loizou's Speech Enhancement book☆447Feb 15, 2025Updated last year
- ☆467Oct 12, 2023Updated 2 years ago
- 一个超轻量级、可以在移动端实时运行的数字人模型☆2,419Sep 18, 2025Updated 4 months ago