birdofvegetables / train_grpo_qwen2.5_math

☆20

Alternatives and similar repositories for train_grpo_qwen2.5_math

Users who are interested in train_grpo_qwen2.5_math are comparing it to the repositories listed below.

birdofvegetables / bigmodel
☆17 · May 12, 2025 · Updated 9 months ago
savya08 / REN
Region Encoder Network
☆18 · Oct 2, 2025 · Updated 4 months ago
terenceylchow124 / Meme-MultiModal
Multimodal Model for Memotion Dataset
☆12 · May 17, 2021 · Updated 4 years ago
THU-BPM / Watermark-Radioactivity-Attack
Code and data for paper "Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?". (ACL 2025 Main)
☆20 · Jun 18, 2025 · Updated 7 months ago
Tim-Siu / reinforcement-distillation
Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"
☆32 · Jul 25, 2025 · Updated 6 months ago
jingedawang / StockPredictor
Predict the stock price with AI models.
☆30 · Mar 16, 2023 · Updated 2 years ago
Yiwei98 / TDG
☆28 · Mar 5, 2024 · Updated last year
ghwang-s / abkd
ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence
☆42 · Aug 8, 2025 · Updated 6 months ago
xfgryujk / TaobaoAnalysis
A practice NLP project that analyzes Taobao reviews
☆35 · May 7, 2018 · Updated 7 years ago
gabrielegoletto / AMEGO
Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024
☆43 · Dec 7, 2024 · Updated last year
xmxoxo / Text-Opinion-Mining
Opinion mining for e-commerce reviews
☆43 · Jan 29, 2021 · Updated 5 years ago
hmxiong / StreamChat
Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025
☆100 · Mar 14, 2025 · Updated 11 months ago
LingFeng-bbben / MajdataEdit
Next-generation Simai: Note designer for maimai. The WPF editor part of the Majdata.
☆78 · Dec 21, 2024 · Updated last year
yaolinli / TimeChat-Online
[ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
☆113 · Dec 12, 2025 · Updated 2 months ago
v3ucn / OpenVoiceV2_Webui_resemble_enhance
Chinese-language web UI that integrates OpenVoice and MeloTTS, with resemble_enhance audio enhancement added
☆100 · May 3, 2024 · Updated last year
kdiAAA / TDA
[CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models"
☆114 · Jul 15, 2024 · Updated last year
SakanaAI / TAID
Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
☆120 · Oct 6, 2025 · Updated 4 months ago
OPTML-Group / BiP
[NeurIPS22] "Advancing Model Pruning via Bi-level Optimization" by Yihua Zhang*, Yuguang Yao*, Parikshit Ram, Pu Zhao, Tianlong Chen, Min…
☆117 · Apr 12, 2023 · Updated 2 years ago
BICLab / EMS-YOLO
Official implementation of "Deep Directly-Trained Spiking Neural Networks for Object Detection" (ICCV2023)
☆186 · Apr 21, 2025 · Updated 9 months ago
SakanaAI / RLT
Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
☆357 · Jun 23, 2025 · Updated 7 months ago
manymore13 / report
Research reports, industry research reports, and analyst reports, updated daily on a schedule
☆304 · Updated this week
SunlifeV / CBLPRD-330k
China-Balanced-License-Plate-Recognition-Dataset-330k: A balanced dataset of 330,000 images featuring various types of Chinese license pla…
☆232 · Mar 31, 2023 · Updated 2 years ago
LingFeng-bbben / MajdataPlay
A Simai Player
☆272 · Updated this week
OFZFZS / scrapy-pinduoduo
Pinduoduo crawler that scrapes information and reviews for best-selling Pinduoduo products
☆221 · Sep 15, 2018 · Updated 7 years ago
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆404 · Jan 26, 2026 · Updated 3 weeks ago
netblind / stockPredict
Stock price prediction with an LSTM, implemented in PyTorch
☆303 · Jun 17, 2020 · Updated 5 years ago
Tencent / CognitiveKernel-Pro
Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414
☆491 · Oct 17, 2025 · Updated 4 months ago
ypwang61 / One-Shot-RLVR
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
☆408 · Nov 21, 2025 · Updated 2 months ago
xiaoquantou / jd_spider
JD.com crawler that can scrape JD product information and reviews
☆279 · Jul 28, 2017 · Updated 8 years ago
SZFsir / pddSpider
Pinduoduo crawler that scrapes all products, reviews, and related information
☆298 · Jun 17, 2022 · Updated 3 years ago
zhengli97 / PromptKD
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
☆348 · Dec 14, 2025 · Updated 2 months ago
yeyupiaoling / VoiceprintRecognition-Tensorflow
Voiceprint recognition implemented with TensorFlow
☆327 · Jun 16, 2024 · Updated last year
ami66 / ChineseTextClassifier
Short-text classifier for Chinese product reviews, usable for sentiment analysis
☆368 · Dec 24, 2021 · Updated 4 years ago
we0091234 / yolov8-plate
YOLOv8 license plate detection and recognition for Chinese license plates; supports 12 types of Chinese plates and double-layer plates
☆470 · Jan 18, 2026 · Updated 3 weeks ago
pytauri / pytauri
Tauri binding for Python through Pyo3
☆1,286 · Updated this week
LightingFx / hs300_stock_predict
This project predicts CSI 300 stocks, covering stock data download, data cleaning, LSTM model training and testing, and real-time prediction
☆428 · Sep 26, 2021 · Updated 4 years ago
schmiph2 / pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
☆447 · Feb 15, 2025 · Updated last year
huyanxin / DeepComplexCRN
☆467 · Oct 12, 2023 · Updated 2 years ago
anliyuan / Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices
☆2,419 · Sep 18, 2025 · Updated 4 months ago
