Qwen2.5 0.5B GRPO
☆80Feb 16, 2025Updated last year
Alternatives and similar repositories for qwen2.5-0.5b-grpo
Users who are interested in qwen2.5-0.5b-grpo compare it with the libraries listed below.
- ☆14Jul 24, 2025Updated 7 months ago
- OpenVLA Lightweight Version(0.5B). It uses qwen2-0.5B and fine-tunes using mllm format, without occupying LLM's inherent tokens. It repre…☆16Jan 7, 2026Updated 2 months ago
- Collection of works for evaluating (and analyzing) large audio-language models (LALMs)☆39Aug 11, 2025Updated 7 months ago
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25)☆27Jun 5, 2025Updated 9 months ago
- "BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks"☆13May 10, 2024Updated last year
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Aug 14, 2024Updated last year
- YOLOv12 deployment on RKNN☆14Mar 3, 2025Updated last year
- YOLOv11-pruning based on constraint of BN layer gamma values.☆22Jan 17, 2025Updated last year
- Explaining audio differences using language☆16Feb 11, 2025Updated last year
- qwen2 and llama3 cpp implementation☆48Jun 7, 2024Updated last year
- ☆19Nov 17, 2025Updated 4 months ago
- This is a general framework for fake audio detection using pytorch lightning☆27Jul 24, 2025Updated 7 months ago
- ☆10Apr 16, 2024Updated last year
- [CVPR 2025🔥] Official codebase for "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation"☆20Apr 18, 2025Updated 11 months ago
- Two-Path-Transformer-Based Generative Adversarial Network Using Joint Magnitude Masking And Complex Spectral Mapping For Speech Enhanceme…☆16May 29, 2024Updated last year
- VITS speech synthesis, Chinese-only fine-tuning☆12Mar 15, 2023Updated 3 years ago
- ☆10Jul 27, 2021Updated 4 years ago
- Dataset2024☆12Jun 12, 2025Updated 9 months ago
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆12Apr 25, 2024Updated last year
- ☆13Aug 11, 2018Updated 7 years ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆45Apr 1, 2025Updated 11 months ago
- Utilized attention incorporated UNet model for conditional image generation using Flow Matching with Conditional Optimal Transport Object…☆13Dec 29, 2023Updated 2 years ago
- A comparison of using different feature descriptors (SI, SIFT, SHOT, CSHOT, FPFH) and different keypoints detection algorithm (SIFT3D, I…☆18Feb 9, 2021Updated 5 years ago
- ☆12Mar 13, 2023Updated 3 years ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆76Jun 16, 2025Updated 9 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆55Aug 28, 2025Updated 6 months ago
- llm & rl☆278Oct 24, 2025Updated 4 months ago
- ☆23Oct 17, 2024Updated last year
- Simple decoder-only GPT model in PyTorch☆43May 19, 2024Updated last year
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆24Nov 23, 2024Updated last year
- This repository contains the source code related to the paper Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation☆11Jun 23, 2020Updated 5 years ago
- Alibaba Tongyi Qianwen (Qwen-7B-Chat/Qwen-7B): fine-tuning, LoRA, inference☆139May 17, 2024Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- Code for paper "Masked Pre-training Enables Universal Zero-shot Denoiser" [NeurIPS 2024].☆35Nov 20, 2024Updated last year
- Project for training SSL-based deepfake speech detector☆47Feb 2, 2026Updated last month
- ☆29Jul 1, 2023Updated 2 years ago
- LCA-on-the-line (ICML 2024 Oral)☆13Feb 13, 2025Updated last year
- ☆13Dec 28, 2023Updated 2 years ago