owenliang / qwen2.5-0.5b-grpo
View external linksLinks

Qwen2.5 0.5B GRPO

☆78

Alternatives and similar repositories for qwen2.5-0.5b-grpo

Users that are interested in qwen2.5-0.5b-grpo are comparing it to the libraries listed below

Sorting:

IP127000 / openVLA-Qwen2-0.5B
View on GitHub
OpenVLA Lightweight Version(0.5B). It uses qwen2-0.5B and fine-tunes using mllm format, without occupying LLM's inherent tokens. It repre…
☆15Jan 7, 2026Updated last month
vietthanh2710 / AC-MambaSeg
View on GitHub
☆18Nov 17, 2025Updated 3 months ago
WWWWxp / Speech-Tokenizer-Papers
View on GitHub
This repository collects papers related to Speech Tokenizer.
☆17Oct 16, 2024Updated last year
xieyuankun / All-Type-ADD
View on GitHub
This is the repo of our work titled “Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception”
☆26May 21, 2025Updated 8 months ago
Ashigarg123 / ShiftySpeech
View on GitHub
☆14Jul 24, 2025Updated 6 months ago
rkuo2000 / GenAI
View on GitHub
☆11Updated this week
yvonwin / qwen2.cpp
View on GitHub
qwen2 and llama3 cpp implementation
☆49Jun 7, 2024Updated last year
waylandzhang / embedding_from_scratch
View on GitHub
训练自己的中文 Embedding 模型
☆28Jan 6, 2025Updated last year
ckyang1124 / LALM-Evaluation-Survey
View on GitHub
Collection of works for evaluating (and analyzing) large audio-language models (LALMs)
☆40Aug 11, 2025Updated 6 months ago
OatmealLiu / Point-cloud-registration_RPA-project
View on GitHub
A comparison of using different feature descriptors (SI, SIFT, SHOT, CSHOT, FPFH) and different keypoints detection algorithm (SIFT3D, I…
☆17Feb 9, 2021Updated 5 years ago
awsaf49 / sonics
View on GitHub
[ICLR 2025] SONICS: Synthetic Or Not - Identifying Counterfeit Songs
☆43May 23, 2025Updated 8 months ago
john852517791 / pytorch_lightning_FAD
View on GitHub
This is a general framework for fake audio detection using pytorch lightning
☆27Jul 24, 2025Updated 6 months ago
MiniMax-AI / audio-tools
View on GitHub
A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…
☆44Apr 1, 2025Updated 10 months ago
GenerativeAgents / dify-book
View on GitHub
Difyで作る生成AIアプリ完全入門
☆17May 25, 2025Updated 8 months ago
6zzhh6 / WeChat_Formatting_Tool
View on GitHub
A simple WeChat Official Account layout tool based on Dify
☆16Jun 27, 2025Updated 7 months ago
BierOne / ood_coverage
View on GitHub
[ICLR 2024 Spotlight] Neuron Activation Coverage: Rethinking Out-of-distribution Detection and Generalization
☆34Oct 25, 2024Updated last year
HugoPalomares / design-intent-for-sdd
View on GitHub
☆28Dec 4, 2025Updated 2 months ago
c00cjz00 / llmservice_ip
View on GitHub
☆11Aug 29, 2025Updated 5 months ago
OneWave-AI / claude-skills
View on GitHub
100 Production-Ready Claude Code Skills - The most comprehensive collection of AI skills for sales, business automation, content creation…
☆35Oct 22, 2025Updated 3 months ago
NJUxlj / Chinese-MedQA-Qwen2
View on GitHub
基于Qwen2+SFT+DPO的医疗问答系统，项目中使用了自定义的 SFTTrainer/DPOTrainer/TRPOTrainer用于训练，其次，项目还调用各种知识库工具（neo4j, milvus, LDA, 等）进行自动化训练数据生成。另外，使用 vllm 用于推理…
☆60Jan 4, 2026Updated last month
ALucek / GRPO-Training
View on GitHub
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆36May 18, 2025Updated 8 months ago
Ruiqi-Yan / URO-Bench
View on GitHub
Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models
☆50Sep 2, 2025Updated 5 months ago
majinkai / dify-database-to-knowledge
View on GitHub
Write the database metadata into the dify knowledge
☆12Dec 30, 2025Updated last month
KenKaiii / b0t
View on GitHub
Workflow automation, but you just describe what you want and it happens.
☆26Nov 22, 2025Updated 2 months ago
roberthsu2003 / playwright_crawl4AI
View on GitHub
新世代的網路爬蟲
☆32Dec 13, 2025Updated 2 months ago
aws-samples / sample-data-analyst-bi
View on GitHub
A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…
☆25Jan 6, 2026Updated last month
Rongjiehuang / awesome-speech-to-speech-translation
View on GitHub
List of direct speech-to-speech translation papers.
☆38Jan 31, 2023Updated 3 years ago
TobyYang7 / Llava_Qwen2
View on GitHub
Visual Instruction Tuning for Qwen2 Base Model
☆41Jun 29, 2024Updated last year
ChenXi-Hu / Dify-GraphRag
View on GitHub
Use the knowledge graph generated by GraphRAG as the external knowledge base for the Dify workflow.
☆21Jun 4, 2025Updated 8 months ago
michalvavrecka / superkaraoker
View on GitHub
Makes karaoke from any youtube video link. The method is based on machine learning methods. After dowloading video from Youtube, the audi…
☆12Aug 6, 2023Updated 2 years ago
Yuto-Matsunaga / Prompt_Tuning_for_Audio_Deepfake_Detection
View on GitHub
☆12Nov 12, 2024Updated last year
creativescenius / java-a2a
View on GitHub
Java implementation for the Agent2Agent Protocol (A2A - https://github.com/google/A2A), enabling interaction between AI agents through a …
☆11Apr 21, 2025Updated 9 months ago
ourownstory / test-of-time
View on GitHub
A small framework to benchmark forecasting models via backtesting
☆13Nov 25, 2023Updated 2 years ago
wangle201210 / dify-retriever-mcp
View on GitHub
dify 知识库检索工具
☆13Apr 3, 2025Updated 10 months ago
PoTaTo-Mika / Shore-Data-Engine
View on GitHub
A codebase for data crawling and preprocessing for TTS and ASR systems training.
☆22Feb 5, 2026Updated last week
sliderSun / law_glm_baseline
View on GitHub
☆12Jun 28, 2024Updated last year
qxiaofan / awesome-sgbm-generate-cloud
View on GitHub
sgbm立体匹配算法以及生成点云
☆12Jan 29, 2021Updated 5 years ago
intellwe / ai-calling-agent
View on GitHub
A real-time voice AI system that integrates OpenAI's Realtime API, Llama3 with Twilio Voice to create intelligent voice conversations.
☆19Sep 6, 2025Updated 5 months ago
CXR-AL14 / CXR-Code
View on GitHub
☆13Sep 23, 2022Updated 3 years ago

owenliang / qwen2.5-0.5b-grpoView external linksLinks

Alternatives and similar repositories for qwen2.5-0.5b-grpo

owenliang / qwen2.5-0.5b-grpo
View external linksLinks