Scripts for training Qwen 2.5 VL with ms-swift and GRPO
☆12Feb 27, 2025Updated last year
Alternatives and similar repositories for agent-rl
Users that are interested in agent-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- Code for riskmapr apps for invasive weed risk mapping☆10Jan 6, 2025Updated last year
- [CoRL 2025] Search-TTA: A Multimodal Test-Time Adaptation Framework for Visual Search in the Wild☆25Jan 23, 2026Updated 2 months ago
- ☆13Dec 9, 2024Updated last year
- Plot package similar to gnuplot☆23Mar 26, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is the official repository for paper “SAM-CDFFNet:SAM-based Cross-Dept Feature Fusion Net-work for Intelligent Identification of Lan…☆13Jul 16, 2024Updated last year
- Multiple Futures Prediction (MFP) on CARLA data☆12Apr 22, 2021Updated 4 years ago
- ☆15Apr 26, 2025Updated 11 months ago
- Explaining audio differences using language☆16Feb 11, 2025Updated last year
- ☆12Nov 12, 2024Updated last year
- A flexible and user-friendly tool designed to evaluate and benchmark image-based models.☆15May 15, 2025Updated 10 months ago
- 语音合成VITS 纯中文微调☆12Mar 15, 2023Updated 3 years ago
- A Primer on Python for Statistical Programming and Data Science☆26Mar 26, 2019Updated 6 years ago
- ☆10Jul 27, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A curated list of resources about SOLID, the future of the Web!☆15Apr 25, 2019Updated 6 years ago
- My collection of dotfiles☆14Mar 16, 2026Updated last week
- ☆13Aug 11, 2018Updated 7 years ago
- Improving transparency of large language models' reasoning☆15Nov 25, 2025Updated 4 months ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- ☆12Oct 4, 2021Updated 4 years ago
- What do CLIP Vision Transformers learn? Feature Visualization can show you!☆15Aug 29, 2024Updated last year
- This repository collects papers related to Speech Tokenizer.☆17Oct 16, 2024Updated last year
- ☆14Jul 24, 2025Updated 8 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Enemies for your LLM☆35Jan 20, 2026Updated 2 months ago
- Reasoning in Space via Grounding in the World (ICLR 2025)☆50Nov 3, 2025Updated 4 months ago
- da website☆11Mar 16, 2024Updated 2 years ago
- Project for training SSL-based deepfake speech detector☆47Mar 18, 2026Updated last week
- 2nd place solution for Xview2 challenge https://xview2.org/☆16Feb 27, 2020Updated 6 years ago
- ☆17Jan 29, 2026Updated last month
- A specification and user manual for the Intento API – a single API to Cognitive AI models from many vendors.☆41Jan 20, 2026Updated 2 months ago
- Lifting ControlNet for Generalized Depth Conditioning☆27Dec 28, 2023Updated 2 years ago
- Code for the paper "Reliability in Semantic Segmentation: Are We on the Right Track?", CVPR 2023☆23Jul 8, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python tools for working with DHI MIKE21☆28Apr 18, 2019Updated 6 years ago
- ☆14Feb 1, 2023Updated 3 years ago
- An Implementation of LoRa for EmComm (Emergency Communication) or (TacComm) Tactical Communication☆19Jul 23, 2025Updated 8 months ago
- Ressources for the session on machine learning and remote sensing at the OpenGeoHub Summer school in Münster 2019☆27Sep 3, 2019Updated 6 years ago
- A comapartive analysis of voice spoofing detection systems, based on a paper available at https://arxiv.org/abs/2210.00417.☆17Oct 24, 2022Updated 3 years ago
- Information Distillation Generative Adversrial Network in PyTorch☆27Jan 16, 2020Updated 6 years ago
- ☆43Sep 15, 2025Updated 6 months ago