Complete Reinforcement Learning Toolkit for Large Language Models!
☆21Aug 2, 2025Updated 7 months ago
Alternatives and similar repositories for Q-Flow
Users that are interested in Q-Flow are comparing it to the libraries listed below.
- ☆12Sep 26, 2019Updated 6 years ago
- code submission to NeurIPS2019☆13Aug 9, 2023Updated 2 years ago
- Statistics on the space of asymmetric networks via Gromov-Wasserstein distance☆15Jun 13, 2020Updated 5 years ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- This repository mainly records search-engine study notes relevant to NLP algorithm engineers☆13Apr 9, 2022Updated 3 years ago
- ☆20Nov 3, 2024Updated last year
- Notes on learning the Java language☆13Aug 22, 2018Updated 7 years ago
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆37Mar 3, 2025Updated last year
- MATLAB code for Stein Point Markov Chain Monte Carlo.☆13Jul 3, 2019Updated 6 years ago
- GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Op…☆28Dec 11, 2025Updated 3 months ago
- Code for "Evaluating Spatial Understanding of Large Language Models" TMLR 2024.☆16Feb 22, 2024Updated 2 years ago
- [ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events☆10Dec 7, 2024Updated last year
- Official code of the MSF model for GZSSAR (ICIG 2023)☆14Jan 3, 2026Updated 2 months ago
- Codes for NIPS 2019 Paper: Rethinking Kernel Methods for Node Representation Learning on Graphs☆34Feb 20, 2020Updated 6 years ago
- "Applying Regularized Schrödinger-Bridge-Based Stochastic Process in Generative Modeling"☆11Aug 16, 2022Updated 3 years ago
- Official implementation of the paper "STARS: Self-supervised 3D Action Recognition with Contrastive Tuning".☆16Jan 6, 2025Updated last year
- Codebase of the paper "Aligning Protein Conformation Ensemble Generation with Physical Feedback" (ICML 2025)☆16Jul 6, 2025Updated 8 months ago
- Official PyTorch Implementation of Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos☆11Feb 10, 2026Updated last month
- A llama model inference framework implemented in CUDA C++☆63Nov 8, 2024Updated last year
- This repository mainly records reading notes on top-conference papers relevant to NLP algorithm engineers [Question Answering]☆22Mar 22, 2023Updated 3 years ago
- ☆11Apr 10, 2023Updated 2 years ago
- A Quasi-Wasserstein Loss for Learning Graph Neural Networks (QW loss)☆10May 20, 2024Updated last year
- [ACMMM 2024] An Inverse Partial Optimal Transport Framework for Music-guided Movie Trailer Generation☆16Mar 15, 2025Updated last year
- Geometric Algebra Flow Matching (GAFL) for Protein Backbone Generation☆18Oct 31, 2025Updated 4 months ago
- Learnable Global Pooling Layers Based on Regularized Optimal Transport (ROT)☆16Mar 17, 2024Updated 2 years ago
- ☆14Aug 21, 2025Updated 7 months ago
- ☆24Jan 30, 2025Updated last year
- ☆19Jul 7, 2024Updated last year
- torch7 wrapper for knn CUDA code☆10Dec 1, 2014Updated 11 years ago
- Software demonstrations for a course on Topological Data Analysis.☆12Mar 26, 2021Updated 4 years ago
- Ouroboros: On Accelerating Training of Transformer-Based Language Models☆10Nov 7, 2019Updated 6 years ago
- ☆15Feb 25, 2018Updated 8 years ago
- Reproduction of Chinese text error correction with soft-masked BERT☆21May 20, 2021Updated 4 years ago
- Fine-tuning the Qwen1.5 large model with LoRA via the PEFT framework, implementing text classification on the HC3-Chinese dataset.☆12Jun 29, 2024Updated last year
- Single Shot Super-Resolution with Shape from Shading using Opt-Solver and GPU acceleration☆15Jan 7, 2020Updated 6 years ago
- This repo contains the official code release of the Neural Experts paper, published in NeurIPS 2024.☆13Dec 3, 2024Updated last year
- The official project website of "Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition" (The paper of Ske2Grid is pub…☆19Sep 6, 2023Updated 2 years ago
- Source code for Noise-Contrastive Estimation for Multivariate Point Processes (NeurIPS 2020).☆15Nov 3, 2020Updated 5 years ago