Complete Reinforcement Learning Toolkit for Large Language Models!
☆21Aug 2, 2025Updated 7 months ago
Alternatives and similar repositories for Q-Flow
Users that are interested in Q-Flow are comparing it to the libraries listed below
Sorting:
- Edge-weighted online bipartite matching (JACM 2022)☆12Jun 18, 2023Updated 2 years ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Hybrid Linear UCB Multi-arm Bandit library☆14Oct 5, 2016Updated 9 years ago
- code submission to NeurIPS2019☆13Aug 9, 2023Updated 2 years ago
- ☆20Nov 3, 2024Updated last year
- Contains the code relative to the paper Partial Gromov-Wasserstein with Applications on Positive-Unlabeled Learning https://arxiv.org/abs…☆21Mar 3, 2020Updated 6 years ago
- Dateset Reset Policy Optimization☆31Apr 12, 2024Updated last year
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆37Mar 3, 2025Updated last year
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆39May 28, 2025Updated 9 months ago
- ☆10Nov 17, 2022Updated 3 years ago
- ☆16Apr 19, 2024Updated last year
- ☆11Aug 20, 2025Updated 6 months ago
- MATLAB code for Stein Point Markov Chain Monte Carlo.☆13Jul 3, 2019Updated 6 years ago
- Codes for NIPS 2019 Paper: Rethinking Kernel Methods for Node Representation Learning on Graphs☆34Feb 20, 2020Updated 6 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 7 years ago
- nsfc - 国家自然科学基金项目LaTeX模版(面青地)☆10Jan 6, 2026Updated last month
- python越南语分词器☆10Nov 14, 2019Updated 6 years ago
- Fast Approximate Quadratic Assignment for (Brain) Graph Matching☆16Aug 23, 2016Updated 9 years ago
- ☆11Jan 12, 2023Updated 3 years ago
- GraphQL and Rest API rewrite of the current Open Targets platform API☆15Updated this week
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- Code for paper Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks☆12Aug 9, 2022Updated 3 years ago
- 腾讯安全智能渗透挑战赛获奖团队答辩材料及项目列表☆80Updated this week
- This repo contains the official code release of the Neural Experts paper, published in NeurIPS 2024.☆13Dec 3, 2024Updated last year
- 练习题,python 协同过滤ALS模型实现:商品推荐 + 用户人群放大☆10Jun 4, 2020Updated 5 years ago
- ☆11Apr 10, 2023Updated 2 years ago
- ☆12Oct 5, 2020Updated 5 years ago
- Udacity Self Driving Car Nanodegree - Vehicle Detection☆10Oct 30, 2018Updated 7 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated 11 months ago
- ☆11Mar 23, 2025Updated 11 months ago
- A Quasi-Wasserstein Loss for Learning Graph Neural Networks (QW loss)☆10May 20, 2024Updated last year
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆16Dec 8, 2025Updated 2 months ago
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 2 years ago
- ☆13May 21, 2023Updated 2 years ago
- [WWW 25] USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical Reaction Dataset☆16Dec 12, 2024Updated last year
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).☆44Aug 6, 2024Updated last year
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆13Jan 2, 2024Updated 2 years ago
- ☆14Jan 24, 2025Updated last year