☆19Aug 9, 2024Updated last year
Alternatives and similar repositories for Simple_TRL
Users that are interested in Simple_TRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆76Nov 13, 2023Updated 2 years ago
- 基于DPO算法微调语言大模型,简单好上手。☆51Jul 3, 2024Updated last year
- ☆46Aug 9, 2024Updated last year
- A track playback software for csv file developed by QT☆11Feb 6, 2022Updated 4 years ago
- ☆36Mar 25, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- code for☆11Apr 10, 2021Updated 5 years ago
- 目前各大高校领域将各种信息分布在不同的部门信息门户下,存在典型的信息孤岛问题,各个部门信息没有形成互通。当前,老师和学生存在很多有关本校相关文件、政策和活动等众多方面智能问答的统一入口的需求,例如财务处、人事处、学工处、教务处、图书馆等存在各种政策和文件规定,目前在校师生都…☆36Aug 5, 2024Updated last year
- Learning Evasion Strategy in Pursuit-Evasion by Deep Q-Network, ICPR2018.☆13Dec 22, 2018Updated 7 years ago
- ☆11Aug 13, 2024Updated last year
- 基于ROS的多无人机协同控制☆12May 8, 2021Updated 4 years ago
- Code and dataset for the paper 'Optimized Prediction of Weapon Effectiveness in BVR Air Combat Scenarios Using Enhanced Regression Models…☆17Jun 29, 2025Updated 9 months ago
- Interactive Multi-Agent Reinforcement Learning Environment for the board game Gobblet using PettingZoo.☆12Jul 2, 2023Updated 2 years ago
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge☆11Aug 10, 2024Updated last year
- triangle is a tiny 3D render engine for learning 3D render technologies.☆13Apr 17, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The code for paper 'Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tr…☆21Aug 18, 2023Updated 2 years ago
- Application of DDPG on Pursuit-Evasion Problem☆13Feb 3, 2021Updated 5 years ago
- A simple 3D WebGIS Application code set for the primary learners.☆12Apr 30, 2022Updated 3 years ago
- yolov3 for rubbish detection☆15Jun 22, 2022Updated 3 years ago
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated last year
- ☆14Mar 30, 2021Updated 5 years ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆73May 17, 2024Updated last year
- SSD(single shot multibox detector) data augment -- python☆18Nov 19, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- code for A Large-scale Dataset for Audio-Language Representation Learning☆14Sep 18, 2024Updated last year
- ☆12Mar 2, 2022Updated 4 years ago
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆22Oct 14, 2025Updated 6 months ago
- my final work in NLP class☆14Dec 22, 2024Updated last year
- java implementation of Bert Tokenizer, support output onnx tensor for onnx model inference☆13Sep 4, 2023Updated 2 years ago
- Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL☆18Nov 21, 2023Updated 2 years ago
- Awesome-Text2Motion-Generation☆18Oct 26, 2023Updated 2 years ago
- ☆17Nov 29, 2023Updated 2 years ago
- gRPC examples in C++11☆11Aug 1, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Springboot与Thymeleaf模板引擎整合☆13Sep 27, 2017Updated 8 years ago
- ☆10Apr 15, 2023Updated 3 years ago
- The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"☆21Jul 21, 2025Updated 8 months ago
- ☆12Jul 9, 2025Updated 9 months ago
- ☆23Apr 16, 2024Updated 2 years ago
- Yolov5 tensorflow实现☆12Sep 4, 2020Updated 5 years ago
- 在您的机器上本地离线运行 AI 模型☆11May 8, 2025Updated 11 months ago