☆46Aug 9, 2024Updated last year
Alternatives and similar repositories for Simple_LLM_PPO
Users that are interested in Simple_LLM_PPO are comparing it to the libraries listed below.
- ☆76Nov 13, 2023Updated 2 years ago
- Train an LLM from scratch on a single 24 GB GPU☆55Jul 9, 2025Updated 9 months ago
- Code for AAAI2020 paper: "Revision in Continuous Space: Unsupervised Text Style Transfer without Adversarial Learning"☆21Sep 28, 2020Updated 5 years ago
- ☆17Apr 8, 2024Updated 2 years ago
- A simple deep learning framework inspired by Dezero and PyTorch☆31Jan 27, 2025Updated last year
- This repository contains the code and pre-trained models for our paper☆23Jun 29, 2025Updated 9 months ago
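- SoulStar is a psychological counseling LLM with the persona of a gentle, caring big sister; it analyzes the problems users confide in detail, offers practical advice and comfort, and replies with cute emoticons and kaomoji~~(*╹▽╹*)☆33Mar 3, 2024Updated 2 years ago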
- ☆33Sep 19, 2025Updated 6 months ago
- ☆13Sep 2, 2023Updated 2 years ago
- ROS/Gazebo controller of a quadrotor controlled by feedback linearization and artificial potential field☆10Sep 1, 2020Updated 5 years ago
- ☆12Jul 25, 2023Updated 2 years ago
- A simple ROS-Gazebo package provides a quick headstart for testing high level path planning / visual servoing algorithms on multiple fixe…☆12Feb 28, 2020Updated 6 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- Debug DeepSpeed-Chat step by step in an IDE☆10Apr 17, 2023Updated 2 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- ☆13Jul 20, 2021Updated 4 years ago
- Quasi-steady aerodynamics and control for flapping wing UAV on a Lie group☆13Nov 29, 2023Updated 2 years ago
- ☆20Jan 26, 2026Updated 2 months ago
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆45Mar 18, 2026Updated 3 weeks ago
- TweetFinSent: A Dataset of Stock Sentiments on Twitter☆13Jul 7, 2022Updated 3 years ago
- Automated detection of exudates from fundus images plays an important role in diabetic retinopathy (DR) screening and evaluation, for whi…☆11Dec 11, 2020Updated 5 years ago
- Multicultural Proverbs and Sayings☆13Jan 11, 2025Updated last year
- Web one-click mode full process platform, including train data upload, fine-tuning, model merge, model deploy, gpu monitor etc., no need …☆19Nov 28, 2023Updated 2 years ago
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023☆21Mar 12, 2026Updated last month
- Autonomous Quadrotor Simulation Research Platform based on ROS, Gazebo, Pixhawk, DRCsim..☆15May 19, 2019Updated 6 years ago
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated 2 years ago
- The official MATLAB implementation of IEEE Transactions on Multimedia 2020 paper "Pixel-level Non-local Image Smoothing with Objective E…☆19Nov 22, 2020Updated 5 years ago
- ☆19Jun 13, 2024Updated last year
- This project utilizes deep reinforcement learning techniques to train a robot, which combines a mobile platform and a Panda robotic arm, …☆10Jun 7, 2023Updated 2 years ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- Tuning BERT☆10Jun 28, 2022Updated 3 years ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆41Jun 23, 2025Updated 9 months ago
- ☆12Sep 25, 2021Updated 4 years ago
- [NLPCC 2024] Shared Task 10: Regulating Large Language Models☆14Jun 12, 2024Updated last year
- ☆13May 12, 2025Updated 11 months ago
- Causal Inference for Time Series Data (with CausalML Demo)☆14Jun 11, 2023Updated 2 years ago
- pmr aims to register a pointcloud to a CAD model☆10Oct 28, 2018Updated 7 years ago
- A Chinese LLM with a Western "translationese" chat style, fine-tuned entirely on ChatGPT-generated data☆21Apr 1, 2024Updated 2 years ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆47Dec 17, 2025Updated 3 months ago