yuniaXian / ppo_llm_DeepSpeed

Customized LLM PPO (reinforcement learning) pipeline with DeepSpeed. For Amex external usage. Trains a reward model and actor-critic models against a reference supervised fine-tuned model.

☆1

Alternatives and similar repositories for ppo_llm_DeepSpeed

Users that are interested in ppo_llm_DeepSpeed are comparing it to the libraries listed below

yuniaXian / fairseq_kg
Implementation of a knowledge-graph-to-text model, integrated with Fairseq (Meta FAIR research library)
☆2 Updated last year
ripsweet / usaco-solutions-sale
☆1 Updated last year
ehsantg / telegram-admin-counter
find channel admin count
☆21 Updated 7 years ago
Firstzada / Discord-Bot-Base
TypeScript command handler
☆23 Updated last year
oxtx / extension-crypto
Extension of SMx crypto support for the Go standard library
☆2 Updated 2 years ago
szykor18 / Lottery
Full Stack Lottery Web Application
☆2 Updated 6 months ago
ehsantg / react-tags
A fantastically simple tagging component for your React projects
☆3 Updated 6 years ago
yuniaXian / rasa_calm_chatbot
💬 Customized Rasa chatbot framework based on an LLM to automate text- and voice-based conversations
☆1 Updated last year
yuniaXian / llm_langchain_projects
Collection of llm_langchain_projects: Autolabelling, Search and Indexing
☆6 Updated last year
oxtx / DexApp
DEX platform - Zuniswap
☆3 Updated last year
IlyasMohetna / GSB_laboratoire
☆3 Updated last year
Omid774 / Pick-Photo-from-Photos-Library
Pick Photo from iPhone Photos Library.
☆3 Updated 3 years ago
ehsantg / inline-like-bot
Telegram Inline Like Bot PHP
☆21 Updated 8 years ago
oxtx / CryptoTrader
crypto trader bot
☆2 Updated 2 years ago
roeintheglasses / valpapers
valpapers repo
☆4 Updated last year
CerberusChaos / create-starknet-dapp
create-starknet is a tool to quickly start a project from a basic template for popular frameworks.
☆1 Updated last year
Cloudbit-Global / core
Go implementation of the Cloudbit Classic (CDBC) ecosystem
☆12 Updated 8 months ago
Omid774 / NewsApiApp
☆8 Updated last year
GrowTax / Growtopia-Eternity-Stealer
GT Stealer
☆5 Updated last year
Omid774 / MoviesAPI-Practice
A MoviesAPI practice project for learning code and algorithms in more depth.
☆2 Updated last year
neals-sudo / PseudoGenius-AI
Join the coding revolution! Unleash your potential and transform the way you program with AI-powered pseudocode conversion across languag…
☆3 Updated last year
jevgenimarenkov / guideliner-server
The usability and accessibility evaluation tool
☆6 Updated last year
dostogircse171 / solvyy-comparison-table
A nice looking responsive Pure HTML, CSS, JS comparison table
☆5 Updated last year
dostogircse171 / amazon_product_scrap
Amazon product Scraping using Django and Selenium
☆4 Updated last year
Lokistic / undercover-osu
undercover.host OSU! Cheat / 2021 version
☆1 Updated last year
pub-calculator-io / distance-calculator
Free WordPress Plugin: These calculators find the distance between two points on a 2D plane, in a 3D space, as well as along the surface …
☆376 Updated last year
Lokistic / undercover-csgo
undercover.host CS:GO Cheat / 2021 version
☆2 Updated last year
pub-calculator-io / feet-and-inches-calculator
Free WordPress Plugin: A feet and inches calculator helps with math problems. Add feet and inches, subtract, multiply, or divide them wit…
☆19 Updated last year
pub-calculator-io / investment-calculator
Free WordPress Plugin: This free investment calculator considers the initial and ending balances, return rate, and investment time when e…
☆351 Updated last year
komeilkma / Terminator-Samurai-VPN
Hidden VPN protocol to avoid large scale blocking of TLS-based censorship circumvention
☆85 Updated 2 years ago