Annotated tutorial of the Hugging Face TRL repo for reinforcement learning from human feedback (RLHF), connecting the equations of PPO and GAE to the corresponding lines of code in the PyTorch implementation
☆20Apr 4, 2025Updated last year
Alternatives and similar repositories for minichatgpt
Users that are interested in minichatgpt are comparing it to the libraries listed below.
- High-order syntactic parsing based on tree-structured conditional random fields☆16Apr 28, 2022Updated 4 years ago
- ODSC 2023 workshop materials on causal graphs using implementations of DoWhy (PyWhy, EconML)☆13Nov 1, 2023Updated 2 years ago
- .NET utility for displaying a cookie warning☆12Dec 14, 2015Updated 10 years ago
- It's my resume! Visit releases tab for PDF :)☆13Feb 25, 2026Updated 2 months ago
- ☆24Apr 16, 2019Updated 7 years ago
- A Chinese characters recognition repository with tensorrt format supported based on CRNN_Chinese_Characters_Rec and TensorRTx.☆18Mar 11, 2021Updated 5 years ago
- Locate an ID card via face recognition and extract the ID number☆18Feb 22, 2018Updated 8 years ago
- A repository for Chinese text normalization.☆20May 2, 2021Updated 5 years ago
- Reinforcement learning training for LLMs such as GPT-2, LLaMA, and BLOOM☆27Sep 19, 2023Updated 2 years ago
- Azure Web Site extension for logs☆18Apr 24, 2022Updated 4 years ago
- Machine learning resources☆16May 12, 2020Updated 6 years ago
- DigitalOcean Snapshot Hourly Automation. You no longer need to take snapshots and delete old ones manually.☆10May 5, 2019Updated 7 years ago
- My personal website.☆12Feb 24, 2023Updated 3 years ago
- Recommendations on how to solve and debug CORS issues when the Keycloak IdP is used☆16Dec 7, 2021Updated 4 years ago
- Wearable Data Layer example for DataMap objects☆15Jun 5, 2016Updated 9 years ago
- Android Sceneform integration with the Bullet Physics engine☆10Dec 15, 2018Updated 7 years ago
- Linear Gauge chart type for Chart.js☆14May 10, 2020Updated 6 years ago
- Uses ctypes and libespeak-ng to transform text into IPA phonemes☆26Sep 20, 2023Updated 2 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Dec 31, 2023Updated 2 years ago
- ☆27Jul 13, 2022Updated 3 years ago
- We found a bass. This is how we did it. -- Amazing Anglers☆15Aug 2, 2016Updated 9 years ago
- Fine-tune the BLOOM large language model with the LoRA method☆32Jun 9, 2023Updated 2 years ago
- Wearable data layer example for text messages☆13Nov 11, 2015Updated 10 years ago
- Codes for the ICLR 2022 paper: Trigger Hunting with a Topological Prior for Trojan Detection☆11Sep 19, 2023Updated 2 years ago
- Docker image for orchestrating backup of data and mysql containers, using duplicity at its core.☆24Aug 23, 2017Updated 8 years ago
- Examples and exercises based on some of the features of Java 10 (GA and Early Access builds)☆19Jun 23, 2018Updated 7 years ago
- Code for ACL2018 paper "Learn How to Actively Learn: An Imitation Learning Approach"☆10Mar 8, 2019Updated 7 years ago
- ☆14Apr 4, 2017Updated 9 years ago
- A Golang client for FalkorDB☆21May 11, 2026Updated last week
- A django app that contains the mosaico frontend and implements the mosaico backend☆18Apr 26, 2017Updated 9 years ago
- Sample Android app demonstrating OpenGL-ES 2.0 shader programs☆30Jul 27, 2016Updated 9 years ago
- ☆21Sep 2, 2024Updated last year
- A Sample Payment implementation using Stripe☆15Aug 7, 2015Updated 10 years ago
- Python tools to process rowing (the sport) data☆16May 14, 2026Updated last week
- Port Forward From Android Device on any Internet Gateway Device (Router) that has the UPNP option enabled.☆22Nov 3, 2014Updated 11 years ago
- ☆20May 14, 2025Updated last year
- The Speech Rate Meter (hereinafter SRM) software module is designed to measure a complex of characteristics of the tempo (rate) of oral s…☆23Jul 11, 2024Updated last year
- A2A MCP Server is a lightweight Python bridge that lets Claude Desktop or any MCP client talk to A2A agents. It provides three tools: reg…☆21May 4, 2025Updated last year
- DEPRECATED -- real-time co-operative LaTeX editing☆29Dec 15, 2011Updated 14 years ago