annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation
☆20Apr 4, 2025Updated 11 months ago
Alternatives and similar repositories for minichatgpt
Users that are interested in minichatgpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于树形条件随机场的高阶句法分析☆16Apr 28, 2022Updated 3 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Aug 31, 2020Updated 5 years ago
- 🎉🎨 This repository contains a reading list of papers on Embodied AI, including LLM/MLLM/VLA.☆13Aug 18, 2025Updated 7 months ago
- 通过人脸识别定位身份证获取身份证号☆18Feb 22, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A repository for Chinese text normalization.☆20May 2, 2021Updated 4 years ago
- ☆27May 27, 2017Updated 8 years ago
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Dec 31, 2023Updated 2 years ago
- Finetune Bloom big language model with Lora method☆32Jun 9, 2023Updated 2 years ago
- Codes for the ICLR 2022 paper: Trigger Hunting with a Topological Prior for Trojan Detection☆11Sep 19, 2023Updated 2 years ago
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆45May 12, 2025Updated 10 months ago
- Train tacotron on a mandarin dataset☆18May 6, 2019Updated 6 years ago
- ☆19May 14, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Custom TensorFlow2 implementations of forward and backward computation of soft-DTW algorithm in batch mode.☆21Jun 7, 2021Updated 4 years ago
- A2A MCP Server is a lightweight Python bridge that lets Claude Desktop or any MCP client talk to A2A agents. It provides three tools: reg…☆21May 4, 2025Updated 10 months ago
- DEPRECATED -- real-time co-operative LaTeX editing☆29Dec 15, 2011Updated 14 years ago
- The Speech Rate Meter (hereinafter SRM) software module is designed to measure a complex of characteristics of the tempo (rate) of oral s…☆23Jul 11, 2024Updated last year
- ☆16Nov 30, 2022Updated 3 years ago
- ☆11Feb 9, 2024Updated 2 years ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- GUARDRAIL - MCP Security - Gateway for Unified Access, Resource Delegation, and Risk-Attenuating Information Limits☆18Jul 21, 2025Updated 8 months ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- GRPC client CLI, like grpcurl, but in Rust; GRPC Client UI, like postman, but in Rust☆21Jan 29, 2026Updated 2 months ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- ☆23Jun 22, 2025Updated 9 months ago
- ☆14May 3, 2022Updated 3 years ago
- Example implementation of Zeebe workflows using pyzeebe.☆12Jun 1, 2021Updated 4 years ago
- Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD☆23Dec 13, 2022Updated 3 years ago
- OpenAI ROS☆12Mar 7, 2019Updated 7 years ago
- FreeBuf文章代码示例☆13Oct 16, 2017Updated 8 years ago
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package.☆22Apr 19, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Anomaly detection using RAG☆17Apr 22, 2024Updated last year
- Complete educational guide with 100+ working code examples for building production-ready AI agents using the OpenAI Agents SDK. Learn t…☆17Jan 10, 2026Updated 2 months ago
- Perf monitoring CLI tool for Apple Silicon☆10Jan 25, 2023Updated 3 years ago
- LLM Proxy☆12Aug 26, 2024Updated last year
- Bug Report driven Program Repair☆17Feb 15, 2020Updated 6 years ago
- An implement of SPEECHSPLIT☆15Sep 12, 2020Updated 5 years ago
- Extremely simple MoE implementation, mostly based off Switch Transformer☆13Feb 26, 2024Updated 2 years ago