annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation
☆20Apr 4, 2025Updated last year
Alternatives and similar repositories for minichatgpt
Users that are interested in minichatgpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆13Aug 31, 2020Updated 5 years ago
- A repository for Chinese text normalization.☆20May 2, 2021Updated 5 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆28Dec 31, 2023Updated 2 years ago
- Finetune Bloom big language model with Lora method☆32Jun 9, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Codes for the ICLR 2022 paper: Trigger Hunting with a Topological Prior for Trojan Detection☆11Sep 19, 2023Updated 2 years ago
- Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.☆23Mar 14, 2019Updated 7 years ago
- Train tacotron on a mandarin dataset☆18May 6, 2019Updated 7 years ago
- ☆22May 14, 2025Updated last year
- DEPRECATED -- real-time co-operative LaTeX editing☆29Dec 15, 2011Updated 14 years ago
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"☆11May 9, 2023Updated 3 years ago
- ☆11Feb 9, 2024Updated 2 years ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- GUARDRAIL - MCP Security - Gateway for Unified Access, Resource Delegation, and Risk-Attenuating Information Limits☆17Jul 21, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 4 years ago
- GRPC client CLI, like grpcurl, but in Rust; GRPC Client UI, like postman, but in Rust☆21Jan 29, 2026Updated 5 months ago
- 这是一个面向币圈新手的入门速通指南集合,包括最全面的币圈区块链资源集合,包含各类工具导航,快速了解币圈常用术语和行话,详细的防骗指南,助你规避各类风险☆20May 7, 2026Updated last month
- Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD☆23Dec 13, 2022Updated 3 years ago
- OpenAI ROS☆12Mar 7, 2019Updated 7 years ago
- ☆14Nov 15, 2022Updated 3 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- Perf monitoring CLI tool for Apple Silicon☆10Jan 25, 2023Updated 3 years ago
- LLM Proxy☆14Aug 26, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Bug Report driven Program Repair☆16Feb 15, 2020Updated 6 years ago
- An implement of SPEECHSPLIT☆15Sep 12, 2020Updated 5 years ago
- Extremely simple MoE implementation, mostly based off Switch Transformer☆13Feb 26, 2024Updated 2 years ago
- Complete educational guide with 100+ working code examples for building production-ready AI agents using the OpenAI Agents SDK. Learn t…☆21Jan 10, 2026Updated 5 months ago
- A custom Docker image containing the theia-ide for Java development☆15Nov 30, 2021Updated 4 years ago
- ☆26Jun 22, 2025Updated last year
- Predict scalar coupling in molecules☆15Mar 14, 2021Updated 5 years ago
- sound stretch python module☆11May 1, 2019Updated 7 years ago
- ☆12Jul 5, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14May 3, 2022Updated 4 years ago
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)☆19Oct 13, 2024Updated last year
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…☆16Dec 11, 2023Updated 2 years ago
- Hinton's Forward-Forward Algorithm for Deep Learning☆10Feb 6, 2023Updated 3 years ago
- ☆55May 9, 2025Updated last year
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆22Sep 21, 2025Updated 9 months ago
- GPT-J 6B inference on TensorRT with INT-8 precision☆11Apr 5, 2023Updated 3 years ago