annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation
☆20Apr 4, 2025Updated 11 months ago
Alternatives and similar repositories for minichatgpt
Users that are interested in minichatgpt are comparing it to the libraries listed below
Sorting:
- ☆11Feb 9, 2024Updated 2 years ago
- Transformer from scratch with einsum method☆11Jul 8, 2021Updated 4 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- ☆19May 14, 2025Updated 9 months ago
- OpenAI ROS☆12Mar 7, 2019Updated 7 years ago
- MySQL Query MCP server for AI assistants - execute read-only MySQL queries☆12May 31, 2025Updated 9 months ago
- 这是一个面向币圈新手的入门速通指南集合,包括最全面的币圈区块链资源集合,包含各类工具导航,快速了解币圈常用术语和行话,详细的防骗指南,助你规避各类风险☆19Feb 10, 2026Updated 3 weeks ago
- A real-time visual analysis software for depth tracking in noisy oil and gas environments.☆12Jan 13, 2026Updated last month
- A custom Docker image containing the theia-ide for Java development☆15Nov 30, 2021Updated 4 years ago
- ☆12Nov 15, 2022Updated 3 years ago
- Perf monitoring CLI tool for Apple Silicon☆10Jan 25, 2023Updated 3 years ago
- An implementation of the AlphaZero algorithm by Google Deepmind. Research paper here: https://arxiv.org/abs/1911.08265☆12Oct 10, 2024Updated last year
- Complete educational guide with 100+ working code examples for building production-ready AI agents using the OpenAI Agents SDK. Learn t…☆16Jan 10, 2026Updated last month
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"☆11May 9, 2023Updated 2 years ago
- GRPC client CLI, like grpcurl, but in Rust; GRPC Client UI, like postman, but in Rust☆20Jan 29, 2026Updated last month
- GPT-J 6B inference on TensorRT with INT-8 precision☆11Apr 5, 2023Updated 2 years ago
- Extremely simple MoE implementation, mostly based off Switch Transformer☆13Feb 26, 2024Updated 2 years ago
- Codes for the ICLR 2022 paper: Trigger Hunting with a Topological Prior for Trojan Detection☆11Sep 19, 2023Updated 2 years ago
- Code for ACL2018 paper "Learn How to Actively Learn: An Imitation Learning Approach"☆10Mar 8, 2019Updated 7 years ago
- Hinton's Forward-Forward Algorithm for Deep Learning☆10Feb 6, 2023Updated 3 years ago
- Example implementation of Zeebe workflows using pyzeebe.☆12Jun 1, 2021Updated 4 years ago
- ☆15Jun 8, 2022Updated 3 years ago
- This project proposed a method to defense against adversarial attack. By combining the proposed preprocessing method with an adversariall…☆10Oct 4, 2018Updated 7 years ago
- Official implementation of the Informed Dreamer algorithm, based on DreamerV3☆19Jan 29, 2026Updated last month
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆13Aug 17, 2023Updated 2 years ago
- sound stretch python module☆11May 1, 2019Updated 6 years ago
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…☆15Dec 11, 2023Updated 2 years ago
- GUARDRAIL - MCP Security - Gateway for Unified Access, Resource Delegation, and Risk-Attenuating Information Limits☆17Jul 21, 2025Updated 7 months ago
- LLM Proxy☆12Aug 26, 2024Updated last year
- Implementation of True Online TD(lambda) with a Fourier Basis function approximator.☆13May 9, 2015Updated 10 years ago
- Examples of Debug Adapter Protocol (DAP) implementation☆13Jun 22, 2021Updated 4 years ago
- ☆21Jun 22, 2025Updated 8 months ago
- DEPRECATED -- real-time co-operative LaTeX editing☆29Dec 15, 2011Updated 14 years ago
- ☆18Oct 18, 2022Updated 3 years ago
- Vision Foundation Models: SAM, ViT, CLIP, DINOv2, object detection, segmentation, and multimodal AI for computer vision.☆17Nov 10, 2025Updated 3 months ago
- an open-source audio processing library that allows changing the sound tempo, pitch and playback rate parameters independently from each…☆20Mar 22, 2019Updated 6 years ago
- ☆13Jan 27, 2019Updated 7 years ago
- ☆13Jul 5, 2023Updated 2 years ago
- Tasks for describing differences between text distributions.☆17Aug 9, 2024Updated last year