Annotated tutorial of the Hugging Face TRL repo for reinforcement learning from human feedback (RLHF), connecting the equations of PPO and GAE to the corresponding lines of code in the PyTorch implementation
☆20Apr 4, 2025Updated last year
Alternatives and similar repositories for minichatgpt
Users that are interested in minichatgpt are comparing it to the libraries listed below.
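- High-order syntactic parsing based on tree-structured conditional random fields☆16Apr 28, 2022Updated 4 years ago
- Multimodal language model for Chinese document understanding, supporting multimodal document information extraction and document embedding☆12Jun 26, 2022Updated 3 years ago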
- ☆24Apr 16, 2019Updated 7 years ago
- A Chinese character recognition repository with TensorRT format support, based on CRNN_Chinese_Characters_Rec and TensorRTx.☆18Mar 11, 2021Updated 5 years ago
- Reinforcement learning training for LLMs such as GPT-2, LLaMA, and BLOOM☆27Sep 19, 2023Updated 2 years ago
- (discontinued) 🎵The Formant-Based All Language Singing Voice Synthesis System: Sinsy-NG☆21Jan 14, 2022Updated 4 years ago
- This repository covers conversion of the CRNN model, which is widely used for text recognition. The CRNN model is converted from PyTorch…☆25Jun 26, 2020Updated 5 years ago
- Fine-tune the BLOOM large language model with the LoRA method☆32Jun 9, 2023Updated 3 years ago
- Codes for the ICLR 2022 paper: Trigger Hunting with a Topological Prior for Trojan Detection☆11Sep 19, 2023Updated 2 years ago
- A Golang client for FalkorDB☆22Updated this week
- ☆21May 14, 2025Updated last year
- Custom TensorFlow2 implementations of forward and backward computation of soft-DTW algorithm in batch mode.☆21Jun 7, 2021Updated 5 years ago
- A2A MCP Server is a lightweight Python bridge that lets Claude Desktop or any MCP client talk to A2A agents. It provides three tools: reg…☆21May 4, 2025Updated last year
- The Speech Rate Meter (hereinafter SRM) software module is designed to measure a complex of characteristics of the tempo (rate) of oral s…☆23Jul 11, 2024Updated last year
- DEPRECATED -- real-time co-operative LaTeX editing☆29Dec 15, 2011Updated 14 years ago
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"☆11May 9, 2023Updated 3 years ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆13Aug 17, 2023Updated 2 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- Open source implementation of InstructGPT (not finished)☆31Apr 13, 2023Updated 3 years ago
- OpenAI ROS☆12Mar 7, 2019Updated 7 years ago
- oce☆34Dec 7, 2021Updated 4 years ago
- ☆14Nov 15, 2022Updated 3 years ago
- ☆74Dec 8, 2025Updated 6 months ago
- Code examples for FreeBuf articles☆13Oct 16, 2017Updated 8 years ago
- Perf monitoring CLI tool for Apple Silicon☆10Jan 25, 2023Updated 3 years ago
- LLM Proxy☆13Aug 26, 2024Updated last year
- An implementation of SPEECHSPLIT☆15Sep 12, 2020Updated 5 years ago
- Extremely simple MoE implementation, mostly based off Switch Transformer☆13Feb 26, 2024Updated 2 years ago
- DFT-based text image rotation correction using OpenCV☆39Nov 25, 2013Updated 12 years ago
- A custom Docker image containing the theia-ide for Java development☆15Nov 30, 2021Updated 4 years ago
- Mining tool and large-scale datasets of single statement bug fixes in Python☆19Nov 29, 2023Updated 2 years ago
- ☆26Jun 22, 2025Updated 11 months ago
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses, with the possibility of training/evaluating on…☆141Jan 15, 2024Updated 2 years ago
- Code for training & inference with FLAN family of models☆17May 23, 2023Updated 3 years ago
- Transformer from scratch with einsum method☆11Jul 8, 2021Updated 4 years ago
- Sound stretch Python module☆11May 1, 2019Updated 7 years ago
- ☆13Jul 5, 2023Updated 2 years ago
- ☆16Apr 21, 2022Updated 4 years ago