A minimum example of aligning language models with RLHF similar to ChatGPT
☆226 · Sep 26, 2023 · Updated 2 years ago
Alternatives and similar repositories for minChatGPT
Users that are interested in minChatGPT are comparing it to the libraries listed below.
- Anh - LAION's multilingual assistant datasets and models ☆28 · Apr 5, 2023 · Updated 3 years ago
- 🥇 LG-AI-Challenge 2022 1st-place solution. ☆13 · Jun 6, 2023 · Updated 2 years ago
- A (somewhat) minimal library for finetuning language models with PPO on human feedback. ☆91 · Nov 23, 2022 · Updated 3 years ago
- Official PyTorch implementation for "Effective and Efficient Masked Image Generation Models" ☆32 · Apr 8, 2025 · Updated last year
- Fullstack machine learning inference template ☆31 · Nov 24, 2023 · Updated 2 years ago
- ☆13 · Jul 31, 2023 · Updated 2 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,743 · Jan 8, 2024 · Updated 2 years ago
- ☆18 · Dec 18, 2022 · Updated 3 years ago
- Code base for internal reward models and PPO training ☆24 · Oct 1, 2023 · Updated 2 years ago
- ☆23 · Oct 30, 2023 · Updated 2 years ago
- ☆16 · Nov 18, 2020 · Updated 5 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ☆183 · Jun 18, 2023 · Updated 2 years ago
- Blog of the LibreCV.org ☆11 · May 17, 2021 · Updated 4 years ago
- benchmarks for evaluating MT models ☆11 · Jun 26, 2024 · Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 · Apr 17, 2024 · Updated 2 years ago
- TACO: TFBS-Aware Cis-Regulatory Element Optimization ☆21 · Aug 1, 2025 · Updated 8 months ago
- ICONIP2021 - A Vietnamese Medical Dataset for IC and NER ☆25 · Aug 8, 2023 · Updated 2 years ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving ☆24 · May 1, 2024 · Updated last year
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation ☆27 · Jun 7, 2024 · Updated last year
- ChatGPT solutions for the MLE interview ☆14 · Dec 9, 2022 · Updated 3 years ago
- Seq2seq using LSTM with attention from Luong et al ☆10 · Oct 2, 2018 · Updated 7 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆210 · Jan 13, 2024 · Updated 2 years ago
- Transformation spoken text to written text ☆31 · May 14, 2024 · Updated last year
- Code for paper: "Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design" ☆68 · May 10, 2025 · Updated 11 months ago
- c++ implementation of alphagozero ☆15 · May 29, 2018 · Updated 7 years ago
- LLM GPU Calculator ☆21 · Aug 19, 2023 · Updated 2 years ago
- [NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking ☆30 · Mar 18, 2026 · Updated last month
- JAX notebook showing how to LoRA + GPTQ arbitrary models ☆10 · Aug 8, 2023 · Updated 2 years ago
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM ☆7,873 · Oct 11, 2025 · Updated 6 months ago
- JAX implementation of GPTQ quantization algorithm ☆10 · Jul 19, 2023 · Updated 2 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code. ☆22 · Nov 26, 2022 · Updated 3 years ago
- A very-minimal command-line parser ☆20 · Jul 28, 2025 · Updated 8 months ago
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333 ☆1,163 · Jan 11, 2024 · Updated 2 years ago
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM… ☆21 · Jun 19, 2021 · Updated 4 years ago
- [ICML2025] The official implementation of "PolyConf: Unlocking Polymer Conformation Generation through Hierarchical Generative Models" ☆34 · Mar 18, 2026 · Updated last month
- Evaluating genomic sequence models for explaining personalized expression variation ☆20 · Dec 6, 2023 · Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆97 · Feb 9, 2023 · Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights ☆19 · Oct 9, 2022 · Updated 3 years ago
- A list of totally open alternatives to ChatGPT ☆4,757 · May 3, 2023 · Updated 2 years ago