ethanyanjiali / minChatGPTView external linksLinks
A minimum example of aligning language models with RLHF similar to ChatGPT
β226Sep 26, 2023Updated 2 years ago
Alternatives and similar repositories for minChatGPT
Users that are interested in minChatGPT are comparing it to the libraries listed below
Sorting:
- Anh - LAION's multilingual assistant datasets and modelsβ27Apr 5, 2023Updated 2 years ago
- π₯ LG-AI-Challenge 2022 1μ μ루μ μ λλ€.β13Jun 6, 2023Updated 2 years ago
- A (somewhat) minimal library for finetuning language models with PPO on human feedback.β90Nov 23, 2022Updated 3 years ago
- β18Dec 18, 2022Updated 3 years ago
- β13Jul 31, 2023Updated 2 years ago
- ZYN: Zero-Shot Reward Models with Yes-No Questionsβ35Aug 15, 2023Updated 2 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)β4,742Jan 8, 2024Updated 2 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.β22Nov 26, 2022Updated 3 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generationβ27Jun 7, 2024Updated last year
- β10Oct 11, 2022Updated 3 years ago
- Code base for internal reward models and PPO trainingβ24Oct 1, 2023Updated 2 years ago
- β16Dec 21, 2023Updated 2 years ago
- JAX implementation of GPTQ quantization algorithmβ10Jul 19, 2023Updated 2 years ago
- β12Feb 9, 2022Updated 4 years ago
- JAX notebook showing how to LoRA + GPTQ arbitrary modelsβ10Aug 8, 2023Updated 2 years ago
- Seq2seq using LSTM with attention from Luong et alβ10Oct 2, 2018Updated 7 years ago
- benchmarks for evaluating MT modelsβ11Jun 26, 2024Updated last year
- Blog of the LibreCV.orgβ11May 17, 2021Updated 4 years ago
- ChatGPT solutions for the MLE interviewβ14Dec 9, 2022Updated 3 years ago
- Deterministic Acyclic Finite State Automaton implementation for morphological analysisβ18Dec 17, 2020Updated 5 years ago
- Fullstack machine learning inference templateβ31Nov 24, 2023Updated 2 years ago
- TPUμμ νκ΅μ΄μ© LLM μΆλ‘ μ μν Jax/Flax ꡬν체μ λλ€.β12Jun 12, 2023Updated 2 years ago
- AI for Mathematics Paper Listβ17Jan 14, 2025Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigsβ¦β182Jun 18, 2023Updated 2 years ago
- Converts LinkedIn API results to a lightly opinionated JSON RΓ©sumΓ© output.β15Sep 6, 2016Updated 9 years ago
- 컀λ²λ¦¬μ€νΈ - λΆ μ»€λ² μμ± AI μλΉμ€β13Sep 11, 2022Updated 3 years ago
- Ollama Mistral with Langchain RAG Agent and Custom toolsβ11Jul 6, 2024Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasksβ209Jan 13, 2024Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weightsβ19Oct 9, 2022Updated 3 years ago
- a Jax/Flax inference code of StarCoderβ12Jun 12, 2023Updated 2 years ago
- SKT'22 AI Fellowship, λ₯λ¬λ κΈ°λ° νλ°± μ΄λ―Έμ§ 컬λ¬ν κΈ°μ κ°λ°β13Jun 7, 2023Updated 2 years ago
- Inference code for LLaMA models in JAXβ120May 21, 2024Updated last year
- Serving large language model with transformersβ13Oct 18, 2022Updated 3 years ago
- We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset at here https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt anβ¦β30Apr 4, 2023Updated 2 years ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solvingβ24May 1, 2024Updated last year
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333β1,143Jan 11, 2024Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β96Feb 9, 2023Updated 3 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF)β173Apr 7, 2023Updated 2 years ago
- A very-minimal command-line parserβ20Jul 28, 2025Updated 6 months ago