thomasgauthier / LLM-self-playView external linksLinks
Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)
☆29Mar 1, 2024Updated last year
Alternatives and similar repositories for LLM-self-play
Users that are interested in LLM-self-play are comparing it to the libraries listed below
Sorting:
- ☆13May 25, 2023Updated 2 years ago
- MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series☆17Sep 5, 2025Updated 5 months ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,234May 8, 2024Updated last year
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"☆41Sep 24, 2024Updated last year
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆17Oct 15, 2024Updated last year
- Just a subfolder of https://github.com/siliconflow/onediff☆23Jun 24, 2024Updated last year
- 📰 Computing the information content of trained neural networks☆22Oct 8, 2021Updated 4 years ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,407Apr 11, 2024Updated last year
- quick playground to animate pippin☆14Nov 11, 2024Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆180May 2, 2024Updated last year
- ☆28Apr 3, 2025Updated 10 months ago
- ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities☆35Dec 14, 2022Updated 3 years ago
- In this model I have created a basic AI chatbot Interface with External plugin abilities; with visual basic An Interface AI_Contracts en…☆10May 2, 2021Updated 4 years ago
- A tool to paste Excel ranges to Reddit☆11Sep 20, 2025Updated 4 months ago
- The first OpenSource Mafia Bot!☆10Oct 5, 2023Updated 2 years ago
- RL algorithm for stock trading with multiple reward functions☆11Apr 21, 2024Updated last year
- Open-source project for converting the Bible into JSON for native languages. A collaborative platform for digitizing sacred texts, and ma…☆10May 14, 2024Updated last year
- Use MobileNet SSD and openCV to detect and count car on road☆12Jan 13, 2020Updated 6 years ago
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- Comparative Study and Implementation of Five Factor Model and Myers-Briggs Type Indicator Model☆11Sep 28, 2023Updated 2 years ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆35Mar 19, 2024Updated last year
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- DNH Werewolf Discord bot☆13Dec 19, 2024Updated last year
- ☆10Jul 21, 2019Updated 6 years ago
- 李鲁鲁老师的 Copilot-Python 学习。和ChatGPT等大语言模型协同进化。☆10Jun 3, 2025Updated 8 months ago
- ☆12Aug 6, 2024Updated last year
- A comprehensive ELT pipeline for analyzing passenger satisfaction data. Features a modern data architecture with Apache Airflow for extra…☆12Oct 5, 2025Updated 4 months ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- ☆16Sep 17, 2024Updated last year
- 小鸡词典🐤的Alfred🎩插件 咯咯咯☆11Apr 19, 2023Updated 2 years ago
- FinanceGPT-B☆10Mar 26, 2024Updated last year
- Cookiecutter template for making a cog for Red.☆12Jun 18, 2024Updated last year
- Personalized all-purpose AI assistance platform based on hierarchical cooperative multi-agent framework which utilizes websocket connecti…☆39Aug 11, 2024Updated last year
- This project is focus on stock prediction,our goal is implementing one trading framework using DRL with LSTM.☆11Jun 1, 2018Updated 7 years ago
- Vietnamese GPT-J API service deployed with Docker & Helm chart☆10Dec 11, 2022Updated 3 years ago
- Dataset and codes for SEntFiN☆10May 31, 2023Updated 2 years ago