Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)
☆29Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for LLM-self-play
Users that are interested in LLM-self-play are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,239May 8, 2024Updated last year
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- a fast implementation of BM25☆10Sep 15, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of "Towards Understanding Mixture of Experts in Deep Learning", NeurIPS 2022☆10Jan 6, 2023Updated 3 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Apr 2, 2024Updated 2 years ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,407Apr 11, 2024Updated 2 years ago
- lime-ner: extending LIME for Named Entity Recognition☆10Aug 15, 2018Updated 7 years ago
- This repo is reproduction resources for linear alignment paper, still working☆18May 19, 2024Updated last year
- Lazy one's Flask application☆11Aug 13, 2016Updated 9 years ago
- ☆13May 25, 2023Updated 2 years ago
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆17Oct 15, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- ☆11Jan 6, 2024Updated 2 years ago
- ☆19Jun 10, 2024Updated last year
- Unofficial Implementation of Evolutionary Model Merging☆42Mar 28, 2024Updated 2 years ago
- some mixture of experts architecture implementations☆27Mar 22, 2024Updated 2 years ago
- Low-Rank adapter extraction for fine-tuned transformers models☆181May 2, 2024Updated 2 years ago
- Official implementation of AAAI22 paper "ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compr…☆10Mar 1, 2024Updated 2 years ago
- Federated Learning - PyTorch☆15Jun 27, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Minsk in VB☆11May 10, 2022Updated 3 years ago
- ☆11Jul 30, 2024Updated last year
- ☆13Apr 17, 2024Updated 2 years ago
- The system enables sophisticated coordination of multiple drones through natural language commands, visual inputs, and real-time environm…☆16Dec 15, 2025Updated 4 months ago
- It checks how secure the program you made is and shows how vulnerable your program is.☆20Apr 20, 2017Updated 9 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리☆17Jan 3, 2024Updated 2 years ago
- forza-telemetry-kafka-producer☆10May 2, 2022Updated 4 years ago
- ☆23Oct 30, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- We conduct a preregistered experiment to investigate whether fact checks provided by a large language model can serve as an effective mis…☆13Dec 14, 2024Updated last year
- Automata Theory. Building a RegExp machine☆12May 10, 2019Updated 6 years ago
- Rust FTL + WebRTC live streaming software.☆13Mar 12, 2022Updated 4 years ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness☆14Nov 10, 2025Updated 5 months ago
- A software based Oscilloscope / Vectorscope☆16Mar 28, 2020Updated 6 years ago
- Open-source project for converting the Bible into JSON for native languages. A collaborative platform for digitizing sacred texts, and ma…☆10May 14, 2024Updated last year