LiaoMengqi/LLM4Game24

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LiaoMengqi/LLM4Game24)

LiaoMengqi / LLM4Game24

Long CoT Fine-Tuning and Reinforcement Learning for LLMs in the Context of the 24-Point Game: A Toy Project

☆27

Alternatives and similar repositories for LLM4Game24

Users that are interested in LLM4Game24 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WenkeHuang / SDEA
View on GitHub
ICML 2024 - Self-Driven Entropy Aggregation for Byzantine-Robust Heterogeneous Federated Learning
☆10Jul 16, 2024Updated 2 years ago
adityavaishampayan / SFM_python
View on GitHub
Structure From Motion
☆10Nov 22, 2022Updated 3 years ago
kyrie-23 / linear_task_arithmetic
View on GitHub
☆12Jul 30, 2025Updated 11 months ago
ymguo21 / FedIIR
View on GitHub
Official PyTorch implementation for the ICML 2023 paper "Out-of-Distribution Generalization of Federated Learning via Implicit Invariant …
☆14Oct 31, 2023Updated 2 years ago
callous-youth / IAPTT-GM
View on GitHub
Code Repository for NeurIPS 2021 accepted paper, named "Torwards Gradient-based Bilevel Optimization with non-convex Followers and Beyond…
☆11Mar 28, 2022Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
YuchenLiu-a / byzantine-gas
View on GitHub
Official Implementation of ICML'23 "Byzantine-Robust Learning on Heterogeneous Data via Gradient Splitting".
☆15Jun 9, 2023Updated 3 years ago
SamuelHorvath / Variance_Reduced_Optimizers_Pytorch
View on GitHub
PyTorch Implementation of Variance Reduced Optimization Algorithms -- SARAH and SVRG.
☆15Jul 11, 2021Updated 5 years ago
JunjieYang97 / MRVRBO
View on GitHub
Example Code for paper "Provably Faster Algorithms for Bilevel Optimization"
☆15Dec 28, 2021Updated 4 years ago
loyiv / ITP
View on GitHub
Code of Paper: Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models
☆16Mar 17, 2026Updated 4 months ago
MingruiLiu-ML-Lab / Bilevel-Coreset-Selection-via-Regularization
View on GitHub
[NeurIPS 2023] Bilevel Coreset Selection in Continual Learning: A New Formulation and Algorithm
☆15Nov 23, 2023Updated 2 years ago
hanruiqian / Awesome-Federated-LLM-Related-Works
View on GitHub
☆17Oct 12, 2023Updated 2 years ago
desternylin / perfed
View on GitHub
☆19Feb 20, 2024Updated 2 years ago
epfml / byzantine-robust-noniid-optimizer
View on GitHub
☆18Feb 2, 2022Updated 4 years ago
jarvisyjw / HKUST_latex_PQE_2022
View on GitHub
[HKUST Template] A latex template for PhD Qualification Exam (AKA PQE), especially for ECE from 2022 and later.
☆16Jan 2, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Zhaoxian-Wu / IOS
View on GitHub
Code for paper "Byzantine-Resilient Decentralized Stochastic Optimization with Robust Aggregation Rules"
☆20Apr 19, 2024Updated 2 years ago
Macintoshxz / books
View on GitHub
☆14Sep 25, 2021Updated 4 years ago
nikolakon / Robust-Learning-from-Untrusted-Sources
View on GitHub
Code for reproducing the experiments of the ICML 2019 paper "Robust Learning from Untrusted Sources"
☆18Jul 5, 2019Updated 7 years ago
Lydia-yang / FedDPA
View on GitHub
☆26Dec 2, 2024Updated last year
Jayfeather1024 / Backdoor-Enhanced-Alignment
View on GitHub
☆24Dec 8, 2024Updated last year
ucr-optml / FedNest
View on GitHub
Federated Bilevel Optimization
☆15Jun 23, 2022Updated 4 years ago
mkantwala / DeepSeek-R1-TrainingSuite
View on GitHub
Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…
☆13Jan 29, 2025Updated last year
datawhalechina / cs336-tutorial
View on GitHub
☆15Jul 10, 2025Updated last year
swiss-ai / pretrain-code
View on GitHub
Pretraining codebase for Apertus models, based on Megatron-LM
☆21Sep 25, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
louieworth / trd
View on GitHub
Official Implementation of Trajectory-Refined Distillation
☆30Jun 9, 2026Updated last month
swiss-ai / posttraining
View on GitHub
☆18Jul 17, 2026Updated last week
IBM / SafeLoRA
View on GitHub
Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"
☆29Dec 21, 2025Updated 7 months ago
zjhellofss / kuiperbook
View on GitHub
☆17Apr 23, 2026Updated 3 months ago
hkust-spark / xg-glass-sdk
View on GitHub
One API for smart-glasses apps - camera, mic, display, audio - across Rokid, Meta Ray-Ban, Brilliant Labs Frame, RayNeo, Even Realities G…
☆35Updated this week
darglein / TinyTorch
View on GitHub
A Minimalistic Auto-Diff Optimization Framework for Teaching and Understanding Pytorch
☆27Jul 9, 2026Updated 2 weeks ago
Jiacheng-Zhu-AIML / AsymmetryLoRA
View on GitHub
Preprint: Asymmetry in Low-Rank Adapters of Foundation Models
☆40Feb 27, 2024Updated 2 years ago
machine981 / SCOPE
View on GitHub
SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting
☆28Jun 22, 2026Updated last month
Elvin-Ma / distributed_training
View on GitHub
Large-scale model distributed training technology
☆16Jul 13, 2026Updated 2 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
stellarfeline / Bert-VITS2-Cook-Book
View on GitHub
Documentation for Bert-VITS2
☆21Nov 29, 2023Updated 2 years ago
futureverse / future.mapreduce
View on GitHub
[EXPERIMENTAL] R package: future.mapreduce - Utility Functions for Future Map-Reduce API Packages
☆12Apr 7, 2026Updated 3 months ago
yedaotian9 / Lite-OPD
View on GitHub
☆36May 17, 2026Updated 2 months ago
HenrikBengtsson / port4me
View on GitHub
port4me - Get the Same, Personal, Free TCP Port over and over
☆13Mar 1, 2024Updated 2 years ago
moodymudskipper / pkg
View on GitHub
Package Objects
☆12Jun 5, 2025Updated last year
UchidaMizuki / dibble
View on GitHub
Dimensional Data Frames
☆13Jan 17, 2026Updated 6 months ago
enricoschumann / tsdb
View on GitHub
A terribly-simple data base for time series
☆14Mar 25, 2026Updated 4 months ago