LiaoMengqi / LLM4Game24

Long CoT Fine-Tuning and Reinforcement Learning for LLMs in the Context of the 24-Point Game: A Toy Project
13 · Updated last month

Alternatives and similar repositories for LLM4Game24:

Users who are interested in LLM4Game24 are comparing it to the repositories listed below.