Danau5tin / calculator_agent_rl

Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.
17Updated this week

Alternatives and similar repositories for calculator_agent_rl:

Users that are interested in calculator_agent_rl are comparing it to the libraries listed below