RAIVNLab / mnms

m&ms: A Benchmark to Evaluate Tool-Use for multi-step multi-modal tasks
33Updated last month

Related projects

Alternatives and complementary repositories for mnms