nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆19Updated 5 months ago
Alternatives and similar repositories for NexusBench
Users who are interested in NexusBench are comparing it to the libraries listed below.
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 6 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- ☆20Updated 11 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 6 months ago
- ☆9Updated last month
- Training hybrid models for dummies.☆21Updated 4 months ago
- ☆16Updated 2 months ago
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 8 months ago
- ☆49Updated 6 months ago
- ☆37Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- XmodelLM☆39Updated 6 months ago
- ☆13Updated 5 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 7 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Repo hosting code and materials related to speeding up LLM inference using token merging.☆36Updated last year
- Implementation of https://arxiv.org/pdf/2312.09299☆20Updated 10 months ago
- Companion code to https://arxiv.org/abs/2409.03797v2☆10Updated last week
- Official Repository for Task-Circuit Quantization☆20Updated last month
- ☆19Updated 2 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆24Updated last month
- ☆41Updated 5 months ago
- ☆35Updated last month
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆33Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 3 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 4 months ago
- Using modal.com to process FineWeb-edu data☆20Updated last month
- ☆15Updated last month
- Modified Beam Search with periodical restart☆12Updated 8 months ago
- Never forget anything again! Combine AI and intelligent tooling for a local knowledge base to track, catalogue, annotate, and plan for you…☆36Updated last year