Nexusflow function call, tool use, and agent benchmarks.
☆30Dec 13, 2024Updated last year
Alternatives and similar repositories for NexusBench
Users that are interested in NexusBench are comparing it to the libraries listed below
Sorting:
- This is a simple guide to help you build an Anthropic Claude Sonnet 3.5 chatbot interface with Gradio☆12Jun 23, 2024Updated last year
- ☆15Dec 3, 2024Updated last year
- ☆19Aug 1, 2025Updated 7 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆34Sep 1, 2025Updated 6 months ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated 10 months ago
- ☆96Dec 6, 2024Updated last year
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆38Apr 24, 2025Updated 10 months ago
- Config files for my GitHub profile.☆20Jan 27, 2026Updated last month
- The DPAB-α Benchmark☆32Jan 15, 2025Updated last year
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control.☆25Nov 29, 2025Updated 3 months ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Jul 28, 2024Updated last year
- o1 Chain of Thought Examples☆33Oct 4, 2024Updated last year
- ☆36Mar 20, 2024Updated last year
- ☆12Oct 28, 2021Updated 4 years ago
- Linux系统与网络管理课程作业收集 http://sec.cuc.edu.cn☆10Mar 14, 2022Updated 3 years ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Concurrency library☆17Oct 13, 2024Updated last year
- Official repository for K-EXAONE built by LG AI Research☆69Feb 6, 2026Updated 3 weeks ago
- ☆11Dec 23, 2024Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆96May 16, 2025Updated 9 months ago
- 🗂️ Project tempfiles backend server!!☆10Apr 29, 2024Updated last year
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago
- An active inference model of Lacanian psychoanalysis☆15Jun 7, 2025Updated 8 months ago
- CANdle - a library for using USB-FDCAN dongle and communicating with md80 drives☆15Sep 15, 2025Updated 5 months ago
- Python Inference Script(PyIS)☆19Aug 30, 2022Updated 3 years ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms☆14Feb 2, 2026Updated last month
- ☆13Updated this week
- ☆12Jun 19, 2024Updated last year
- Pre-built Docker image for deploying OpenClaw on DigitalOcean App Platform☆35Feb 10, 2026Updated 3 weeks ago
- Payment rails made right. Award winning developer experience.☆28Jan 27, 2026Updated last month
- KaliGPT is a production-ready, AI-powered penetration testing assistant designed specifically for Kali Linux. It reads and understands te…☆25Dec 11, 2025Updated 2 months ago
- ☆16Feb 22, 2025Updated last year
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Simples Projektmanagement Tool für die Zusammenarbeit mit OpenClaw☆36Feb 7, 2026Updated 3 weeks ago
- Models for packages and the resources they contain.☆14Mar 10, 2024Updated last year
- Develop C++/CUDA extensions with PyTorch like Python scripts☆10Jan 7, 2026Updated last month
- Code, figure, and data repository for: Haase et al. (2023) Nature. https://doi.org/10.1038/s41586-023-06400-1☆11Aug 10, 2023Updated 2 years ago