BIBench:数据分析领域LLM评测基准
☆22Mar 2, 2024Updated 2 years ago
Alternatives and similar repositories for BIBench
Users that are interested in BIBench are comparing it to the libraries listed below
Sorting:
- Parses and Plots Illumina SAV files☆13Jan 30, 2019Updated 7 years ago
- ⚙️ Lightweight & smart Bun & Browser configuration loader.☆15Feb 27, 2026Updated last week
- ☆10Jul 5, 2023Updated 2 years ago
- ☆11Jul 21, 2024Updated last year
- Professional Wargaming LLM Toolbox☆20Jul 9, 2025Updated 7 months ago
- This repository includes data on areas considered high-risk for COVID-19 in China from November 24, 2022 to December 23, 2022☆10Jan 9, 2023Updated 3 years ago
- ☆48Updated this week
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP…☆14Jun 27, 2025Updated 8 months ago
- A user-friendly interface built on top of Thinking Machines Tinker API that lets you fine-tune LLMs, chat with your trained model, and de…☆27Jan 31, 2026Updated last month
- ☆12Jun 23, 2023Updated 2 years ago
- ☆16Jan 20, 2025Updated last year
- Implementation of Decision Stacks: Flexible RL via Modular Generative Models [NeurIPS 2023]☆12Jun 27, 2023Updated 2 years ago
- ☆11Feb 4, 2021Updated 5 years ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated 10 months ago
- 微聚,专业的数据标注,采集平台☆13Jun 19, 2018Updated 7 years ago
- 基于SpringBoot2的个性化推荐教育学习网站。☆13Apr 8, 2018Updated 7 years ago
- A react-typescript component for Plotly.JS graphs.☆15Feb 29, 2020Updated 6 years ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆21Oct 16, 2025Updated 4 months ago
- A chaos engineering library for Elixir inspired by Netflix's Chaos Monkey☆25Feb 7, 2026Updated 3 weeks ago
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆65Oct 16, 2024Updated last year
- minify html with CSS and JS☆14Nov 15, 2019Updated 6 years ago
- Companion code to https://arxiv.org/abs/2409.03797v2☆19Sep 18, 2025Updated 5 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 4 months ago
- A package dedicated for running benchmark agreement testing☆17Sep 18, 2025Updated 5 months ago
- Reusable components for AI coding agents: skills, subagents, MCP servers, and extensions.☆29Feb 26, 2026Updated last week
- ☆15Feb 23, 2026Updated last week
- Multi-agent coordination for Pi - presence, messaging, file reservations☆53Feb 27, 2026Updated last week
- Automated skill creation workshop for Claude Code☆38Nov 14, 2025Updated 3 months ago
- 抓取东方财富行业研报,停止更新☆12Jan 31, 2024Updated 2 years ago
- ☆32Aug 26, 2025Updated 6 months ago
- The official implementation of the paper “Anchored Supervised Fine-Tuning”☆30Feb 12, 2026Updated 3 weeks ago
- Maze algorithms implemented in JavaScript - many maze generators and tiling patterns☆16Oct 2, 2022Updated 3 years ago
- ☆12Mar 29, 2019Updated 6 years ago
- Cassandra's Secondary Index implementation for FHIR® – Fast Healthcare Interoperability Resources. The index provides near real-time sear…☆16Aug 22, 2016Updated 9 years ago
- 3D Print - Hex surface vase (spiral print)☆17Mar 19, 2023Updated 2 years ago
- LLMs for Wargames☆16Sep 21, 2024Updated last year
- Face Recognition using RPI5 Hailo8L AI Accelerator KIT☆20Aug 30, 2024Updated last year
- Fork of the Google Code Ultra-Finance Project http://code.google.com/p/ultra-finance/☆30Mar 3, 2014Updated 12 years ago