BIBench:数据分析领域LLM评测基准
☆23Mar 2, 2024Updated 2 years ago
Alternatives and similar repositories for BIBench
Users that are interested in BIBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Professional Wargaming LLM Toolbox☆28Jul 9, 2025Updated 11 months ago
- ☆10Jul 5, 2023Updated 2 years ago
- ☆12Jun 23, 2023Updated 2 years ago
- ☆11Nov 17, 2023Updated 2 years ago
- 论文一体化写作神器(Python)☆17Apr 11, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆24Apr 25, 2023Updated 3 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- 强化学习的数学原理代码练习☆19Apr 17, 2024Updated 2 years ago
- ICML'20: Intrinsic Reward Driven Imitation Learning via Generative Model☆15Nov 5, 2021Updated 4 years ago
- ☆13Oct 26, 2020Updated 5 years ago
- Generalizable Implicit Hate Speech Detection using Contrastive Learning (COLING 2022)☆14Oct 9, 2022Updated 3 years ago
- 双十一淘宝秒杀☆13Nov 10, 2018Updated 7 years ago
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- Code and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models" [NeurIPS 2024]☆35Sep 8, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language mo…☆19Mar 19, 2025Updated last year
- ☆21Dec 24, 2024Updated last year
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- SciGen☆24Aug 10, 2021Updated 4 years ago
- StereoNet: Guided Hierarchical Refinement for Real-Time Edge-Aware Depth prediction model in pytorch. ECCV2018☆12Aug 14, 2019Updated 6 years ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆157Dec 24, 2024Updated last year
- ☆34Aug 26, 2025Updated 9 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆33Jul 25, 2025Updated 10 months ago
- CCL2022 领域问答库构建测评☆20Oct 31, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ⚙️ Lightweight & smart Bun & Browser configuration loader.☆16Updated this week
- 嵌套命名实体识别 Nested NER☆19Nov 14, 2021Updated 4 years ago
- PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency☆20Mar 29, 2024Updated 2 years ago
- An end-to-end benchmark suite of multi-modal DNN applications for system-architecture co-design☆22Dec 13, 2024Updated last year
- Universal Robustness Evaluation Toolkit (for Evasion)☆32Sep 17, 2025Updated 8 months ago
- The Process Intelligence Tool for Linux☆35Mar 10, 2026Updated 3 months ago
- Mac 上自动捕获并打开 TFS 链接的小工具☆32Dec 9, 2018Updated 7 years ago
- using lear to do ner extraction☆29Mar 13, 2022Updated 4 years ago
- "Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?"☆39Nov 13, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆30Apr 16, 2026Updated last month
- walterra's collections of helpers for agentic coding☆34Mar 23, 2026Updated 2 months ago
- 时间关键词正则提取以及标准化☆20Dec 19, 2021Updated 4 years ago
- A multi-element multi-domain dataset for Aspect-Based Sentiment Analysis☆25Jul 5, 2023Updated 2 years ago
- Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]☆288Jul 28, 2025Updated 10 months ago
- ☆12Mar 29, 2019Updated 7 years ago
- Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Langu…☆39Dec 18, 2023Updated 2 years ago