汽车行业中文大模型测评基准,基于多轮开放式问题的细粒度评测
☆38Dec 26, 2023Updated 2 years ago
Alternatives and similar repositories for SuperCLUE-Auto
Users that are interested in SuperCLUE-Auto are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆40May 31, 2025Updated 11 months ago
- [IROS 2024] PhysORD: A Neuro-Symbolic Approach for Physics-infused Motion Prediction in Off-road Driving☆22Feb 15, 2026Updated 3 months ago
- Dataset and codes for SEntFiN☆10May 31, 2023Updated 2 years ago
- OpenMDW License☆20May 19, 2026Updated last week
- FinRAD: Financial Readability Assessment Dataset - 13,000+ Definitions of Financial Terms for Measuring Readability☆15Nov 2, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [CAC2023] Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation☆11Nov 28, 2024Updated last year
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models☆118Jun 12, 2025Updated 11 months ago
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- ☆20Jul 28, 2022Updated 3 years ago
- 石油领域大语言模型☆18Feb 22, 2024Updated 2 years ago
- ☆21Aug 19, 2024Updated last year
- The official codebase of FineAction dataset. We will update the data and code of our FineAction.☆24Apr 10, 2025Updated last year
- ☆10Aug 14, 2019Updated 6 years ago
- [ACL 2026] A large-scale longitudinal study on robust and fair evaluation of LLMs — 200K+ generative questions across 13 disciplines☆37May 12, 2026Updated 2 weeks ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- TensorFlow: learn and practice☆11Aug 30, 2018Updated 7 years ago
- 使用分布式技术结合前端Vue框架,后端mycat+mybatis+springboot框架实现金融分布式交易系统,包括:开户,登录,信息查询,明细查询,一对一转账,计息,对账等金融交易的核心功能。☆11Sep 3, 2020Updated 5 years ago
- 面向大模型的民族文化数据集☆12May 26, 2025Updated 11 months ago
- ☆11Nov 21, 2024Updated last year
- A curated list of resources dedicated to word segmentation☆12Jan 9, 2019Updated 7 years ago
- 第19届“花旗杯”金融创新应用大赛参赛作品,以BERT模型为核心,组合实体抽取和消歧、情绪分析两个下游任务,抽取出非结构化文本中的债券实体和公司实体,并分别对相应实体进行对应文本的情绪分析,为债券违约提供参考,并将模型封装部署到了Web端。项目已经部署到:http://12…☆12Jun 5, 2024Updated last year
- Show summary of a large number of URLs in a Jupyter Notebook☆19Apr 8, 2026Updated last month
- 🍏专门为 2024 书生·浦语大模型挑战赛 (春季赛) 准备的 Repo🍎收录了赫萝相关的微调源码☆12Sep 20, 2024Updated last year
- Go implementation of filesystem-level locking.☆13Feb 12, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 2022年华为软件精英赛初赛☆11Apr 2, 2022Updated 4 years ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- CCL2020 第二届“小牛杯”幽默计算——情景喜剧笑点识别☆13Sep 29, 2020Updated 5 years ago
- 首届社交群体智能算法大赛 【赛题1:社交媒体舆论场虚假账号检测】第三名(0.8248)方案☆12May 30, 2024Updated last year
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆33Jul 7, 2024Updated last year
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward☆69Aug 10, 2025Updated 9 months ago
- Industrial-level evaluation benchmarks for Coding LLMs in the full life-cycle of AI native software developing.企业级代码大模型评测体系,持续开放中☆110Apr 28, 2025Updated last year
- vLoong能源AI挑战赛——异常检测赛 第五名开源代码:基于lgb单模型☆14Nov 1, 2022Updated 3 years ago
- BERT&RoBERTa预训练代码,tensorflow和torch两种版本实现☆13Feb 8, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Instruction Following Eval☆17Jan 16, 2025Updated last year
- 2023全球智能汽车AI挑战赛——赛道一:AI大模型检索问答, 75+ baseline☆61Dec 7, 2023Updated 2 years ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated last year
- Zero-Cost Whole-Body Teleoperation for Mobile Manipulation☆12Mar 4, 2025Updated last year
- Code for acl2017 paper "An unsupervised neural attention model for aspect extraction"☆27Oct 14, 2018Updated 7 years ago
- GAOGAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models.☆43Jan 7, 2025Updated last year
- Source code of our EMNLP 2022 paper: Co-guiding Net: Achieving Mutual Guidances between Multiple Intent Detection and Slot Filling via He…☆12Nov 14, 2022Updated 3 years ago