OpenCompass is an LLM evaluation platform that supports a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) across 100+ datasets.
☆19Jun 25, 2024Updated last year
Alternatives and similar repositories for opencompass
Users who are interested in opencompass are comparing it to the libraries listed below.
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆144Jul 24, 2024Updated last year
- Gantt Chart using echarts☆13Mar 31, 2021Updated 4 years ago
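- This project aims to build a platform-level intelligent human-computer interaction product for digital-intelligent operations, combining an intelligent knowledge base with knowledge retrieval to meet the needs of efficient operation and high-quality service.☆19Apr 29, 2024Updated last year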
- The implementation of the IEEE S&P 2024 paper MM-BD: Post-Training Detection of Backdoor Attacks with Arbitrary Backdoor Pattern Types Us…☆16May 12, 2024Updated last year
- Assorted projects built during university, using Java/Python/JavaScript/Vert.X/SpringBoot☆10Feb 28, 2022Updated 4 years ago
- This repository contains a collection of the most influential papers, and benchmarks related to Large Language Models (LLMs) based Agent …☆49Jul 7, 2025Updated 8 months ago
- ☆17Apr 17, 2025Updated 11 months ago
- Toolkit for building prompt templates for language models☆12Sep 30, 2022Updated 3 years ago
- Code release for the paper I. Zakazov, B. Shirokikh et al. "Anatomy of Domain Shift Impact on U-Net Layers in MRI Segmentation" (MICCAI 2…☆13Jul 19, 2021Updated 4 years ago
- ☆20Jan 9, 2024Updated 2 years ago
- A simple anomaly detection algorithm for medical imaging based on multi-atlas image registration and negative log likelihood.☆19Jul 5, 2021Updated 4 years ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆64Jan 28, 2026Updated last month
- Intelligent multi-turn question answering built on a neo4j-based knowledge graph☆14May 12, 2022Updated 3 years ago
- Code and Data for ACL 2023 paper I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors☆16Jun 7, 2023Updated 2 years ago
- Metasearch engine: searchengine, metadata, metasearch☆15Jul 19, 2020Updated 5 years ago
- ☆13Mar 15, 2022Updated 4 years ago
- Computer Architecture Seminar, Fall 2020, UCAS, "CPU Design in Practice" Lab11~12 & 14~15☆22Dec 22, 2020Updated 5 years ago
- A demonstration of hybrid search with reranking using Qdrant and BGE-M3 model. A showcase of dense and sparse retrieval combined with Col…☆30Apr 4, 2025Updated 11 months ago
- Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral☆32Nov 18, 2025Updated 4 months ago
- BackTime: Backdoor Attacks on Multivariate Time Series Forecasting☆31Apr 14, 2025Updated 11 months ago
- Transfer Learning☆10Aug 3, 2018Updated 7 years ago
- Codes for Paper: From Hypergraph Energy Functions to Hypergraph Neural Networks☆23Jun 29, 2023Updated 2 years ago
- Chinese named entity recognition based on pytorch+bilstm_crf☆14Sep 13, 2022Updated 3 years ago
- Predicting brain activity from word embeddings during natural language comprehension☆23Feb 20, 2024Updated 2 years ago
- [ICLR 2026] PatchRefiner V2: Fast and Lightweight Real-Domain High-Resolution Metric Depth Estimation☆26Feb 21, 2026Updated last month
- This is a niche collection of research papers which are proven to be gradients pushing the field of Natural Language Processing, Deep Lea…☆25Nov 19, 2024Updated last year
- The project is an attempt to implement the paper Content Based Image Retrieval using Color Difference Histogram by Guang-Hai Liu et al. …☆13Dec 16, 2020Updated 5 years ago
- PyTorch implementation of CycleGAN.☆15Oct 24, 2017Updated 8 years ago
- ☆59Feb 11, 2026Updated last month
- Code for the paper "Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering" (AAAI 2021)☆30Feb 19, 2021Updated 5 years ago
- This repository is for the paper of ICSE 2023: Regression Fuzzing for Deep Learning Systems☆12Feb 21, 2024Updated 2 years ago
- A curated list of Neuro-Symbolic Visual Reasoning☆16Jul 23, 2021Updated 4 years ago
- [ICLR2026] NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents☆142Feb 27, 2026Updated last month
- ☆29Jun 17, 2024Updated last year
- [CVPR 2023] The official implementation of our CVPR 2023 paper "Detecting Backdoors During the Inference Stage Based on Corruption Robust…☆25May 25, 2023Updated 2 years ago
- A collection of DPP code and other diverse sampling algorithms☆10Nov 12, 2014Updated 11 years ago
- ☆64Dec 15, 2025Updated 3 months ago
- Djinn-Agent: A lightweight CLI tool for seamless interaction with Claude's advanced computer-use capabilities, automating complex tasks f…☆27Oct 28, 2024Updated last year
- meta-comprehensive-rag-benchmark-kdd-cup-2024 phase1 task1 rank3☆21Jun 21, 2024Updated last year