openmedlab/PULSE-EVAL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/openmedlab/PULSE-EVAL)

openmedlab / PULSE-EVAL

PULSE-EVAL

☆24

Alternatives and similar repositories for PULSE-EVAL

Users that are interested in PULSE-EVAL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

stanleylsx / text_embedding
View on GitHub
一个用于训练句子embedding的工具，支持Cosent以及Simcse、infonce
☆24Jun 17, 2025Updated last year
openmedlab / PULSE
View on GitHub
PULSE: Pretrained and Unified Language Service Engine
☆498Dec 26, 2023Updated 2 years ago
nick7nlp / Counting-Stars
View on GitHub
Counting-Stars (★)
☆83Nov 24, 2025Updated 8 months ago
zonghui0228 / LLM-Chinese-NMLE
View on GitHub
中国执业医师、药师、护士资格考试数据集和ChatGPT评估
☆16Mar 13, 2026Updated 4 months ago
sangHa0411 / Llama-Instruction-Tuning
View on GitHub
☆10Dec 28, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
amazon-science / llm-open-domain-table-reasoner
View on GitHub
Official implementation of OpenTab (ICLR2024)
☆14Mar 27, 2024Updated 2 years ago
listentm / CROWDSELECT
View on GitHub
We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…
☆20May 20, 2025Updated last year
MAGIC-AI4Med / MedRBench
View on GitHub
[Nature Communications] The official code for "Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases".
☆70Nov 7, 2025Updated 8 months ago
FreedomIntelligence / CMB
View on GitHub
CMB, A Comprehensive Medical Benchmark in Chinese
☆249Mar 27, 2025Updated last year
Mihir3009 / LogicBench
View on GitHub
LogicBench is a natural language question-answering dataset consisting of 25 different reasoning patterns spanning over propositional, fi…
☆40May 2, 2024Updated 2 years ago
dbamman / lrec2020-coref
View on GitHub
Code and data to support Bamman et al. (2020), "A Dataset of Literary Coreference" (LREC)
☆11Dec 8, 2022Updated 3 years ago
MangoKiller / SimOAR_OAR
View on GitHub
☆11Nov 8, 2023Updated 2 years ago
facebookresearch / dual-system-for-visual-language-reasoning
View on GitHub
Github repo for Peifeng's internship project
☆13Nov 7, 2023Updated 2 years ago
merlresearch / SMART
View on GitHub
Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"
☆11Aug 10, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
GraphPKU / CoI
View on GitHub
Chain of Images for Intuitively Reasoning
☆10Nov 29, 2023Updated 2 years ago
tabzhangjx / MixupExplainer
View on GitHub
☆10Jun 11, 2023Updated 3 years ago
Cohere-Labs-Community / iterative-data-selection
View on GitHub
☆30Nov 5, 2024Updated last year
wcy405100 / TurnoverRatio_Prediction_Pytorch
View on GitHub
使用深度学习模型LSTM和ConvLSTM结合Attention，对金融衍生品的成交持仓比指标进行预测
☆19Jan 7, 2022Updated 4 years ago
mega002 / qdmr-based-question-generation
View on GitHub
The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition".
☆12Oct 18, 2021Updated 4 years ago
streamlit / example-app-interactive-table
View on GitHub
☆18Jan 12, 2024Updated 2 years ago
NJUNLP / Hallu-PI
View on GitHub
The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …
☆11Sep 27, 2024Updated last year
snu-larr / ibc_official
View on GitHub
Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)
☆10Jul 6, 2023Updated 3 years ago
Hoyyyaard / NavGPT
View on GitHub
☆10Nov 16, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
openmedlab / CITE
View on GitHub
[MICCAI'23] Text-guided Foundation Model Adaptation for Pathological Image Classification
☆141Dec 26, 2023Updated 2 years ago
michael-wzhu / PromptCBLUE
View on GitHub
PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese
☆394Jan 23, 2024Updated 2 years ago
Achillesxu / SpliteDahua-HaikangStreamToES
View on GitHub
get the media stream from Dahua/Haikang IPC SDK, and demux the stream to vedio and audio ES
☆14Nov 15, 2015Updated 10 years ago
zzh-SJTU / CRT-QA
View on GitHub
The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…
☆13May 19, 2025Updated last year
OpenClassrooms-Student-Center / Project-10-Bank-API
View on GitHub
☆11Jul 31, 2024Updated last year
omigeft / Chinese-Clinical-Terminology-Standardization-Task
View on GitHub
基于LLM实现CHIP2021-Task3中文临床术语标准化任务，准确率约70%。
☆16Dec 16, 2024Updated last year
ErxinYu / CoSafe-Dataset
View on GitHub
☆13Nov 12, 2024Updated last year
openmedlab / MedLSAM
View on GitHub
MedLSAM: Localize and Segment Anything Model for 3D Medical Images
☆522Apr 30, 2024Updated 2 years ago
IRMBed / IRMBed
View on GitHub
This is the project for IRM methods
☆12Sep 13, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
navidmdn / logic_based_qa
View on GitHub
Code and notebooks and data for the paper "Domain Specific Question Answering Over Knowledge Graphs Using Logical Programming and Large L…
☆12Jan 23, 2024Updated 2 years ago
lucataco / cog-playground-v2.5-1024px-aesthetic
View on GitHub
Cog wrapper for playgroundai/playground-v2.5-1024px-aesthetic
☆17Nov 25, 2024Updated last year
xiatingyu / SFT-DataSelection-at-scale
View on GitHub
☆34Feb 9, 2025Updated last year
caspian-yez / libgbt28181
View on GitHub
I don't want to maintain this project, the code probably won't compile or run. Archived.
☆13Feb 25, 2024Updated 2 years ago
BorealisAI / llm-pddl-planning
View on GitHub
☆18Feb 20, 2025Updated last year
math-eval / MathEval
View on GitHub
MathEval is a benchmark dedicated to the holistic evaluation on mathematical capacities of LLMs.
☆87Nov 15, 2024Updated last year
wangcunxiang / Graph-aS-Tokens
View on GitHub
☆10Nov 29, 2024Updated last year