PULSE-EVAL
☆24Jan 12, 2024Updated 2 years ago
Alternatives and similar repositories for PULSE-EVAL
Users that are interested in PULSE-EVAL are comparing it to the libraries listed below
Sorting:
- 用大语言模型做结构化工具☆33Mar 12, 2024Updated last year
- Counting-Stars (★)☆83Nov 24, 2025Updated 3 months ago
- Official implementation of OpenTab (ICLR2024)☆13Mar 27, 2024Updated last year
- ☆10Dec 28, 2023Updated 2 years ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- ☆16Aug 23, 2023Updated 2 years ago
- We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated 9 months ago
- Cog wrapper for playgroundai/playground-v2.5-1024px-aesthetic☆17Nov 25, 2024Updated last year
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆18Nov 15, 2024Updated last year
- Benchmarking LLM Inference Speeds☆13Feb 4, 2026Updated last month
- Instruction Following Eval☆15Jan 16, 2025Updated last year
- ☆22Jan 14, 2025Updated last year
- CCKS 2022 通用信息抽取☆13May 1, 2022Updated 3 years ago
- The free energy principle☆18Feb 16, 2025Updated last year
- 中文原生多层次文生视频测评基准☆18Jul 8, 2024Updated last year
- get the media stream from Dahua/Haikang IPC SDK, and demux the stream to vedio and audio ES☆13Nov 15, 2015Updated 10 years ago
- Joint learning of object and action detectors☆15Nov 5, 2019Updated 6 years ago
- ☆18Nov 30, 2025Updated 3 months ago
- Very simple and crude HTTP server written in C☆14Jul 29, 2018Updated 7 years ago
- Code for "Moving on from OntoNotes: Coreference Resolution Model Transfer" and "Incremental Neural Coreference Resolution in Constant Mem…☆17Mar 11, 2022Updated 3 years ago
- 这是一个基于OpenCompass的模型评测系统,该系统提供了前端页面UI以方便用户自助开展评测工作。☆25Aug 25, 2025Updated 6 months ago
- [IJCAI'19] Code for "Self-attentive Biaffine Dependency Parsing"☆16Jun 13, 2019Updated 6 years ago
- “悟道”源代码☆21Aug 24, 2021Updated 4 years ago
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- ☆18May 28, 2021Updated 4 years ago
- The official repository of the paper "The Digital Cybersecurity Expert: How Far Have We Come?" presented in IEEE S&P 2025☆24May 21, 2025Updated 9 months ago
- Source code of ACL2022 "Headed-Span-Based Projective Dependency Parsing" and "Combining (second-order) graph-based and headed-span-based …☆16Jan 12, 2023Updated 3 years ago
- The source code for ACL 2021 paper☆19Oct 9, 2021Updated 4 years ago
- ☆21Aug 19, 2024Updated last year
- Source code of COLING 2022 paper "A Contrastive Cross-channel Data Augmentation Framework for Aspect-based Sentiment Analysis"☆22Feb 18, 2023Updated 3 years ago
- Reproducible Language Agent Research☆34Jun 25, 2025Updated 8 months ago
- Task Complexity Classifier using Transformer-based NLP model based on Bloom's Taxonomy☆34Aug 18, 2025Updated 6 months ago
- VulnHeist is an Automated Penetration Testing Suite 🔖 that streamlines vulnerability scanning 🔍 and exploitation 💥 using Nmap 🌐 and …☆36Mar 22, 2025Updated 11 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆26Dec 23, 2024Updated last year
- LLM+RAG for QA☆22Jan 15, 2024Updated 2 years ago
- Code for the EMNLP 2020 paper "Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks"☆25Jul 3, 2021Updated 4 years ago
- 📑MLCE-为大模型医学水平测试而总结的中文医学考试数据集☆27Jul 7, 2024Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆109Apr 10, 2024Updated last year
- AI-PP3 is a command-line tool that uses artificial intelligence to analyze RAW photos and generate optimized processing profiles (PP3 fil…☆45Aug 11, 2025Updated 6 months ago