TencentLLMEval is a comprehensive benchmark for human evaluation of large models; it includes task trees, standards, data verification methods, and more.
☆41Mar 16, 2025Updated last year
Alternatives and similar repositories for TencentLLMEval
Users that are interested in TencentLLMEval are comparing it to the libraries listed below.
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows☆15Oct 4, 2024Updated last year
- The official repo of INF-34B models trained by INF Technology.☆34Jul 25, 2024Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated 2 years ago
- ☆12Jul 7, 2021Updated 4 years ago
- [IEEE TCSVT'24] Study of Subjective and Objective Naturalness Assessment of AI-Generated Images☆38Apr 29, 2026Updated last month
- Fusion Modality Approaches for sentiment analysis and emotion recognition task.☆12Feb 5, 2021Updated 5 years ago
- The Soft Cosine Measure system developed for the ARQMath-3 shared task evaluation of math information retrieval systems☆13Sep 8, 2022Updated 3 years ago
- QA Server Based Chinese CQA Site☆12Jul 14, 2021Updated 4 years ago
- The repository of CLEME (EMNLP 2023) and CLEME2.0 (ACL 2025)☆12May 17, 2025Updated last year
- Samplers, samplers, samplers☆10Oct 31, 2016Updated 9 years ago
- ☆17May 1, 2025Updated last year
- chemical master equation solver☆16May 2, 2018Updated 8 years ago
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆12Sep 28, 2023Updated 2 years ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆37May 26, 2025Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- 🎶 Open-source DAW to collaborate on music using Git.☆15Apr 6, 2024Updated 2 years ago
- Chinese generation tasks with the multilingual denoising pre-trained model MBart☆11May 27, 2021Updated 5 years ago
- A survey of large language model training and serving☆37Aug 4, 2023Updated 2 years ago
- Common deep learning utils.☆18Nov 1, 2023Updated 2 years ago
- Implementation of the Snappy compression algorithm as a RoCC accelerator☆12Jul 29, 2019Updated 6 years ago
- Hung-Yi Lee Linear Algebra 2018 Fall Homework☆10May 5, 2019Updated 7 years ago
- REF//biendata.com/competition/CCKS2018_3/make-submission/☆17Aug 12, 2018Updated 7 years ago
- Code and dataset for 'Contrastive Aligned Joint Learning for Multilingual Summarization'☆13Mar 24, 2022Updated 4 years ago
- ☆15Jul 17, 2020Updated 5 years ago
- Code for running the experiments in Deep Subjecthood: Higher Order Grammatical Features in Multilingual BERT☆17Aug 15, 2023Updated 2 years ago
- Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction☆10May 25, 2022Updated 4 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- This is the official PyTorch implementation of our NeurIPS 2021 paper: "SalKG: Learning From Knowledge Graph Explanations for Commonsense…☆13Jun 9, 2022Updated 4 years ago
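- ✨ Personal Python code library (partial). 🌈 Covers Python basics, API calls for mainstream natural language processing tools, hands-on Keras & TensorFlow, data analysis, web scraping, and more☆12Mar 17, 2022Updated 4 years ago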
- [EMNLP 2023 Demo] "CLEVA: Chinese Language Models EVAluation Platform"☆64May 16, 2025Updated last year
- Feeling confused about super alignment? Here is a reading list☆43Jan 9, 2024Updated 2 years ago
- Analysis of the Flickr dataset☆19Jan 17, 2018Updated 8 years ago
- [ACM MM 2025] LMM4Edit: Benchmarking and Evaluating Multimodal Image Editing with LMMs☆16Apr 16, 2026Updated 2 months ago
- Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters☆17May 30, 2024Updated 2 years ago
- Named entity recognition for Classical Chinese, based on BILSTM+CRF. Recognized entity types include people, places, organizations, times, and so on.☆10Jan 19, 2021Updated 5 years ago
- ☆31May 15, 2026Updated last month
- Predict whether a protein sequence and a drug SMILES string will interact with each other☆13Apr 25, 2019Updated 7 years ago
- Rank2 solution (no-BERT) for 2019 Language and Intelligence Challenge - DuReader2.0 Machine Reading Comprehension.☆126Nov 1, 2019Updated 6 years ago
- ☆33Mar 13, 2024Updated 2 years ago