TencentLLMEval is a comprehensive benchmark for human evaluation of large models that includes task trees, evaluation standards, data verification methods, and more.
☆41Mar 16, 2025Updated last year
Alternatives and similar repositories for TencentLLMEval
Users who are interested in TencentLLMEval are comparing it to the repositories listed below.
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows☆14Oct 4, 2024Updated last year
- A collection of ChatGPT-related resources☆57Apr 24, 2023Updated 2 years ago
- The official repo of INF-34B models trained by INF Technology.☆34Jul 25, 2024Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- ☆12Jul 7, 2021Updated 4 years ago
- Zhihu topic tree visualization☆15Apr 11, 2019Updated 6 years ago
- ☆11Mar 22, 2020Updated 6 years ago
- Defeasible Natural Language Inference☆13Dec 4, 2020Updated 5 years ago
- CommonsenseQA☆10Mar 28, 2020Updated 6 years ago
- The Soft Cosine Measure system developed for the ARQMath-3 shared task evaluation of math information retrieval systems☆13Sep 8, 2022Updated 3 years ago
- Example of displaying data in a collapsible tree using D3.js☆26May 17, 2025Updated 10 months ago
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)☆11Sep 17, 2023Updated 2 years ago
- [COLING22] Text-to-Text Extraction and Verbalization of Biomedical Event Graphs☆10Nov 5, 2022Updated 3 years ago
- Samplers, samplers, samplers☆10Oct 31, 2016Updated 9 years ago
- Graph Convolutional Networks (GCNs)☆10Nov 29, 2017Updated 8 years ago
- Implementation following the paper "Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks"☆12Nov 9, 2018Updated 7 years ago
- ☆17May 1, 2025Updated 10 months ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆38May 26, 2025Updated 10 months ago
- Chinese generation tasks with MBart, a multilingual denoising pre-trained model☆11May 27, 2021Updated 4 years ago
- Implementation of the Snappy compression algorithm as a RoCC accelerator☆12Jul 29, 2019Updated 6 years ago
- Code for PyMTL Tutorial @ ISCA 2019☆11Jun 22, 2019Updated 6 years ago
- Machine translation data processing tools☆10Apr 29, 2024Updated last year
- ☆13Sep 5, 2021Updated 4 years ago
- Code and dataset for 'Contrastive Aligned Joint Learning for Multilingual Summarization'☆13Mar 24, 2022Updated 4 years ago
- Determine the polarity of amazon fine food reviews using ULMFiT, BERT, XLNet and RoBERTa☆12Sep 8, 2019Updated 6 years ago
- ☆15Jul 17, 2020Updated 5 years ago
- Code for running the experiments in Deep Subjecthood: Higher Order Grammatical Features in Multilingual BERT☆17Aug 15, 2023Updated 2 years ago
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆25Nov 28, 2025Updated 4 months ago
- Digital image watermarking using MATLAB (DWT, DCT), with a GUI in Python☆13Oct 8, 2019Updated 6 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- This is the official PyTorch implementation of our NeurIPS 2021 paper: "SalKG: Learning From Knowledge Graph Explanations for Commonsense…☆14Jun 9, 2022Updated 3 years ago
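- ✨ Personal Python code library (partial). 🌈 Covers Python basics, API calls for mainstream natural language processing tools, hands-on Keras & TensorFlow, data analysis, web scraping, and more☆12Mar 17, 2022Updated 4 years ago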
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- Simple MIDAS Examples☆12Nov 25, 2018Updated 7 years ago
- Analysis of the Flickr dataset☆19Jan 17, 2018Updated 8 years ago
- ☆10Jul 13, 2023Updated 2 years ago
- Feeling confused about super alignment? Here is a reading list☆44Jan 9, 2024Updated 2 years ago
- A chat project in the Android Material Design style, developed and integrated in Android Studio using the Easemob (环信) 3.x SDK☆13May 9, 2019Updated 6 years ago