[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism
☆34Jun 25, 2024Updated last year
Alternatives and similar repositories for NewsBench
Users who are interested in NewsBench are comparing it to the libraries listed below.
- ☆22Jun 10, 2025Updated 10 months ago
- PGRAG☆53Jul 16, 2024Updated last year
- Controllable Text Generation for Large Language Models: A Survey☆205Aug 27, 2024Updated last year
- Explore visualization tools for understanding Transformer-based large language models (LLMs)☆23Dec 1, 2024Updated last year
- ☆57Mar 11, 2025Updated last year
- CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models☆377May 20, 2025Updated 11 months ago
- [ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation☆181Nov 14, 2025Updated 5 months ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆174Dec 7, 2024Updated last year
- HaluMem is the first operation level hallucination evaluation benchmark tailored to agent memory systems.☆129Apr 30, 2026Updated last week
- ☆15Sep 28, 2020Updated 5 years ago
- A research project exploring fine-tuning BERT-style models for text generation☆40Nov 30, 2025Updated 5 months ago
- A self-adaptive and class-balanced approach to improve deep neural network performance in the presence of noisy labels☆19Jul 2, 2024Updated last year
- Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020)☆13Jul 25, 2024Updated last year
- Official PyTorch implementation of RadMamba☆22Aug 25, 2025Updated 8 months ago
- Artifact evaluation for "E2Usd: Efficient-yet-effective Unsupervised State Detection for Multivariate Time Series" accepted by WWW'24☆13Jul 29, 2024Updated last year
- Documentation for the repository: https://github.com/MemTensor/MemOS☆25May 1, 2026Updated last week
- Official code for "Federated Weakly Supervised Video Anomaly Detection with Multimodal Prompt" (AAAI2025)☆27May 27, 2025Updated 11 months ago
- ☆19Sep 10, 2025Updated 7 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆139Jun 5, 2024Updated last year
- Academic Survey Paper Generation.☆967Jan 12, 2026Updated 3 months ago
- [ICDE 2024] VDTuner - Automated Performance Tuning for Vector Data Management Systems (Vector Databases)☆36Apr 21, 2024Updated 2 years ago
- The source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision…☆24Apr 6, 2025Updated last year
- [NeurIPS 2023 Spotlight] The Pursuit of Human Labeling: A New Perspective on Unsupervised Learning☆19Nov 7, 2023Updated 2 years ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
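- This repository contains my solutions to OJ algorithm problems from various problem banks, including the Zhejiang University Programming Ability Test (PAT), UVa, Codeup, Luogu (洛谷), the Peking University OJ (POJ), the Anhui Science and Technology University ACM problem bank (AOJ)... Currently focusing mainly on PAT; completion has passed 5…☆18Jan 27, 2023Updated 3 years ago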
- Instruction Following Eval☆17Jan 16, 2025Updated last year
- Benchmarking LLM Inference Speeds☆13Apr 7, 2026Updated last month
- I don't want to maintain this project, the code probably won't compile or run. Archived.☆13Feb 25, 2024Updated 2 years ago
- MATLAB code for the coarray tensor completion-based 2-D DOA estimation algorithm☆23Jul 3, 2024Updated last year
- Get the media stream from the Dahua/Haikang IPC SDK and demux it into video and audio ES☆13Nov 15, 2015Updated 10 years ago
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar☆127Feb 3, 2026Updated 3 months ago
- SRS is an industrial-strength live cluster, with simple code and best conceptual integrity.☆11Nov 14, 2021Updated 4 years ago
- Joint learning of object and action detectors☆15Nov 5, 2019Updated 6 years ago
- ☆24May 26, 2022Updated 3 years ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆194Oct 8, 2024Updated last year
- Claw-R1: Empowering OpenClaw with Advanced Agentic RL.☆176Apr 8, 2026Updated last month
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆258Oct 30, 2024Updated last year
- ☆12Nov 2, 2021Updated 4 years ago
- Bump charts are used to represent rank changes over time. By default there's no support from matplotlib or seaborn for this, and writing …☆21Aug 12, 2019Updated 6 years ago