[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism
☆34Jun 25, 2024Updated last year
Alternatives and similar repositories for NewsBench
Users that are interested in NewsBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Jun 10, 2025Updated 10 months ago
- PGRAG☆53Jul 16, 2024Updated last year
- Controllable Text Generation for Large Language Models: A Survey☆204Aug 27, 2024Updated last year
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆180Jun 7, 2025Updated 10 months ago
- Explore visualization tools for understanding Transformer-based large language models (LLMs)☆22Dec 1, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models☆370May 20, 2025Updated 10 months ago
- ☆12Jul 25, 2023Updated 2 years ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆173Dec 7, 2024Updated last year
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆271Sep 25, 2025Updated 6 months ago
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots☆12Dec 15, 2020Updated 5 years ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Aug 5, 2025Updated 8 months ago
- ☆13Jan 5, 2025Updated last year
- ☆33Nov 11, 2024Updated last year
- Official implementation of 'P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering'. (Accepted by ICLR 2024)☆18Jan 19, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆28Jan 25, 2025Updated last year
- This repository contains the code for our paper "Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive …☆14Nov 23, 2023Updated 2 years ago
- Official code for "Federated Weakly Supervised Video Anomaly Detection with Multimodal Prompt" (AAAI2025)☆26May 27, 2025Updated 10 months ago
- ☆19Sep 10, 2025Updated 7 months ago
- Original implementation of SmartRAG: Jointly Learn RAG-Related Tasks From the Environment Feedback (ICLR 2025)☆17Feb 17, 2025Updated last year
- Subspace Representation Learning for Sparse Linear Arrays to Localize More Sources than Sensors: A Deep Learning Methodology☆19Mar 18, 2025Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆138Jun 5, 2024Updated last year
- The official repo for ICRA 2023 paper 'Accelerating Multi-Agent Planning Using Graph Transformers with Bounded Suboptimality'☆20May 25, 2023Updated 2 years ago
- Academic Survey Paper Generation.☆962Jan 12, 2026Updated 3 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision…☆24Apr 6, 2025Updated last year
- [NeurIPS 2023 Spotlight] The Pursuit of Human Labeling: A New Perspective on Unsupervised Learning☆19Nov 7, 2023Updated 2 years ago
- Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retriev…☆722Updated this week
- Peter-JXL's profile with 19 stars and 12 forks 🎉☆19Apr 10, 2026Updated last week
- Benchmarking LLM Inference Speeds☆13Apr 7, 2026Updated last week
- ☆11Nov 12, 2024Updated last year
- ☆22Aug 30, 2021Updated 4 years ago
- Multimodal Classification and Out-of-distribution Detection☆18Apr 4, 2025Updated last year
- I don't want to maintain this project, the code probably won't compile or run. Archived.☆13Feb 25, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The repository for paper <Evaluating Open-QA Evaluation>☆25Apr 9, 2024Updated 2 years ago
- 中文原生多层次文生视频测评基准☆18Jul 8, 2024Updated last year
- get the media stream from Dahua/Haikang IPC SDK, and demux the stream to vedio and audio ES☆13Nov 15, 2015Updated 10 years ago
- AutoGenesis is an AI-powered automated testing framework based on Model Context Protocol (MCP), supporting multiple platforms including d…☆71Updated this week
- SRS is an industrial-strength live cluster, with simple code and best conceptual integrity.☆11Nov 14, 2021Updated 4 years ago
- Chinese-Mimi 是对 Moshi 模型的声码器进行了中文语料上的适配。☆34Mar 13, 2025Updated last year
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆16Mar 18, 2026Updated 3 weeks ago