code for ACL2024-main: BatchEval: Towards Human-like Text Evaluation
☆19May 20, 2024Updated last year
Alternatives and similar repositories for BatchEval
Users that are interested in BatchEval are comparing it to the libraries listed below.
- code for EACL2024-main: Generative Dense Retrieval: Memory Can Be a Burden☆32Jan 19, 2024Updated 2 years ago
- ☆14Jul 17, 2025Updated 8 months ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 7 months ago
- [XLLM@ACL2025] Official Code for "Less is More: Enhancing Structured Multi-Agent Reasoning via Quality-Guided Distillation"☆23Jul 29, 2025Updated 8 months ago
- The code for paper "Diversifying Dialog Generation via Adaptive Label Smoothing" in ACL 2021.☆26Jun 7, 2021Updated 4 years ago
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆21Aug 1, 2025Updated 7 months ago
- ☆16Jun 25, 2025Updated 9 months ago
- This is the repository of our ACL 2022 paper MISC: A MIxed Strategy-Aware Model Integrating COMET for Emotional Support ConversatioN☆40Apr 16, 2022Updated 3 years ago
- ☆12Apr 24, 2024Updated last year
- Reading comprehension based question-answering model for news articles.☆11Jun 22, 2022Updated 3 years ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated 11 months ago
- ☆18Nov 13, 2024Updated last year
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 2 years ago
- [EMNLP 2024 Main] Official repository of paper "SLANG: New Concept Comprehension of Large Language Models"☆14Oct 27, 2024Updated last year
- C^3-Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking☆38Mar 1, 2026Updated 3 weeks ago
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1☆13Apr 23, 2025Updated 11 months ago
- The tri-/dual-polarized channel modeling for HMIMO surfaces, the details refer to the paper "Tri-Polarized Holographic MIMO Surfaces for …☆14May 4, 2023Updated 2 years ago
- A biological dual-language foundation model☆12Jun 16, 2025Updated 9 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆102Feb 20, 2025Updated last year
- ☆16Oct 29, 2023Updated 2 years ago
- ☆23Sep 6, 2021Updated 4 years ago
- [NAACL 2022] Contrastive Learning for Prompt-based Few-shot Language Learners☆22Jan 26, 2023Updated 3 years ago
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence☆22Nov 19, 2025Updated 4 months ago
- [EMNLP 2025] Dataset and Code of "PersonaGym: Evaluating Persona Agents and LLMs"☆40Aug 21, 2025Updated 7 months ago
- First-year graduate courses at the University of Chinese Academy of Sciences (UCAS)☆18May 24, 2023Updated 2 years ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated 11 months ago
- Project for a Computer Security class based on CSAW capture the flag challenges☆13Mar 19, 2014Updated 12 years ago
- Single-sequence protein-RNA complex structure prediction with biological language models☆16Sep 23, 2025Updated 6 months ago
- An open source 3d printable mobile robot☆37Oct 22, 2013Updated 12 years ago
- [ACL 2023] Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation☆14Jul 11, 2023Updated 2 years ago
- ☆21Aug 9, 2024Updated last year
- Meta in-context learning for protein fitness prediction☆16Feb 7, 2025Updated last year
- Implementation of EMNLP 2023 Findings: Improving Question Generation with Multi-level Content Planning☆18Nov 30, 2023Updated 2 years ago
- Some of my open-source documents☆10Feb 18, 2025Updated last year
- ☆15Nov 26, 2020Updated 5 years ago
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- A self-alignment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters…☆210May 28, 2024Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated 2 years ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated last year