Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024)
☆50 · Apr 19, 2024 · Updated last year
Alternatives and similar repositories for CELLO
Users that are interested in CELLO are comparing it to the libraries listed below.
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University. ☆64 · Oct 12, 2023 · Updated 2 years ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆102 · Feb 20, 2025 · Updated last year
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models ☆118 · Jun 12, 2025 · Updated 10 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆36 · Aug 19, 2023 · Updated 2 years ago
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs ☆51 · Aug 26, 2024 · Updated last year
- ☆18 · Feb 29, 2024 · Updated 2 years ago
- SysBench: Can Large Language Models Follow System Messages? ☆40 · Sep 4, 2024 · Updated last year
- ☆23 · Oct 14, 2024 · Updated last year
- ☆59 · Aug 22, 2024 · Updated last year
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆53 · Jun 24, 2024 · Updated last year
- ☆17 · Nov 3, 2024 · Updated last year
- Collection of papers for scalable automated alignment. ☆93 · Oct 22, 2024 · Updated last year
- The world's most intuitive and reliable strongly-typed collaborative library ☆27 · Feb 8, 2026 · Updated 2 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning ☆30 · Mar 5, 2024 · Updated 2 years ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning ☆21 · Jul 9, 2023 · Updated 2 years ago
- ☆10 · Sep 13, 2022 · Updated 3 years ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin… ☆13 · Jul 27, 2025 · Updated 8 months ago
- Based on EEG signal, the differential entropy feature is extracted, and the convolution neural network based on time domain network and a… ☆13 · Sep 21, 2022 · Updated 3 years ago
- A curated collection of research and techniques for protecting intellectual property of large language models, including watermarking, fi… ☆47 · Feb 15, 2026 · Updated 2 months ago
- ☆11 · Nov 23, 2024 · Updated last year
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles" ☆186 · Apr 2, 2026 · Updated 2 weeks ago
- Beijing University of Technology: four practice projects and a comprehensive project for embedded systems ☆11 · Apr 26, 2023 · Updated 2 years ago
- A framework for evolving and testing question-answering datasets with various models. ☆23 · Feb 28, 2024 · Updated 2 years ago
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization" ☆41 · Sep 24, 2024 · Updated last year
- ☆33 · Jan 11, 2024 · Updated 2 years ago
- GPU-accelerated PyTorch implementation of Zero-shot User Intent Detection via Capsule Neural Networks ☆16 · Apr 16, 2019 · Updated 6 years ago
- The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020… ☆67 · Oct 13, 2020 · Updated 5 years ago
- ☆16 · Jun 25, 2025 · Updated 9 months ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models ☆10 · Oct 27, 2023 · Updated 2 years ago
- Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues. ☆41 · Jul 3, 2025 · Updated 9 months ago
- ☆148 · Jul 1, 2024 · Updated last year
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT ☆114 · Jun 16, 2023 · Updated 2 years ago
- Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter" ☆14 · Jul 19, 2024 · Updated last year
- [ACL 2025] The official repository for "HyKGE: A Hypothesis Knowledge Graph Enhanced Framework for Accurate and Reliable Medical LLMs Res… ☆22 · Feb 27, 2025 · Updated last year
- An easy-to-use Python framework to defend against jailbreak prompts. ☆21 · Mar 22, 2025 · Updated last year
- ☆99 · Dec 5, 2023 · Updated 2 years ago
- DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings ☆19 · Nov 24, 2021 · Updated 4 years ago
- ☆29 · Oct 9, 2025 · Updated 6 months ago
- A custom line-wrap (flow) layout that supports setting a maximum number of lines. ☆11 · Apr 13, 2018 · Updated 8 years ago