Abbey4799/CELLO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Abbey4799/CELLO)

Abbey4799 / CELLO

Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)

☆51

Alternatives and similar repositories for CELLO

Users that are interested in CELLO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Abbey4799 / CuteGPT
View on GitHub
An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.
☆64Oct 12, 2023Updated 2 years ago
thu-coai / ComplexBench
View on GitHub
Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)
☆102Feb 20, 2025Updated last year
YJiangcm / FollowBench
View on GitHub
[ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
☆118Jun 12, 2025Updated last year
siyuyuan / coscript
View on GitHub
Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning
☆36Aug 19, 2023Updated 2 years ago
yizhilll / CIF-Bench
View on GitHub
☆18Feb 29, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
PKU-Baichuan-MLSystemLab / SysBench
View on GitHub
SysBench: Can Large Language Models Follow System Messages?
☆40Sep 4, 2024Updated last year
Blue-Raincoat / SelectIT
View on GitHub
☆24Oct 14, 2024Updated last year
PKU-Baichuan-MLSystemLab / CFBench
View on GitHub
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs
☆56Aug 26, 2024Updated last year
qinyiwei / InfoBench
View on GitHub
☆61Aug 22, 2024Updated last year
meowpass / FollowComplexInstruction
View on GitHub
Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…
☆55Jun 24, 2024Updated 2 years ago
hanningzhang / prm
View on GitHub
☆17Nov 3, 2024Updated last year
icip-cas / awesome-auto-alignment
View on GitHub
Collection of papers for scalable automated alignment.
☆92Oct 22, 2024Updated last year
princeton-nlp / WhatICLLearns
View on GitHub
[ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning
☆21Jul 9, 2023Updated 3 years ago
tatHi / maxmatch_dropout
View on GitHub
☆10Sep 13, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
yulene22 / T-A-MFFNET-Projects
View on GitHub
Based on EEG signal, the differential entropy feature is extracted, and the convolution neural network based on time domain network and a…
☆13Sep 21, 2022Updated 3 years ago
ShootingWong / RichRAG
View on GitHub
☆11Nov 23, 2024Updated last year
DAMO-NLP-SG / TempReason
View on GitHub
☆33Jan 11, 2024Updated 2 years ago
Moviw / BJUT_Embedded_System
View on GitHub
北京工业大学嵌入式系统的4个实践项目以及综合项目
☆11Apr 26, 2023Updated 3 years ago
wzhouad / WPO
View on GitHub
Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"
☆41Sep 24, 2024Updated last year
yuleiqin / RAIF
View on GitHub
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆32Oct 9, 2025Updated 9 months ago
joel-huang / zeroshot-capsnet-pytorch
View on GitHub
GPU-accelerated PyTorch implementation of Zero-shot User Intent Detection via Capsule Neural Networks
☆16Apr 16, 2019Updated 7 years ago
CriticBench / CriticBench
View on GitHub
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆31Mar 5, 2024Updated 2 years ago
QwenLM / AutoIF
View on GitHub
☆336Jul 25, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
gao-xiao-bai / StrategyLLM
View on GitHub
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
☆22Dec 11, 2024Updated last year
Charrrrrlie / X-as-Supervision
View on GitHub
The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"
☆13Jan 22, 2025Updated last year
RUCAIBox / FIGA
View on GitHub
[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"
☆10May 5, 2024Updated 2 years ago
LinxinS97 / NLPBench
View on GitHub
NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models
☆10Oct 27, 2023Updated 2 years ago
thu-coai / CritiqueLLM
View on GitHub
☆147Jul 1, 2024Updated 2 years ago
zxx000728 / CodeGPT
View on GitHub
CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT
☆113Jun 16, 2023Updated 3 years ago
thu-coai / CharacterBench
View on GitHub
[AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models
☆23Aug 1, 2025Updated 11 months ago
GAIR-NLP / ReAlign
View on GitHub
Reformatted Alignment
☆111Sep 23, 2024Updated last year
chenjiawei30 / ConsistentChat
View on GitHub
Code for "ConsistentChat: Building Skeleton-Guided Consistent Multi-Turn Dialogues for Large Language Models from Scratch", where dataset…
☆16Sep 8, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NanshineLoong / Self-Evolving-Benchmark
View on GitHub
A framework for evolving and testing question-answering datasets with various models.
☆26Feb 28, 2024Updated 2 years ago
zhengcx / LineWrapContainer
View on GitHub
A custom line wrap layout ,support set max lines.(自定义流式布局，支持设置最大行数)
☆10Apr 13, 2018Updated 8 years ago
AndrewZhe / Revisit-DocRED
View on GitHub
☆18May 17, 2022Updated 4 years ago
tengxiaoliu / XoT
View on GitHub
[EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
☆27Nov 4, 2023Updated 2 years ago
MikeGu721 / XiezhiBenchmark
View on GitHub
☆98Dec 5, 2023Updated 2 years ago
Moonlight-Syntax / LUNA
View on GitHub
LUNA: a Framework for Language Understanding and Naturalness Assessment.
☆12Sep 9, 2023Updated 2 years ago
Unbabel / word-level-qe-corpus-builder
View on GitHub
Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.
☆10Sep 19, 2022Updated 3 years ago