[ACL 2024 Main Conference] Chinese commonsense benchmark for LLMs
☆45Jul 27, 2024Updated last year
Alternatives and similar repositories for CHARM
Users that are interested in CHARM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SDK of OpenDataLab - https://opendatalab.org.cn☆59Jul 31, 2025Updated 9 months ago
- ThinkGeo is a Comprehensive Benchmark to evaluate Tool-Augmented Agents for Remote Sensing Tasks☆67Apr 2, 2026Updated last month
- ☆16Mar 25, 2024Updated 2 years ago
- (NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation☆67Oct 14, 2025Updated 6 months ago
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.☆47Aug 22, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆26Jun 28, 2024Updated last year
- Towards Foundation Models for Mixed Integer Linear Programming☆16Feb 3, 2025Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆49Mar 7, 2024Updated 2 years ago
- 面向大模型的民族文化数据集☆12May 26, 2025Updated 11 months ago
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆17Dec 14, 2025Updated 4 months ago
- LogicBench is a natural language question-answering dataset consisting of 25 different reasoning patterns spanning over propositional, fi…☆38May 2, 2024Updated 2 years ago
- [arXiv 2024] ChangeAnywhere: Sample generation for remote sensing change detection via semantic latent diffusion model☆30May 27, 2025Updated 11 months ago
- a within-document event coreference resolution system, trained and evaluated on the KBP corpus.☆10May 15, 2023Updated 2 years ago
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation☆10May 19, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [arXiv, 2024] Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection☆37Jul 2, 2025Updated 10 months ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment☆10Mar 1, 2025Updated last year
- ☆12Jun 30, 2024Updated last year
- Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images☆51Aug 26, 2025Updated 8 months ago
- Code and updates for the ScoreRS project.☆42Sep 19, 2025Updated 7 months ago
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL …☆11May 22, 2023Updated 2 years ago
- Experiment and analysis code to assess the usefulness of the EyeTribe tracker in psychological research.☆15Oct 30, 2014Updated 11 years ago
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D☆15Nov 9, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Collection of Remote Sensing Vision-Language Models☆142May 13, 2024Updated last year
- 儿童故事常识推理与寓意理解评测(Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories,CRMU)☆18Oct 22, 2024Updated last year
- AAAI 2024: Visual Instruction Generation and Correction☆96Feb 4, 2024Updated 2 years ago
- code for Imagination-Policy☆15Dec 1, 2024Updated last year
- ☆13Mar 30, 2026Updated last month
- ☆46Sep 13, 2025Updated 7 months ago
- SPCL: A New Framework for Domain Adaptive Semantic Segmentation via Semantic Prototype-based Contrastive Learning https://arxiv.org/abs/2…☆11Jun 24, 2022Updated 3 years ago
- [XLLM@ACL2025] Official Code for "Less is More: Enhancing Structured Multi-Agent Reasoning via Quality-Guided Distillation"☆21Jul 29, 2025Updated 9 months ago
- Leonardo Citraro, Mateusz Kozinski, Pascal Fua, Towards Reliable Evaluation of Road Network Reconstructions, ECCV 2020☆11Aug 21, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆15Jul 16, 2021Updated 4 years ago
- ☆42Jan 3, 2025Updated last year
- ☆14Sep 7, 2023Updated 2 years ago
- Change Detection towards Bitemporal Quality Difference via Hierarchical Correlation Distillation☆10Apr 30, 2024Updated 2 years ago
- This repo has moved to https://github.com/haosulab/ManiSkill☆17May 28, 2025Updated 11 months ago
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆30Apr 2, 2025Updated last year
- ☆10Nov 15, 2023Updated 2 years ago