SuperCLUE-Math6: Exploring a new-generation, native-Chinese, multi-turn, multi-step mathematical reasoning dataset
☆58Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for SuperCLUE-Math6
Users that are interested in SuperCLUE-Math6 are comparing it to the libraries listed below
- Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"☆10Nov 17, 2020Updated 5 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models☆17May 23, 2025Updated 9 months ago
- A native-Chinese, graded benchmark for testing coding ability☆15Apr 11, 2024Updated last year
- A small C++ library for efficient calculation of rotation invariant features in 2D images using OpenCV.☆12Feb 12, 2021Updated 5 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Image Search using BoW☆14Apr 20, 2015Updated 10 years ago
- Relational Content-Based Image Retrieval (R-CBIR) - Retrieving images with given relationships among objects☆17Oct 12, 2021Updated 4 years ago
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- This is the repository of the Ape210K dataset and baseline models.☆199Dec 10, 2019Updated 6 years ago
- Open sourced backend for Martian's LLM Inference Provider Leaderboard☆21Aug 13, 2024Updated last year
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆25May 29, 2025Updated 9 months ago
- A native-Chinese evaluation benchmark for retrieval-augmented generation☆125Apr 18, 2024Updated last year
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆720Jan 7, 2025Updated last year
- ☆19Jul 15, 2022Updated 3 years ago
- Official source code for "Roto-translated Local Coordinate Frames For Interacting Dynamical Systems". In NeurIPS 2021.☆21Feb 9, 2023Updated 3 years ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆102Feb 20, 2025Updated last year
- EMNLP 2023 - InfoSeek: A New VQA Benchmark Focused on Visual Info-Seeking Questions☆25May 30, 2024Updated last year
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆111May 22, 2025Updated 9 months ago
- TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of ACL 2024)☆26Jun 21, 2024Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆73Jun 25, 2024Updated last year
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT☆124Jan 30, 2026Updated last month
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆129May 16, 2025Updated 9 months ago
- Image Retrieval Experiment Using Triplet Loss☆26Dec 12, 2016Updated 9 years ago
- Various computer vision routines by Tom Botterill. Essential matrix estimation+optimisation, BoWSLAM, BaySAC, real-time local mosaicing.☆34Jan 21, 2016Updated 10 years ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆77Oct 9, 2025Updated 4 months ago
- ☆64Apr 9, 2024Updated last year
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆127Jan 14, 2025Updated last year
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024☆144Feb 24, 2025Updated last year
- image retrieval using metric learning☆10Nov 22, 2022Updated 3 years ago
- Adversarial attack on a CNN trained on the MNIST dataset using Targeted I-FGSM and Targeted MI-FGSM☆11Feb 17, 2018Updated 8 years ago
- ☆27Dec 3, 2025Updated 2 months ago
- ☆36Jul 7, 2025Updated 7 months ago