Logical Operations On Puzzles: Simple Iterative Reasoning Tests for LLMs first through wordgrids
☆18Feb 19, 2025Updated last year
Alternatives and similar repositories for LOOP-Evals
Users that are interested in LOOP-Evals are comparing it to the libraries listed below
Sorting:
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources.☆17May 7, 2024Updated last year
- Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"☆15Aug 26, 2024Updated last year
- DGCIT: Double Generative Adversarial Networks for Conditional Independence Testing☆11Nov 22, 2023Updated 2 years ago
- python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis☆10Nov 21, 2017Updated 8 years ago
- "Causality: Models, Reasoning, and Inference-Judea Pearl(2009)"中文翻译及学习笔记☆15Feb 18, 2022Updated 4 years ago
- Adds syntax to racket languages☆11Aug 17, 2022Updated 3 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago
- ☆11Feb 2, 2024Updated 2 years ago
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"☆29Jun 25, 2025Updated 8 months ago
- Hi, I'm Harmony the Hummingbird! Let's work together on whatever you care about.☆12May 3, 2024Updated last year
- A novel incremental hierarchical clustering algorithm (KDD 22)☆10Aug 31, 2023Updated 2 years ago
- Improving transparency of large language models' reasoning☆14Nov 25, 2025Updated 3 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- ☆11Jul 15, 2020Updated 5 years ago
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 4 years ago
- The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"☆11Feb 7, 2023Updated 3 years ago
- [NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed☆27Jul 26, 2025Updated 7 months ago
- Everything you need to reproduce "Better plain ViT baselines for ImageNet-1k" in PyTorch, and more☆12Updated this week
- ☆11Jun 5, 2024Updated last year
- A simple script to add pdf-files to Zotero via CLI☆12May 17, 2020Updated 5 years ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 7 months ago
- Code and data to support Bamman et al. (2020), "A Dataset of Literary Coreference" (LREC)☆10Dec 8, 2022Updated 3 years ago
- ☆12Jun 21, 2024Updated last year
- Data structures for describing changes to other data structures.☆17Jan 19, 2025Updated last year
- MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments☆13Jul 8, 2024Updated last year
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- code for the NAACL 2021 paper Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention by Microsoft S…☆12Apr 21, 2023Updated 2 years ago
- A Redex tutorial with a focus on how to do work in Redex☆11Oct 21, 2024Updated last year
- An inference server for Bark☆12Sep 22, 2023Updated 2 years ago
- analysis of public NLP corpora☆11Feb 9, 2023Updated 3 years ago
- ☆23Feb 8, 2026Updated last month
- 日志增量聚类算法,用于日志异常检测☆12Aug 20, 2022Updated 3 years ago
- A list of Numerical Multimodal reasoning papers and their implementation☆11May 13, 2024Updated last year
- ☆11Jan 16, 2024Updated 2 years ago
- Python wrapper for the Java-based Maximal Information-based Nonparametric Exploration (MINE) statistics library☆19Feb 3, 2012Updated 14 years ago
- Script for merging LaTeX files and stripping comments, in preparation for submission to ArXiV☆11May 23, 2014Updated 11 years ago
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs☆13Dec 28, 2024Updated last year
- recipe for training fully-featured self supervised image jepa models☆12Jun 4, 2025Updated 9 months ago