Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Language Models"
☆39Dec 18, 2023Updated 2 years ago
Alternatives and similar repositories for JEEBench
Users that are interested in JEEBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"☆24Oct 8, 2023Updated 2 years ago
- ☆46Oct 11, 2023Updated 2 years ago
- A basic weather prediction software powered by TensorFlow☆15Dec 5, 2016Updated 9 years ago
- ☆17Oct 31, 2023Updated 2 years ago
- DIY Python Projects☆10Aug 26, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- ☆14Jan 4, 2021Updated 5 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images☆15Jan 24, 2023Updated 3 years ago
- Method for evaluating system summaries manually, via crowdsourcing, using a summarization dataset that includes reference summaries.☆12May 5, 2019Updated 6 years ago
- ☆13Oct 20, 2017Updated 8 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- Heroes of the Storm data in json format☆15Feb 20, 2026Updated last month
- A formal proof of the irrationality of zeta(3), the Apéry constant [maintainer=@amahboubi,@pi8027]☆26Mar 3, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)☆17Jul 16, 2024Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Mar 30, 2026Updated 2 weeks ago
- Data and code for the SciFact-Open task☆29Nov 24, 2023Updated 2 years ago
- ipython-notebooks on popular algorithms meant to be used at technical sessions for IITB students☆28Apr 9, 2017Updated 9 years ago
- This repo contains the code for our paper "Iterative Edit-Based Unsupervised Sentence Simplification" accepted at ACL 2020.☆14Jul 19, 2021Updated 4 years ago
- Code for the paper "Learning to Prove Theorems by Learning to Generate Theorems"☆33Oct 30, 2020Updated 5 years ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Oct 11, 2023Updated 2 years ago
- ☆18Jan 17, 2024Updated 2 years ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆24Oct 26, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Generate ics file given a set of courses and slots☆12Sep 16, 2024Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated 2 years ago
- Explainable Interactive Concept Learning☆15Mar 26, 2023Updated 3 years ago
- Multimodal extreme classification☆21May 1, 2024Updated last year
- Official repo for ACL 2023 paper Code4Struct: Code Generation for Few-Shot Structured Prediction from Natural Language.☆43Jan 7, 2024Updated 2 years ago
- The implementation for ThreadWeaver Adaptive Threading for Efficient Parallel Reasoning in Language Models☆53Apr 8, 2026Updated last week
- Tensorflow implementation of the `intelligent synapse' model from [Zenke et al., (2017)] and application to the Permuted MNIST benchmark.☆22Aug 2, 2017Updated 8 years ago
- ☆27May 11, 2023Updated 2 years ago
- IIT-JEE Name wise Result☆33Aug 9, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- NaturalProofs: Mathematical Theorem Proving in Natural Language (NeurIPS 2021 Datasets & Benchmarks)☆135Sep 8, 2022Updated 3 years ago
- ☆18Nov 1, 2023Updated 2 years ago
- Papers on Topology + Learning.☆14Feb 12, 2020Updated 6 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆12Jun 5, 2024Updated last year
- This repository contains implementation of CROSSGRAD (https://openreview.net/forum?id=r1Dx7fbCW) and DAN (https://arxiv.org/abs/1505.0781…☆24Dec 28, 2018Updated 7 years ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year