Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Language Models"
☆39Dec 18, 2023Updated 2 years ago
Alternatives and similar repositories for JEEBench
Users that are interested in JEEBench are comparing it to the libraries listed below.
- Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"☆26Oct 8, 2023Updated 2 years ago
- JEEBench, EMNLP 2023☆48Dec 18, 2023Updated 2 years ago
- A basic weather prediction software powered by TensorFlow☆15Dec 5, 2016Updated 9 years ago
- Convert your docs to markdown format.☆14Jul 26, 2016Updated 9 years ago
- DIY Python Projects☆10Aug 26, 2016Updated 9 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- ☆14Jan 4, 2021Updated 5 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 4 years ago
- The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models to Understand The Flow of Events"☆11May 6, 2021Updated 5 years ago
- Aggregation of SymPy related blogs☆16Updated this week
- Larger-Context NMT☆13Aug 20, 2017Updated 8 years ago
- Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images☆15Jan 24, 2023Updated 3 years ago
- Method for evaluating system summaries manually, via crowdsourcing, using a summarization dataset that includes reference summaries.☆12May 5, 2019Updated 7 years ago
- ☆13Oct 20, 2017Updated 8 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- This repo contains the code for our paper "Iterative Edit-Based Unsupervised Sentence Simplification" accepted at ACL 2020.☆14Jul 19, 2021Updated 4 years ago
- Code for the paper "Learning to Prove Theorems by Learning to Generate Theorems"☆33Oct 30, 2020Updated 5 years ago
- prediction markets -> llm -> news☆25Nov 10, 2025Updated 7 months ago
- Scripts for finetuning m2m-100 models☆19Jul 28, 2022Updated 3 years ago
- Youtube playlist maker webapp - make lightning fast playlists and share them!☆30May 4, 2018Updated 8 years ago
- Generate ics file given a set of courses and slots☆12Sep 16, 2024Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆31Mar 5, 2024Updated 2 years ago
- Explainable Interactive Concept Learning☆15Mar 26, 2023Updated 3 years ago
- Official repo for ACL 2023 paper Code4Struct: Code Generation for Few-Shot Structured Prediction from Natural Language.☆43Jan 7, 2024Updated 2 years ago
- The implementation for ThreadWeaver Adaptive Threading for Efficient Parallel Reasoning in Language Models☆60Apr 8, 2026Updated 2 months ago
- Tensorflow implementation of the `intelligent synapse' model from [Zenke et al., (2017)] and application to the Permuted MNIST benchmark.☆22Aug 2, 2017Updated 8 years ago
- IIT-JEE Name wise Result☆33Aug 9, 2021Updated 4 years ago
- NaturalProofs: Mathematical Theorem Proving in Natural Language (NeurIPS 2021 Datasets & Benchmarks)☆137Sep 8, 2022Updated 3 years ago
- A Concept-Centric Framework for Intelligent Agents☆27Oct 1, 2025Updated 8 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- TensorFlow implementation [ICLR 18] "Learning Approximate Inference Networks for Structured Prediction"☆30Jun 10, 2018Updated 8 years ago
- Integrating Deep Neural Networks and Symbolic Inference for Organic Reactivity Prediction☆13Jan 8, 2022Updated 4 years ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆32Jul 9, 2024Updated last year
- A package dedicated for running benchmark agreement testing☆19Sep 18, 2025Updated 9 months ago
- Delete Unwanted Bibliography fields from bibtex (.bib) files☆24Dec 24, 2018Updated 7 years ago
- ☆12Jun 5, 2024Updated 2 years ago
- This repository contains implementation of CROSSGRAD (https://openreview.net/forum?id=r1Dx7fbCW) and DAN (https://arxiv.org/abs/1505.0781…☆24Dec 28, 2018Updated 7 years ago
- A terminal-based Youtube song search and downloader using youtube-dl☆11Mar 13, 2017Updated 9 years ago
- ☆14Mar 31, 2022Updated 4 years ago