Evals that measure language models' ability to reason over long contexts.
☆10Sep 12, 2024Updated last year
Alternatives and similar repositories for LRCBench
Users who are interested in LRCBench are comparing it to the libraries listed below.
- Repository for tw.org site☆14Mar 14, 2026Updated last week
- Jason Meridth's blog☆13Mar 10, 2026Updated last week
- Request tiles from WMS servers that support EPSG:3857☆24May 14, 2021Updated 4 years ago
- Run Claude Code on OpenAI models☆20Jul 13, 2025Updated 8 months ago
- Redesign of solar.lowtechmagazine.com in Hugo engine☆17Apr 26, 2025Updated 10 months ago
- Evaluating LLMs by having them play games against each other☆23Sep 9, 2025Updated 6 months ago
- Code for Columbia University COMS 3997 – LLM Ethics and Foundations☆14Jan 7, 2025Updated last year
- A user interface for DSPy☆215Oct 3, 2025Updated 5 months ago
- Small, simple agent task environments for training and evaluation☆19Nov 1, 2024Updated last year
- AI for a cure, a combination of Latent-GAN and VAE-JTNN to create 100% valid drug like molecules☆10Mar 16, 2020Updated 6 years ago
- vLLM with support for span semantics☆22Feb 27, 2026Updated 3 weeks ago
- Makes it easy to use altair from FastHTML☆28Oct 9, 2024Updated last year
- Bookkeeper Portal — 🟦 Edition☆15Feb 4, 2023Updated 3 years ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆53Jul 9, 2025Updated 8 months ago
- LLM reads a paper and produces a working prototype☆62Apr 12, 2025Updated 11 months ago
- Demo for ci/cd docker in aws ECS☆11Sep 20, 2018Updated 7 years ago
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- Stock analysis and prediction - fundamental, quantitative, technical analysis and machine learning.☆13May 1, 2023Updated 2 years ago
- Category Theory for Quantum Natural Language Processing☆11Feb 22, 2023Updated 3 years ago
- Evaluator for the A::B Prompting Challenge☆28Apr 10, 2024Updated last year
- This repository has properties for different groups of material. The main idea is to provide accessible properties for comparison.☆14Jun 25, 2020Updated 5 years ago
- This is the code corresponding to my blog post "Generative Adversarial Networks (GANs) for Beginners: Generating Images of Distracted Dri…☆11Feb 5, 2019Updated 7 years ago
- A common protocol for AI agent tools☆10Oct 21, 2024Updated last year
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆32Nov 7, 2024Updated last year
- ☆31Jan 18, 2025Updated last year
- A CakePHP plugin that automatically turns boring named parameters into nice looking slugs.☆17Aug 22, 2014Updated 11 years ago
- ☆12Jul 25, 2024Updated last year
- ☆20Apr 10, 2025Updated 11 months ago
- A code for calculating MBTR molecule/crystal structure representation. (https://doi.org/10.1088/2632-2153/aca005)☆13Nov 15, 2022Updated 3 years ago
- Notes and Quiz Answers of Practical Machine Learning Coursera Course☆10Jan 3, 2024Updated 2 years ago
- A web-based flashcard-style quiz application used for testing informatics and machine learning concepts☆13Jan 28, 2018Updated 8 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆45Feb 15, 2024Updated 2 years ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Sep 11, 2024Updated last year
- The mechanoChemIGA code is an isogeometric analysis based code used to solve the partial differential equations describing solid mechanic…☆14Oct 15, 2020Updated 5 years ago
- Adds several custom reports to Magento.☆10May 20, 2016Updated 9 years ago
- ☆40May 14, 2025Updated 10 months ago
- GO-MELT: GPU-Optimized Multilevel Execution of LPBF Thermal simulations☆32Dec 3, 2025Updated 3 months ago
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- An Infr app that helps you replay & talk to everything you've ever seen.☆15Sep 19, 2023Updated 2 years ago