Evals that measure language models' ability to reason over long contexts.
☆10 · Sep 12, 2024 · Updated last year
Alternatives and similar repositories for LRCBench
Users that are interested in LRCBench are comparing it to the libraries listed below.
- Repository for tw.org site ☆14 · Jun 23, 2026 · Updated last week
- Jason Meridth's blog ☆14 · Jun 23, 2026 · Updated last week
- Request tiles from WMS servers that support EPSG:3857 ☆24 · May 14, 2021 · Updated 5 years ago
- Run Claude Code on OpenAI models ☆20 · Jul 13, 2025 · Updated 11 months ago
- Redesign of solar.lowtechmagazine.com in Hugo engine ☆17 · Apr 26, 2025 · Updated last year
- Evaluating LLMs by having them play games against each other ☆23 · Sep 9, 2025 · Updated 9 months ago
- Code for Columbia University COMS 3997 – LLM Ethics and Foundations ☆16 · Jan 7, 2025 · Updated last year
- A user interface for DSPy ☆225 · Jun 6, 2026 · Updated 3 weeks ago
- Small, simple agent task environments for training and evaluation ☆20 · Nov 1, 2024 · Updated last year
- AI for a cure, a combination of Latent-GAN and VAE-JTNN to create 100% valid drug like molecules ☆10 · Mar 16, 2020 · Updated 6 years ago
- vLLM with support for span semantics ☆23 · Feb 27, 2026 · Updated 4 months ago
- Makes it easy to use altair from FastHTML ☆28 · Oct 9, 2024 · Updated last year
- Bookkeeper Portal — 🟦 Edition ☆15 · Feb 4, 2023 · Updated 3 years ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆54 · Jul 9, 2025 · Updated 11 months ago
- LLM reads a paper and produces a working prototype ☆63 · Apr 12, 2025 · Updated last year
- Demo for ci/cd docker in aws ECS ☆11 · Sep 20, 2018 · Updated 7 years ago
- Small python package to measure OCR quality and other related metrics. ☆27 · Feb 19, 2024 · Updated 2 years ago
- Stock analysis and prediction - fundamental, quantitative, technical analysis and machine learning. ☆13 · May 1, 2023 · Updated 3 years ago
- Category Theory for Quantum Natural Language Processing ☆11 · Feb 22, 2023 · Updated 3 years ago
- Evaluator for the A::B Prompting Challenge ☆28 · Apr 10, 2024 · Updated 2 years ago
- This repository has properties for different groups of material. The main idea is to provide accessible properties for comparison. ☆14 · Jun 25, 2020 · Updated 6 years ago
- This is the code corresponding to my blog post "Generative Adversarial Networks (GANs) for Beginners: Generating Images of Distracted Dri… ☆11 · Feb 5, 2019 · Updated 7 years ago
- A common protocol for AI agent tools ☆10 · Oct 21, 2024 · Updated last year
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024. ☆33 · Nov 7, 2024 · Updated last year
- ☆31 · Jan 18, 2025 · Updated last year
- A CakePHP plugin that automatically turns boring named parameters into nice looking slugs. ☆17 · Aug 22, 2014 · Updated 11 years ago
- ☆12 · Jul 25, 2024 · Updated last year
- ☆20 · Apr 10, 2025 · Updated last year
- A code for calculating MBTR molecule/crystal structure representation. (https://doi.org/10.1088/2632-2153/aca005) ☆14 · Nov 15, 2022 · Updated 3 years ago
- Notes and Quiz Answers of Practical Machine Learning Coursera Course ☆11 · Jan 3, 2024 · Updated 2 years ago
- A web-based flashcard-style quiz application used for testing informatics and machine learning concepts ☆13 · Jan 28, 2018 · Updated 8 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆45 · Feb 15, 2024 · Updated 2 years ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM. ☆38 · Sep 11, 2024 · Updated last year
- The mechanoChemIGA code is an isogeometric analysis based code used to solve the partial differential equations describing solid mechanic… ☆14 · Oct 15, 2020 · Updated 5 years ago
- Adds several custom reports to Magento. ☆10 · May 20, 2016 · Updated 10 years ago
- ☆40 · May 14, 2025 · Updated last year
- GO-MELT: GPU-Optimized Multilevel Execution of LPBF Thermal simulations ☆35 · Mar 24, 2026 · Updated 3 months ago
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- Post-processing toolkit for electronic structure calculations ☆18 · Mar 17, 2026 · Updated 3 months ago