Probabilistic LLM evaluations. [CogSci2023; ACL2023]
☆73Jul 27, 2024Updated last year
Alternatives and similar repositories for probsem
Users that are interested in probsem are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A scalable abstraction learning library☆89Sep 10, 2025Updated 7 months ago
- Measuring if attention is explanation with ROAR☆22Mar 3, 2023Updated 3 years ago
- [ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks☆61Aug 2, 2023Updated 2 years ago
- ☆18Jan 3, 2022Updated 4 years ago
- 🐣🕐📅 A simple utility to draft scheduling emails.☆12Sep 13, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Logical inference system based on event semantics and degree semantics in formal semantics☆10Jan 22, 2023Updated 3 years ago
- Using self-play to augment multi-turn text-to-SQL datasets☆11Oct 20, 2022Updated 3 years ago
- A tool for cross-checking Verilog compilers☆15Apr 16, 2025Updated last year
- GOPHI: an AMR-to-English Verbalizer☆11Feb 5, 2020Updated 6 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Code and Data for Evaluation WG☆42May 4, 2022Updated 4 years ago
- PyTorch interface for TrueGrad Optimizers☆43Aug 8, 2023Updated 2 years ago
- An offical implementation of EHRDiff [TMLR]☆34Jun 25, 2024Updated last year
- ☆18Apr 15, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆14Oct 11, 2022Updated 3 years ago
- A framework for nonlinear continuous-time regression☆41Jan 22, 2025Updated last year
- ☆10Jun 11, 2019Updated 6 years ago
- ☆30Oct 2, 2023Updated 2 years ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- structured attention encoder☆13Jun 6, 2018Updated 7 years ago
- [NO LONGER MAINTAINED, SUPERSEDED BY https://github.com/trueagi-io/pln-experimental and https://github.com/trueagi-io/PLN]. Probabilisti…☆16Sep 20, 2025Updated 7 months ago
- ☆19Nov 7, 2022Updated 3 years ago
- codesearch.ai semantic code search engine☆42Mar 24, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Probabilistic programming with large language models☆170Apr 9, 2026Updated 3 weeks ago
- A rule engine based on Attempto Controlled English☆18Nov 1, 2024Updated last year
- ☆23Jan 27, 2025Updated last year
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 8 years ago
- 基于Metropolis修改的北大风格Beamer主题☆28May 29, 2022Updated 3 years ago
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆14May 25, 2023Updated 2 years ago
- A Julia package for differentiating through expectations with Monte-Carlo estimates☆16Nov 25, 2024Updated last year
- Data and all☆14Sep 30, 2019Updated 6 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Not financial advice.☆28Mar 18, 2023Updated 3 years ago
- ☆44Jun 24, 2025Updated 10 months ago
- Code for EMNLP-IJCNLP 2019 MRQA Workshop Paper: "Domain-agnostic Question-Answering with Adversarial Training"☆40Jul 25, 2024Updated last year
- A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research …☆1,014Dec 16, 2024Updated last year
- Comprehensive LLM evaluation framework: GPQA Diamond to Chatbot Arena. Tests all major models equally, easily extensible.☆17Aug 22, 2024Updated last year
- Composable inference algorithms with LLMs and programmable logic☆70Dec 4, 2024Updated last year
- Data and related code for ACL2019 paper "Implicit Discourse Relation Identification for Open-domain Dialogues"☆12Jul 29, 2019Updated 6 years ago