Probabilistic LLM evaluations. [CogSci2023; ACL2023]
☆74 · Jul 27, 2024 · Updated last year
Alternatives and similar repositories for probsem
Users that are interested in probsem are comparing it to the libraries listed below.
- Get language models to generate responses in a specific format reliably. Open source implementation of Synchromesh: Reliable code generat… ☆33 · Feb 26, 2024 · Updated 2 years ago
- Translation of the databricks-dolly-15k dataset to Chinese for commercial use. ☆19 · Apr 17, 2023 · Updated 3 years ago
- A scalable abstraction learning library ☆92 · Sep 10, 2025 · Updated 9 months ago
- Measuring if attention is explanation with ROAR ☆22 · Mar 3, 2023 · Updated 3 years ago
- [ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks ☆62 · Aug 2, 2023 · Updated 2 years ago
- ☆18 · Jan 3, 2022 · Updated 4 years ago
- ☆13 · Apr 17, 2025 · Updated last year
- Implementation of MixCE method described in ACL 2023 paper by Zhang et al. ☆20 · May 29, 2023 · Updated 3 years ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning". ☆169 · May 7, 2024 · Updated 2 years ago
- Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1 ☆14 · Mar 27, 2024 · Updated 2 years ago
- 🐣🕐📅 A simple utility to draft scheduling emails. ☆12 · Sep 13, 2023 · Updated 2 years ago
- Code to reproduce experiments in the paper "Constrained Language Models Yield Few-Shot Semantic Parsers" (EMNLP 2021). ☆67 · May 31, 2024 · Updated 2 years ago
- This is the repository for the paper Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descripti… ☆25 · Nov 18, 2022 · Updated 3 years ago
- Using self-play to augment multi-turn text-to-SQL datasets ☆11 · Oct 20, 2022 · Updated 3 years ago
- Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)" ☆11 · May 23, 2023 · Updated 3 years ago
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo… ☆12 · Nov 1, 2022 · Updated 3 years ago
- GOPHI: an AMR-to-English Verbalizer ☆11 · Feb 5, 2020 · Updated 6 years ago
- Language of thought library for python 3 ☆51 · Feb 14, 2024 · Updated 2 years ago
- Code and Data for Evaluation WG ☆42 · May 4, 2022 · Updated 4 years ago
- An official implementation of EHRDiff [TMLR] ☆33 · Jun 25, 2024 · Updated last year
- ☆18 · Apr 15, 2024 · Updated 2 years ago
- ☆25 · Aug 2, 2022 · Updated 3 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304 ☆14 · Oct 11, 2022 · Updated 3 years ago
- LLM as World Models using Bayesian inference ☆20 · May 27, 2025 · Updated last year
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase… ☆13 · Jun 24, 2024 · Updated last year
- ☆30 · Oct 2, 2023 · Updated 2 years ago
- The simple and easy javascript library used for building UI for the web using JSON 🔥 ☆15 · Jun 11, 2021 · Updated 5 years ago
- Embedding Recycling for Language models ☆38 · Jul 11, 2023 · Updated 2 years ago
- structured attention encoder ☆13 · Jun 6, 2018 · Updated 8 years ago
- ☆19 · Nov 7, 2022 · Updated 3 years ago
- a Haskell library that implements (Projective) Discourse Representation Theory (DRT) ☆27 · Sep 15, 2022 · Updated 3 years ago
- Make triton easier ☆50 · Jun 12, 2024 · Updated 2 years ago
- codesearch.ai semantic code search engine ☆42 · Mar 24, 2023 · Updated 3 years ago
- ☆26 · Apr 15, 2023 · Updated 3 years ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆44 · May 23, 2026 · Updated 3 weeks ago
- Probabilistic programming with large language models ☆174 · Jun 7, 2026 · Updated last week
- OpenAI API Client in D Programming Language ☆18 · Jun 29, 2025 · Updated 11 months ago
- Language-annotated Abstraction and Reasoning Corpus ☆99 · Mar 24, 2026 · Updated 2 months ago
- A rule engine based on Attempto Controlled English ☆18 · Nov 1, 2024 · Updated last year