plasma-umass / pythoness
Pythoness: use natural language to define Python functions.
☆18Updated 3 weeks ago
Alternatives and similar repositories for pythoness
Users who are interested in pythoness are comparing it to the libraries listed below.
- Benchmark structured generation libraries☆27Updated 6 months ago
- Building Agents with LLM structured generation (BAML), MCP Tools, and 12-Factor Agents principles☆11Updated this week
- Checks regexes for overlaps. Based on greenery by @qntm.☆52Updated 11 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆35Updated this week
- ☆18Updated last year
- Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles☆43Updated this week
- ☆28Updated 7 months ago
- ☆54Updated 7 months ago
- ☆22Updated last week
- MirrorDataGenerator is a Python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆23Updated 2 years ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆11Updated last month
- Incremental Python parser for constrained generation of code by LLMs.☆16Updated 8 months ago
- LMQL implementation of tree of thoughts☆34Updated last year
- A curated collection of example marimo notebooks — use these as templates for your own experiments, workflows, and tools.☆42Updated this week
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆13Updated 9 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- Sphynx Hallucination Induction☆54Updated 3 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆57Updated last month
- ☆49Updated this week
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Updated 9 months ago
- A fork of sqlite-utils with CLI etc removed☆15Updated last month
- Chat Markup Language conversation library☆55Updated last year
- ✅ Pytest-style test runner for langchain projects☆25Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Updated last month
- Clover: Closed-Loop Verifiable Code Generation☆35Updated this week
- ☆22Updated last year
- Some tough questions to test new models.☆28Updated last year
- ☆45Updated 7 months ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆48Updated last month