plasma-umass / pythonessLinks
Pythoness: use natural language to define Python functions.
☆21Updated 8 months ago
Alternatives and similar repositories for pythoness
Users that are interested in pythoness are comparing it to the libraries listed below
Sorting:
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions☆48Updated 3 months ago
- Use context-free grammars with an LLM☆175Updated last year
- Enriched Python function call graphs for agents and coding assistants☆122Updated 6 months ago
- Reproduction Package for the paper "Type-Constrained Code Generation with Language Models" [PLDI 2025]☆80Updated 6 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆40Updated 2 months ago
- A strongly typed Python DSL for developing message passing multi agent systems☆53Updated last year
- Small, simple agent task environments for training and evaluation☆19Updated last year
- Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multis…☆277Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆95Updated 2 months ago
- Incremental Python parser for constrained generation of code by LLMs.☆18Updated last year
- ☆23Updated last year
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆64Updated last week
- Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles☆44Updated 3 months ago
- Query language for blending SQL and LLMs across structured + unstructured data, with type constraints.☆122Updated last week
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena…☆42Updated last month
- LLM verified with Monte Carlo Tree Search☆284Updated 8 months ago
- Evaluate LLM-generated COBOL☆41Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆81Updated 10 months ago
- Binary Python wheels for all tree sitter languages.☆254Updated 10 months ago
- An attribution library for LLMs☆46Updated last year
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆79Updated last year
- Benchmark structured generation libraries☆30Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆164Updated 8 months ago
- Sphynx Hallucination Induction☆53Updated 10 months ago
- ☆32Updated 6 months ago
- Foyle is a copilot to help developers deploy and operate their applications.☆132Updated 9 months ago
- Automatic AI-powered test suite generator☆98Updated last month
- Generate code Python source cross-reference facts in Kythe format☆25Updated last year
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆79Updated 11 months ago
- ☆78Updated last year