plasma-umass / pythoness
Pythoness: use natural language to define Python functions.
☆20 · Updated 5 months ago
Alternatives and similar repositories for pythoness
Users who are interested in pythoness are comparing it to the libraries listed below.
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions ☆48 · Updated last month
- Incremental Python parser for constrained generation of code by LLMs. ☆17 · Updated last year
- A better way of testing, inspecting, and analyzing AI Agent traces. ☆40 · Updated 3 weeks ago
- Reproduction Package for the paper "Type-Constrained Code Generation with Language Models" [PLDI 2025] ☆74 · Updated 4 months ago
- A strongly typed Python DSL for developing message passing multi agent systems ☆53 · Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis ☆120 · Updated 6 months ago
- [ICML '24] R2E: Turn any GitHub Repository into a Programming Agent Environment ☆133 · Updated 6 months ago
- Use context-free grammars with an LLM ☆175 · Updated last year
- ☆98 · Updated 2 months ago
- Query language for blending SQL and LLMs across structured + unstructured data, with type constraints. ☆114 · Updated last week
- Code and Data for: Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming ☆32 · Updated last year
- Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multis… ☆275 · Updated last year
- LLM verified with Monte Carlo Tree Search ☆280 · Updated 6 months ago
- Binary Python wheels for all tree sitter languages. ☆244 · Updated 8 months ago
- Automatic AI-powered test suite generator ☆92 · Updated 2 months ago
- ☆59 · Updated 3 weeks ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆218 · Updated last week
- Leverage your LangChain trace data for fine tuning ☆46 · Updated last year
- ☆117 · Updated 4 months ago
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so… ☆56 · Updated last year
- ☆28 · Updated 4 months ago
- Enriched Python function call graphs for agents and coding assistants ☆121 · Updated 3 months ago
- Small, simple agent task environments for training and evaluation ☆18 · Updated 11 months ago
- Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles ☆44 · Updated last month
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024 ☆75 · Updated last year
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation ☆58 · Updated last month
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆54 · Updated 3 months ago
- An explainable inference software supporting annotated, real valued, graph based and temporal logic ☆301 · Updated this week
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena… ☆39 · Updated this week
- Repilot, a patch generation tool introduced in the ESEC/FSE'23 paper "Copiloting the Copilots: Fusing Large Language Models with Completi… ☆133 · Updated 2 years ago