zorse-project / COBOLEval
Evaluate LLM-generated COBOL
☆30 · Updated 10 months ago
Alternatives and similar repositories for COBOLEval:
Users who are interested in COBOLEval are comparing it to the repositories listed below
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆60 · Updated 2 months ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data ☆30 · Updated last week
- LLM finetuning ☆42 · Updated last year
- Language Model for Mainframe Modernization ☆50 · Updated 6 months ago
- Prompt Declaration Language (PDL) is a declarative prompt programming language. ☆131 · Updated this week
- Training hybrid models for dummies. ☆20 · Updated last month
- For individual users, watsonx Code Assistant can access a local IBM Granite model ☆28 · Updated 3 weeks ago
- ☆19 · Updated 11 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses. ☆70 · Updated 2 weeks ago
- The official evaluation suite and dynamic data release for MixEval. ☆10 · Updated 5 months ago
- Python library for Evaluation ☆12 · Updated 2 weeks ago
- The Arcee client for executing domain-adapted language model routines https://pypi.org/project/arcee-py/ ☆27 · Updated 5 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces. ☆29 · Updated this week
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024 ☆64 · Updated 6 months ago
- Benchmark structured generation libraries ☆26 · Updated 4 months ago
- ☆26 · Updated last year
- ReLM is a Regular Expression engine for Language Models ☆103 · Updated last year
- Vector Database with support for late interaction and token level embeddings. ☆53 · Updated 5 months ago
- Create embeddings for LLM using the Nomic API ☆22 · Updated 3 months ago
- ☆37 · Updated last month
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 · Updated last month
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec… ☆29 · Updated 7 months ago
- Python library for Synthetic Data Generation ☆34 · Updated last week
- One Line To Build Zero-Data Classifiers in Minutes ☆36 · Updated 5 months ago
- ☆111 · Updated last month
- Harness used to benchmark aider against SWE Bench benchmarks ☆66 · Updated 8 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆63 · Updated last year