zorse-project / COBOLEvalLinks

Evaluate LLM-generated COBOL

☆39

Alternatives and similar repositories for COBOLEval

Users that are interested in COBOLEval are comparing it to the libraries listed below

Sorting:

FSoft-AI4Code / XMainframe
Language Model for Mainframe Modernization
☆57Updated 10 months ago
vl2g / floco
Flow Chart Image-to-Code Generation
☆33Updated last year
yale-nlp / SciArena
☆37Updated last week
LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆54Updated last year
xingyaoww / LeTI
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆65Updated 2 years ago
guidance-ai / jsonschemabench
☆46Updated last month
SalesforceAIResearch / CRMArena
Official Repo for CRMArena and CRMArena-Pro
☆99Updated 2 weeks ago
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆87Updated 2 weeks ago
quantalogic / qllm
QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl…
☆33Updated 3 months ago
mkuchnik / relm
ReLM is a Regular Expression engine for Language Models
☆106Updated 2 years ago
NL2Code / CodeM
☆44Updated last year
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆108Updated 3 months ago
ibm-granite / granite-guardian
The Granite Guardian models are designed to detect risks in prompts and responses.
☆88Updated 2 weeks ago
IBM / text-generation-inference
IBM development fork of https://github.com/huggingface/text-generation-inference
☆61Updated 2 months ago
parea-ai / parea-sdk-py
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
☆78Updated 5 months ago
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆63Updated 2 months ago
Arize-ai / open-inference-spec
A specification for OpenInference, a semantic mapping of ML inferences
☆47Updated last year
jlscheerer / xtr-warp
XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.
☆137Updated 2 months ago
FSoft-AI4Code / RepoHyper
[FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository
☆63Updated 10 months ago
lancedb / ragged
☆20Updated 8 months ago
cohere-ai / quick-start-connectors
This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…
☆151Updated 9 months ago
explodinggradients / Funtuner
Supervised instruction finetuning for LLM with HF trainer and Deepspeed
☆35Updated 2 years ago
egozverev / Should-It-Be-Executed-Or-Processed
Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.
☆54Updated 4 months ago
robocorp / llmstatemachine
A Python library for building GPT-powered agents with state machine logic and chat history memory.
☆67Updated last year
Columbia-NLP-Lab / PAPILLON
Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles
☆50Updated 2 months ago
jina-ai / textbook
distill chatGPT coding ability into small model (1b)
☆30Updated last year
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
amazon-science / CodeSage
CodeSage: Code Representation Learning At Scale (ICLR 2024)
☆109Updated 8 months ago
kuzudb / graph-rag
Repo to experiment with Graph RAG strategies using Kùzu
☆53Updated 7 months ago
VITA-Group / ChainCoder
[ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …
☆40Updated last year