zorse-project / COBOLEvalLinks
Evaluate LLM-generated COBOL
☆35Updated last year
Alternatives and similar repositories for COBOLEval
Users that are interested in COBOLEval are comparing it to the libraries listed below
Sorting:
- Language Model for Mainframe Modernization☆53Updated 9 months ago
- ☆18Updated 3 weeks ago
- GraphRag vs Embeddings☆14Updated 10 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆72Updated 5 months ago
- For individual users, watsonx Code Assistant can access a local IBM Granite model☆32Updated 3 months ago
- Network Analysis through LLMs for Knowledge Extraction☆30Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated last year
- Automatic Test Generator☆12Updated 2 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆69Updated 9 months ago
- ☆26Updated last year
- ☆166Updated last year
- Neural Solr = Solr 9 + Mighty Inference + Node☆17Updated 2 years ago
- Leverage your LangChain trace data for fine tuning☆41Updated 10 months ago
- A collection of libraries to work with languages from Java, Kotlin, Python, Javascript, and Typescript☆36Updated 4 months ago
- [EMNLP 2024 Findings] Code for deciphering CoT using shift ciphers☆12Updated 6 months ago
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 8 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference☆60Updated last month
- ☆22Updated 2 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆108Updated 7 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆38Updated last week
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search☆26Updated last year
- Python library for Evaluation☆14Updated this week
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆19Updated 2 months ago
- LLM-based mutation testing☆11Updated 4 months ago
- A repository for creating, and sample code for consuming an ONNX embedding model☆30Updated 2 years ago
- VS Code language extension for NLP++☆9Updated this week
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆81Updated last year
- create workflows with LLMs☆54Updated 10 months ago
- LLM sampling method for enforcing syntax adherence in generated output☆25Updated 2 years ago