FSoft-AI4Code / XMainframeLinks
Language Model for Mainframe Modernization
☆53Updated 9 months ago
Alternatives and similar repositories for XMainframe
Users that are interested in XMainframe are comparing it to the libraries listed below
Sorting:
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆63Updated 9 months ago
- [FORGE 2025] Predicting Program Behavior with Dynamic Dependencies Learning☆24Updated 9 months ago
- [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation☆95Updated 9 months ago
- [ACL 2024] Novel reranking method to select the best solutions for code generation☆16Updated 11 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆85Updated 2 months ago
- ☆47Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆113Updated 3 months ago
- ☆94Updated 8 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆221Updated 7 months ago
- ☆89Updated last week
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.☆23Updated last month
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆51Updated last month
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆147Updated 6 months ago
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…☆29Updated 3 months ago
- Function Calling Benchmark & Testing☆87Updated 10 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆105Updated last month
- Generalist Software Agents to Solve Soware Engineering Tasks☆210Updated 5 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆77Updated 3 months ago
- ☆92Updated 2 months ago
- Deep Research through Multi-Agents, using GraphRAG☆71Updated 6 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 5 months ago
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆87Updated this week
- ☆143Updated 10 months ago
- ☆76Updated last year
- ☆41Updated 5 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆78Updated 2 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆69Updated 9 months ago
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 8 months ago
- LangChain, Llama2-Chat, and zero- and few-shot prompting are used to generate synthetic datasets for IR and RAG system evaluation☆37Updated last year