FSoft-AI4Code / XMainframe
Language Model for Mainframe Modernization
☆50 · Updated 5 months ago
Alternatives and similar repositories for XMainframe:
Users who are interested in XMainframe are comparing it to the repositories listed below.
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository ☆57 · Updated 5 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 · Updated 7 months ago
- [FORGE 2025] Predicting Program Behavior with Dynamic Dependencies Learning ☆24 · Updated 6 months ago
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr… ☆23 · Updated 2 months ago
- [ACL 2024] Novel reranking method to select the best solutions for code generation ☆14 · Updated 8 months ago
- [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation ☆88 · Updated 6 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆101 · Updated 2 months ago
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark. ☆17 · Updated 2 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆101 · Updated last week
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆61 · Updated 2 months ago
- Set of scripts to finetune LLMs ☆36 · Updated 10 months ago
- Elevating RAG with Multi-Agent Systems ☆55 · Updated 3 months ago
- Function Calling Benchmark & Testing ☆82 · Updated 7 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 · Updated 4 months ago
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs. ☆145 · Updated 3 months ago
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments ☆44 · Updated this week
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref… ☆43 · Updated last week
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆65 · Updated 7 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps" ☆118 · Updated 6 months ago
- Google DeepMind's PromptBreeder for automated prompt engineering, implemented in LangChain Expression Language. ☆90 · Updated 6 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆57 · Updated 11 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM. ☆86 · Updated this week