FSoft-AI4Code / XMainframeLinks
Language Model for Mainframe Modernization
☆54Updated 10 months ago
Alternatives and similar repositories for XMainframe
Users that are interested in XMainframe are comparing it to the libraries listed below
Sorting:
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 11 months ago
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆63Updated 9 months ago
- [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation☆96Updated 10 months ago
- [ACL 2024] Novel reranking method to select the best solutions for code generation☆16Updated last year
- all code examples in the blog posts☆21Updated 5 months ago
- [FORGE 2025] Predicting Program Behavior with Dynamic Dependencies Learning☆24Updated 10 months ago
- Official Repo for CRMArena and CRMArena-Pro☆92Updated last week
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 6 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆70Updated 6 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 7 months ago
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.☆23Updated 2 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆106Updated 2 months ago
- ☆50Updated 3 weeks ago
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆71Updated 7 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆115Updated 4 months ago
- Building a Chain of Thought RAG Model with DSPy, Qdrant and Ollama☆32Updated last year
- Generate Tools and Toolkits from any Python SDK -- no extra code required☆52Updated 7 months ago
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…☆29Updated 3 months ago
- ☆96Updated 9 months ago
- CodeSage: Code Representation Learning At Scale (ICLR 2024)☆109Updated 8 months ago
- ☆72Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 8 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0☆17Updated last year
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.☆119Updated 10 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆80Updated 3 months ago
- ☆47Updated last year
- ☆29Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆35Updated last month
- Visualize any repo or codebase into diagram or animation☆18Updated 8 months ago