[EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
☆105Aug 21, 2024Updated last year
Alternatives and similar repositories for TheVault
Users that are interested in TheVault are comparing it to the libraries listed below
Sorting:
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…☆43Jan 8, 2026Updated 2 months ago
- [ACL 2024] Novel reranking method to select the best solutions for code generation☆16Jun 9, 2024Updated last year
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆15Jan 23, 2024Updated 2 years ago
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.☆29Apr 21, 2025Updated 10 months ago
- [ICLR 2025 - Workshop AgenticAI Oral] Large Language Models powered Neural Solvers for Generalized Vehicle Routing Problems☆27May 29, 2025Updated 9 months ago
- Open-source Self-Instruction Tuning Code LLM☆172Apr 26, 2023Updated 2 years ago
- ⚒️ Tree-sitter custom toolkit for extracting function and class from raw source file☆51Jul 1, 2024Updated last year
- Language Model for Mainframe Modernization☆68Aug 23, 2024Updated last year
- Generalist Software Agents to Solve Soware Engineering Tasks☆236Dec 10, 2024Updated last year
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond☆23Apr 9, 2023Updated 2 years ago
- AutoPruner: Transformer-based Call Graph Pruning (ESEC/FSE 2022, Research Track)☆22Dec 7, 2023Updated 2 years ago
- Data and code for "Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models", which accepted in TSE.☆15Jul 3, 2024Updated last year
- SQL autocomplete data☆29Jun 18, 2024Updated last year
- Sythetic data generation and normalization functions powered by LLMs☆59Sep 19, 2024Updated last year
- pi + rainbowhat + touchscreen + usb sound card (mic or aux in) + open ai = audio logic anaklyzer☆12Apr 23, 2023Updated 2 years ago
- [FORGE 2025] Incorporating Agile methodology into agents to create complex real-world softwares☆453Oct 15, 2024Updated last year
- training BART from scratch☆12Dec 31, 2021Updated 4 years ago
- ☆26Jul 19, 2022Updated 3 years ago
- chatsnack is the easiest Python library for rapid development with OpenAI's ChatGPT API. It's an intuitive interface for creating and man…☆29Updated this week
- ☆26Nov 12, 2025Updated 4 months ago
- replacement of AdamW and Lion optimizer for LLMs☆13May 28, 2023Updated 2 years ago
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?☆12Dec 3, 2024Updated last year
- Buzz AI, aka gt-chat, is a fast and intuitive question-answering chatbot for Georgia Tech. Powered by Next.js, FastAPI, and OpenAI, it so…☆30Apr 13, 2023Updated 2 years ago
- Exploring and improving the quality of ChatGPT-generated code for LeetCode programming tasks.☆11Jan 19, 2024Updated 2 years ago
- ☆118Jul 17, 2024Updated last year
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆58Oct 27, 2023Updated 2 years ago
- GPT programs written in POWER-KI - Chat and PDF management☆19Jul 30, 2025Updated 7 months ago
- Reproduce the results of Tree-based Convolutional Neural Network (TBCNN)☆39Mar 25, 2023Updated 2 years ago
- Chronos: Zero-Shot Identification of Libraries from Vulnerability Reports (ICSE 2023, Technical Track)☆11Jul 23, 2023Updated 2 years ago
- Variable Selection Network with PyTorch☆11May 29, 2024Updated last year
- A collection of datasets for machine learning for big code☆62Oct 8, 2021Updated 4 years ago
- ☆17Aug 23, 2022Updated 3 years ago
- ☆18Jan 26, 2022Updated 4 years ago
- zero-code hyperparameters optimization framework☆14Jan 25, 2024Updated 2 years ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Oct 12, 2025Updated 5 months ago
- ☆14May 28, 2024Updated last year
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆251Dec 15, 2023Updated 2 years ago
- ☆21May 2, 2023Updated 2 years ago
- Like Duolingo, but better☆39May 5, 2023Updated 2 years ago