[EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
☆106Aug 21, 2024Updated last year
Alternatives and similar repositories for TheVault
Users that are interested in TheVault are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…☆44Jan 8, 2026Updated 3 months ago
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆16Jan 23, 2024Updated 2 years ago
- [ICLR 2025 - Workshop AgenticAI Oral] Large Language Models powered Neural Solvers for Generalized Vehicle Routing Problems☆27May 29, 2025Updated 11 months ago
- Open-source Self-Instruction Tuning Code LLM☆171Apr 26, 2023Updated 3 years ago
- ⚒️ Tree-sitter custom toolkit for extracting function and class from raw source file☆51Jul 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Language Model for Mainframe Modernization☆71Aug 23, 2024Updated last year
- A list of papers and resources dedicated to code generation☆21Nov 2, 2022Updated 3 years ago
- Generalist Software Agents to Solve Soware Engineering Tasks☆242Dec 10, 2024Updated last year
- ☆12Apr 4, 2024Updated 2 years ago
- We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…☆26Aug 31, 2022Updated 3 years ago
- [FORGE 2025] Predicting Program Behavior with Dynamic Dependencies Learning☆30Aug 15, 2024Updated last year
- Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".☆17May 25, 2025Updated 11 months ago
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond☆23Apr 9, 2023Updated 3 years ago
- AutoPruner: Transformer-based Call Graph Pruning (ESEC/FSE 2022, Research Track)☆22Dec 7, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official implementation of "From Implicit to Explicit Feedback: A deep neural network for modeling sequential behaviours and long-short t…☆19Oct 16, 2025Updated 6 months ago
- SQL autocomplete data☆29Jun 18, 2024Updated last year
- Data and code for "Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models", which accepted in TSE.☆15Jul 3, 2024Updated last year
- Sythetic data generation and normalization functions powered by LLMs☆59Sep 19, 2024Updated last year
- pi + rainbowhat + touchscreen + usb sound card (mic or aux in) + open ai = audio logic anaklyzer☆12Apr 23, 2023Updated 3 years ago
- [FORGE 2025] Incorporating Agile methodology into agents to create complex real-world softwares☆457Oct 15, 2024Updated last year
- training BART from scratch☆12Dec 31, 2021Updated 4 years ago
- ☆26Jul 19, 2022Updated 3 years ago
- chatsnack is the easiest Python library for rapid development with OpenAI's ChatGPT API. It's an intuitive interface for creating and man…☆29Apr 14, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆26Nov 12, 2025Updated 5 months ago
- A collection of recent papers, benchmarks and datasets of AI4Code domain.☆59Apr 23, 2024Updated 2 years ago
- Demonstrates how to formulate the n-queens problem as a QUBO, which we then solve using Leap’s hybrid solvers.☆10Mar 3, 2026Updated last month
- replacement of AdamW and Lion optimizer for LLMs☆13May 28, 2023Updated 2 years ago
- Reweight GPT - a simple neural network using transformer architecture for next character prediction☆56Aug 28, 2023Updated 2 years ago
- Exploring and improving the quality of ChatGPT-generated code for LeetCode programming tasks.☆11Jan 19, 2024Updated 2 years ago
- ☆118Jul 17, 2024Updated last year
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.☆17Feb 16, 2026Updated 2 months ago
- GPT programs written in POWER-KI - Chat and PDF management☆19Jul 30, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Reproduce the results of Tree-based Convolutional Neural Network (TBCNN)☆39Mar 25, 2023Updated 3 years ago
- Variable Selection Network with PyTorch☆11May 29, 2024Updated last year
- Knowledge transfer from high-resource to low-resource programming languages for Code LLMs☆16Aug 12, 2025Updated 8 months ago
- A collection of datasets for machine learning for big code☆65Oct 8, 2021Updated 4 years ago
- zero-code hyperparameters optimization framework☆14Jan 25, 2024Updated 2 years ago
- ☆14May 28, 2024Updated last year
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆251Dec 15, 2023Updated 2 years ago