[EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
☆105Aug 21, 2024Updated last year
Alternatives and similar repositories for TheVault
Users that are interested in TheVault are comparing it to the libraries listed below.
- [ACL 2024] Novel reranking method to select the best solutions for code generation☆16Jun 9, 2024Updated last year
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆15Jan 23, 2024Updated 2 years ago
- [ICLR 2025 - Workshop AgenticAI Oral] Large Language Models powered Neural Solvers for Generalized Vehicle Routing Problems☆27May 29, 2025Updated 10 months ago
- Open-source Self-Instruction Tuning Code LLM☆172Apr 26, 2023Updated 2 years ago
- ⚒️ Tree-sitter custom toolkit for extracting function and class from raw source file☆51Jul 1, 2024Updated last year
- Language Model for Mainframe Modernization☆70Aug 23, 2024Updated last year
- A list of papers and resources dedicated to code generation☆20Nov 2, 2022Updated 3 years ago
- ☆12Apr 4, 2024Updated 2 years ago
- [FORGE 2025] Predicting Program Behavior with Dynamic Dependencies Learning☆28Aug 15, 2024Updated last year
- Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".☆17May 25, 2025Updated 10 months ago
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond☆23Apr 9, 2023Updated 3 years ago
- Official implementation of "From Implicit to Explicit Feedback: A deep neural network for modeling sequential behaviours and long-short t…☆19Oct 16, 2025Updated 5 months ago
- Data and code for "Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models", which was accepted in TSE.☆15Jul 3, 2024Updated last year
- Synthetic data generation and normalization functions powered by LLMs☆59Sep 19, 2024Updated last year
- pi + rainbowhat + touchscreen + usb sound card (mic or aux in) + open ai = audio logic analyzer☆12Apr 23, 2023Updated 2 years ago
- training BART from scratch☆12Dec 31, 2021Updated 4 years ago
- ☆26Jul 19, 2022Updated 3 years ago
- chatsnack is the easiest Python library for rapid development with OpenAI's ChatGPT API. It's an intuitive interface for creating and man…☆29Updated this week
- ☆16Jun 5, 2023Updated 2 years ago
- ☆26Nov 12, 2025Updated 4 months ago
- A collection of recent papers, benchmarks and datasets of AI4Code domain.☆59Apr 23, 2024Updated last year
- Yet another LLM☆10Apr 6, 2023Updated 3 years ago
- replacement of AdamW and Lion optimizer for LLMs☆13May 28, 2023Updated 2 years ago
- Reweight GPT - a simple neural network using transformer architecture for next character prediction☆56Aug 28, 2023Updated 2 years ago
- Buzz AI, aka gt-chat, is a fast and intuitive question-answering chatbot for Georgia Tech. Powered by Next.js, FastAPI, and OpenAI, it so…☆30Apr 13, 2023Updated 2 years ago
- Trials of pre-trained BERT models for the medical domain in Japanese.☆12Nov 21, 2020Updated 5 years ago
- Exploring and improving the quality of ChatGPT-generated code for LeetCode programming tasks.☆11Jan 19, 2024Updated 2 years ago
- ☆118Jul 17, 2024Updated last year
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆58Oct 27, 2023Updated 2 years ago
- MFEA 2 (or MFEA-II). Multifactorial Evolutionary Optimization with Online Transfer Parameter Estimation in Python☆40Dec 24, 2019Updated 6 years ago
- GPT programs written in POWER-KI - Chat and PDF management☆19Jul 30, 2025Updated 8 months ago
- Chronos: Zero-Shot Identification of Libraries from Vulnerability Reports (ICSE 2023, Technical Track)☆11Jul 23, 2023Updated 2 years ago
- Knowledge transfer from high-resource to low-resource programming languages for Code LLMs☆16Aug 12, 2025Updated 7 months ago
- ☆18Jan 26, 2022Updated 4 years ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Oct 12, 2025Updated 5 months ago
- ☆14May 28, 2024Updated last year
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆251Dec 15, 2023Updated 2 years ago
- ☆14Feb 2, 2023Updated 3 years ago
- Like Duolingo, but better☆39May 5, 2023Updated 2 years ago