[EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
β105Aug 21, 2024Updated last year
Alternatives and similar repositories for TheVault
Users that are interested in TheVault are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Commentβ15Jan 23, 2024Updated 2 years ago
- [ICLR 2025] π CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.β30Apr 21, 2025Updated last year
- Open-source Self-Instruction Tuning Code LLMβ171Apr 26, 2023Updated 3 years ago
- βοΈ Tree-sitter custom toolkit for extracting function and class from raw source fileβ52Jul 1, 2024Updated last year
- Language Model for Mainframe Modernizationβ74Aug 23, 2024Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A list of papers and resources dedicated to code generationβ21Nov 2, 2022Updated 3 years ago
- Generalist Software Agents to Solve Soware Engineering Tasksβ246Dec 10, 2024Updated last year
- β12Apr 4, 2024Updated 2 years ago
- [FORGE 2025] Predicting Program Behavior with Dynamic Dependencies Learningβ31Aug 15, 2024Updated last year
- Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".β17May 25, 2025Updated last year
- Investigation into whether Transformers and self-supervised learning could be used to trade currency marketsβ10Jun 21, 2023Updated 3 years ago
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyondβ23Apr 9, 2023Updated 3 years ago
- Official implementation of "From Implicit to Explicit Feedback: A deep neural network for modeling sequential behaviours and long-short tβ¦β19Oct 16, 2025Updated 8 months ago
- SQL autocomplete dataβ29Jun 18, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Data and code for "Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models", which accepted in TSE.β15Jul 3, 2024Updated last year
- Sythetic data generation and normalization functions powered by LLMsβ59Sep 19, 2024Updated last year
- pi + rainbowhat + touchscreen + usb sound card (mic or aux in) + open ai = audio logic anaklyzerβ12Apr 23, 2023Updated 3 years ago
- [FORGE 2025] Incorporating Agile methodology into agents to create complex real-world softwaresβ462Oct 15, 2024Updated last year
- β26Jul 19, 2022Updated 3 years ago
- chatsnack is the easiest Python library for rapid development with OpenAI's ChatGPT API. It's an intuitive interface for creating and manβ¦β29Updated this week
- β16Jun 5, 2023Updated 3 years ago
- β26Nov 12, 2025Updated 7 months ago
- A collection of recent papers, benchmarks and datasets of AI4Code domain.β60Apr 23, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Yet another LLMβ10Apr 6, 2023Updated 3 years ago
- Demonstrates how to formulate the n-queens problem as a QUBO, which we then solve using Leapβs hybrid solvers.β10Mar 3, 2026Updated 3 months ago
- replacement of AdamW and Lion optimizer for LLMsβ13May 28, 2023Updated 3 years ago
- β33Apr 23, 2023Updated 3 years ago
- Reweight GPT - a simple neural network using transformer architecture for next character predictionβ57Aug 28, 2023Updated 2 years ago
- Buzz AI, aka gt-chat, is a fast and intuitive question-answering chatbot for Georgia Tech. Powered by Next.js, FastAPI, and OpenAI, it soβ¦β30Apr 13, 2023Updated 3 years ago
- β118Jul 17, 2024Updated last year
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023β57Oct 27, 2023Updated 2 years ago
- MFEA 2 (or MFEA-II). Multifactorial Evolutionary Optimization with Online Transfer Parameter Estimation in Pythonβ40Dec 24, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GPT programs written in POWER-KI - Chat and PDF managementβ19Jul 30, 2025Updated 10 months ago
- Reproduce the results of Tree-based Convolutional Neural Network (TBCNN)β39Mar 25, 2023Updated 3 years ago
- Chronos: Zero-Shot Identification of Libraries from Vulnerability Reports (ICSE 2023, Technical Track)β11Jul 23, 2023Updated 2 years ago
- Variable Selection Network with PyTorchβ12May 29, 2024Updated 2 years ago
- Knowledge transfer from high-resource to low-resource programming languages for Code LLMsβ17Aug 12, 2025Updated 10 months ago
- A collection of datasets for machine learning for big codeβ65Oct 8, 2021Updated 4 years ago
- zero-code hyperparameters optimization frameworkβ14Jan 25, 2024Updated 2 years ago