⚒️ Custom tree-sitter toolkit for extracting functions and classes from raw source files
☆51Jul 1, 2024Updated last year
Alternatives and similar repositories for CodeText-parser
Users that are interested in CodeText-parser are comparing it to the libraries listed below
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆15Jan 23, 2024Updated 2 years ago
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…☆43Jan 8, 2026Updated 2 months ago
- [ACL 2024] Novel reranking method to select the best solutions for code generation☆16Jun 9, 2024Updated last year
- Dataset and Code for NeurIPS 2023 paper "Language-driven Scene Synthesis using Multi-conditional Diffusion Model."☆48Aug 8, 2024Updated last year
- Custom ML tracking experiment and debugging tools.☆15Aug 2, 2022Updated 3 years ago
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.☆29Apr 21, 2025Updated 10 months ago
- [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation☆105Aug 21, 2024Updated last year
- ☆26Jul 19, 2022Updated 3 years ago
- Notes on some of Git's underlying internals, including objects, data formats, transfer protocols, I/O performance, and low-level subcommands.☆11Nov 29, 2022Updated 3 years ago
- Cross-Domain Deep Code Search with Few-Shot Learning☆11Jul 5, 2023Updated 2 years ago
- Generalist Software Agents to Solve Software Engineering Tasks☆236Dec 10, 2024Updated last year
- [FORGE 2025] Predicting Program Behavior with Dynamic Dependencies Learning☆28Aug 15, 2024Updated last year
- ☆44Jun 24, 2025Updated 8 months ago
- [ICLR 2025 - Workshop AgenticAI Oral] Large Language Models powered Neural Solvers for Generalized Vehicle Routing Problems☆27May 29, 2025Updated 9 months ago
- Open-source Self-Instruction Tuning Code LLM☆172Apr 26, 2023Updated 2 years ago
- ☆12Nov 14, 2021Updated 4 years ago
- ☆29Oct 29, 2022Updated 3 years ago
- ☆10Feb 3, 2021Updated 5 years ago
- ☆15Jan 24, 2023Updated 3 years ago
- ☆13Apr 26, 2023Updated 2 years ago
- ☆15Jun 18, 2024Updated last year
- ☆28Oct 28, 2023Updated 2 years ago
- A collection of recent papers, benchmarks and datasets of AI4Code domain.☆59Apr 23, 2024Updated last year
- This repo illustrates how to evaluate the artifacts in the paper An Extensive Study on Pre-trained Models for Program Understanding and G…☆27Aug 12, 2022Updated 3 years ago
- ☆16Nov 26, 2024Updated last year
- Adversarial Robustness for Code☆16Mar 30, 2021Updated 4 years ago
- Baselines for all tasks from Long Code Arena benchmarks 🏟️☆39Mar 30, 2025Updated 11 months ago
- Learning from what we know: How to perform vulnerability prediction using noisy historical data, Empirical Software Engineering (EMSE)☆14Sep 20, 2023Updated 2 years ago
- Stochastic Multiple Target Sampling Gradient Descent (NeurIPS 2022)☆13Sep 19, 2022Updated 3 years ago
- Learning graph-based code representations for source-level functional similarity detection. ICSE'23☆63Mar 27, 2023Updated 2 years ago
- Coverage-Guided Testing of Long Short-Term Memory (LSTM) Networks☆18Dec 15, 2020Updated 5 years ago
- Generating Adversarial Examples for Holding Robustness of Source Code Processing Models☆15Dec 2, 2021Updated 4 years ago
- [FORGE 2025] Incorporating Agile methodology into agents to create complex real-world software☆453Oct 15, 2024Updated last year
- Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".☆17May 25, 2025Updated 9 months ago
- Extract and combine multiple source code views using tree-sitter☆157Sep 17, 2025Updated 6 months ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- Code release for RobOT (ICSE'21)☆15Dec 5, 2022Updated 3 years ago
- Supplementary Material for Non-binary Deep Transfer Learning for Image Classification☆18Jul 22, 2021Updated 4 years ago
- Source Code Data Augmentation for Deep Learning: A Survey.☆66Jun 15, 2024Updated last year