XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts
☆35 · Jul 2, 2024 · Updated last year
Alternatives and similar repositories for xft
Users who are interested in xft are comparing it to the repositories listed below.
- Fast and Precise On-the-fly Patch Validation for All ☆10 · Feb 24, 2023 · Updated 3 years ago
- The First International Workshop on Large Language Model for Code 2024 (Co-Located with ICSE 2024) ☆17 · Oct 4, 2024 · Updated last year
- Artifact for ESEC/FSE'23 paper "NeuRI: Diversifying DNN Generation via Inductive Rule Inference" ☆32 · Nov 13, 2023 · Updated 2 years ago
- WhiteFox: White-Box Compiler Fuzzing Empowered by Large Language Models (OOPSLA 2024) ☆78 · Aug 5, 2025 · Updated 6 months ago
- Collect simple coverage information in memory. ☆11 · Oct 6, 2022 · Updated 3 years ago
- RepoQA: Evaluating Long-Context Code Understanding ☆128 · Nov 1, 2024 · Updated last year
- Training and Benchmarking LLMs for Code Preference. ☆38 · Nov 15, 2024 · Updated last year
- EvoEval: Evolving Coding Benchmarks via LLM ☆81 · Apr 6, 2024 · Updated last year
- ☆25 · Dec 12, 2025 · Updated 2 months ago
- ☆16 · Jan 23, 2026 · Updated last month
- Artifact for TOSEM Submission: GiantRepair ☆13 · Jun 26, 2024 · Updated last year
- Official PyTorch implementation of CD-MOE ☆12 · Mar 29, 2025 · Updated 11 months ago
- Benchmark ClassEval for class-level code generation. ☆145 · Oct 24, 2024 · Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation ☆323 · Feb 24, 2025 · Updated last year
- Our EMNLP 2022 paper on VIP-Based Prompting for Parameter-Efficient Learning ☆10 · Oct 22, 2022 · Updated 3 years ago
- ☆10 · Oct 28, 2024 · Updated last year
- Fuzzing Automatic Differentiation in Deep-Learning Libraries (ICSE'23) ☆27 · Mar 2, 2024 · Updated 2 years ago
- Free Lunch for Testing: Fuzzing Deep-Learning Libraries from Open Source (ICSE'22) ☆82 · Nov 2, 2022 · Updated 3 years ago
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks ☆17 · Jan 15, 2025 · Updated last year
- ☆15 · Apr 2, 2025 · Updated 11 months ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304 ☆14 · Oct 11, 2022 · Updated 3 years ago
- Automated DNN generation for fuzz testing and more ☆143 · Jan 14, 2025 · Updated last year
- Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025] ☆56 · Jan 27, 2026 · Updated last month
- The official Implementation for TKDE paper "Individual and Structural Graph Information Bottlenecks for Out-of-Distribution Generalizatio… ☆14 · Aug 6, 2023 · Updated 2 years ago
- Repilot, a patch generation tool introduced in the ESEC/FSE'23 paper "Copiloting the Copilots: Fusing Large Language Models with Completi… ☆136 · Oct 9, 2023 · Updated 2 years ago
- ☆44 · May 6, 2025 · Updated 9 months ago
- The official implementation of the DAC 2024 paper GQA-LUT ☆20 · Dec 20, 2024 · Updated last year
- Fuzzing Deep-Learning Libraries via Automated Relational API Inference (ESEC/FSE 2022) ☆40 · May 17, 2023 · Updated 2 years ago
- Personal blog + reading notes on system-ish papers ☆15 · Oct 29, 2023 · Updated 2 years ago
- ☆19 · Nov 12, 2025 · Updated 3 months ago
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression." ☆18 · Dec 13, 2024 · Updated last year
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization ☆25 · Oct 5, 2025 · Updated 4 months ago
- Adaptation of titans-pytorch to llama models on HF ☆26 · Mar 6, 2025 · Updated 11 months ago
- ☆22 · Feb 29, 2024 · Updated 2 years ago
- This repo is for our submission for ICSE 2025. ☆20 · Jun 12, 2024 · Updated last year
- Reproducing R1 for Code with Reliable Rewards ☆290 · May 5, 2025 · Updated 9 months ago
- ☆84 · Nov 10, 2025 · Updated 3 months ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles". ☆24 · Mar 25, 2025 · Updated 11 months ago
- ☆91 · Sep 10, 2023 · Updated 2 years ago