LearningOpt / pie
☆55Updated last year
Alternatives and similar repositories for pie
Users that are interested in pie are comparing it to the libraries listed below
- Training language models to make programs faster☆97Updated last year
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆161Updated last year
- SatLM: SATisfiability-Aided Language Models using Declarative Prompting (NeurIPS 2023)☆51Updated last year
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"☆118Updated last year
- Automatic DNN generation for fuzzing and more☆140Updated 11 months ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Updated 2 years ago
- ☆21Updated 3 years ago
- [ICML '24] R2E: Turn any GitHub Repository into a Programming Agent Environment☆136Updated 7 months ago
- Implementation of IR2Vec, LLVM IR Based Scalable Program Embeddings☆98Updated 6 months ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Updated last year
- [NeurIPS '25] Challenging Software Optimization Tasks for Evaluating SWE-Agents☆58Updated 2 weeks ago
- Utilities for constructing a large dataset of LLVM IR☆25Updated 6 months ago
- [FSE-2024] Towards AI-Assisted Synthesis of Verified Dafny Methods☆54Updated last year
- ☆111Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆164Updated 3 months ago
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval☆87Updated last year
- EvoEval: Evolving Coding Benchmarks via LLM☆80Updated last year
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators☆98Updated 6 months ago
- DafnyBench: A Benchmark for Formal Software Verification☆50Updated last year
- Benchmark ClassEval for class-level code generation.☆145Updated last year
- ☆120Updated last year
- Tzer: TVM Implementation of "Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation (OOPSLA'22)".☆70Updated 2 years ago
- Making code editing up to 7.7x faster using multi-layer speculation☆24Updated 9 months ago
- Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In P…☆51Updated 8 months ago
- The first large scale formally verified reasoning dataset for Verilog☆17Updated 6 months ago
- A list of awesome neural symbolic papers.☆50Updated 3 years ago
- RepoQA: Evaluating Long-Context Code Understanding☆125Updated last year
- Pip compatible CodeBLEU metric implementation available for linux/macos/win☆126Updated 8 months ago
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization☆40Updated 9 months ago
- ☆13Updated 2 years ago