bigcode-project / selfcodealign
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
β268Updated 2 weeks ago
Related projects β
Alternatives and complementary repositories for selfcodealign
- π OctoPack: Instruction Tuning Code Large Language Modelsβ435Updated last month
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"β293Updated 11 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"β212Updated last month
- Run evaluation on LLMs using human-eval benchmarkβ379Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"β448Updated 8 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)β199Updated 6 months ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward expβ¦β216Updated 7 months ago
- β86Updated last year
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answeβ¦β142Updated 9 months ago
- The official evaluation suite and dynamic data release for MixEval.β224Updated last week
- RepoQA: Evaluating Long-Context Code Understandingβ100Updated 2 weeks ago
- Multipack distributed sampler for fast padding-free training of LLMsβ178Updated 3 months ago
- Benchmarking LLMs with Challenging Tasks from Real Usersβ195Updated 2 weeks ago
- β295Updated 5 months ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".β262Updated last year
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.β307Updated 7 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluationβ114Updated last month
- Implementation of paper Data Engineering for Scaling Language Models to 128K Contextβ438Updated 8 months ago
- Expert Specialized Fine-Tuningβ145Updated last month
- β¨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024β133Updated 3 months ago
- β184Updated last month
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"β358Updated last month
- β146Updated 3 months ago
- A pipeline to improve skills of large language modelsβ191Updated this week
- The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]β129Updated this week
- Open Source WizardCoder Datasetβ153Updated last year
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.β408Updated 2 months ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)β122Updated 3 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"β113Updated 5 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clustersβ104Updated last month