Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆63Apr 10, 2024Updated 2 years ago
Alternatives and similar repositories for astraios
Users that are interested in astraios are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆34Mar 21, 2026Updated 3 months ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions☆26Aug 8, 2024Updated last year
- Making code edting up to 7.7x faster using multi-layer speculation☆24Feb 20, 2025Updated last year
- Training and Benchmarking LLMs for Code Preference.☆38Nov 15, 2024Updated last year
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆478Feb 5, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official implementation of our ICSE 2023 paper on Automatic Code Generation.☆27Nov 8, 2023Updated 2 years ago
- Code for the curation of The Stack v2 and StarCoder2 training data☆136Apr 11, 2024Updated 2 years ago
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆193Feb 17, 2025Updated last year
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆36Jul 2, 2024Updated last year
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Small, simple agent task environments for training and evaluation☆20Nov 1, 2024Updated last year
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"☆16Oct 24, 2022Updated 3 years ago
- Directed masked autoencoders☆15Mar 25, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Jun 15, 2023Updated 3 years ago
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆15Jan 23, 2024Updated 2 years ago
- System for verifying the correctness of generated Copilot programs☆19May 8, 2025Updated last year
- WhiteFox: White-Box Compiler Fuzzing Empowered by Large Language Models (OOPSLA 2024)☆83Aug 5, 2025Updated 10 months ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 5 months ago
- [ISSTA'24] A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing☆12Jan 7, 2025Updated last year
- [ICSE'25] Specialized Fuzzing for LLVM Backend Code Generation☆22Mar 26, 2025Updated last year
- [ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain☆11Nov 24, 2025Updated 7 months ago
- Mining tool and large-scale datasets of single statement bug fixes in Python☆19Nov 29, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆86May 10, 2024Updated 2 years ago
- Data sets and ML models versioning example from DVC get started☆10Jun 4, 2024Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆322Feb 24, 2025Updated last year
- Accepted by Transactions on Machine Learning Research (TMLR)☆135Oct 5, 2024Updated last year
- [ACL 2024] Progressive LLaMA with Block Expansion.☆513May 20, 2024Updated 2 years ago
- Replication package for the SANER 2023 paper titled "Extending Source Code Pre-Trained Language Models to Summarise Decompiled Binaries"☆17Jul 8, 2024Updated last year
- Self-Alignment with Principle-Following Reward Models☆170Sep 18, 2025Updated 9 months ago
- ☆495Aug 15, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PyTorch使用技巧和教程☆12Apr 17, 2023Updated 3 years ago
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"☆116Jan 9, 2024Updated 2 years ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- Probing pre-trained source code models☆15Apr 27, 2022Updated 4 years ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Jan 2, 2025Updated last year
- EvoEval: Evolving Coding Benchmarks via LLM☆84Apr 6, 2024Updated 2 years ago
- ☆20Apr 26, 2026Updated 2 months ago