Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆63Apr 10, 2024Updated last year
Alternatives and similar repositories for astraios
Users that are interested in astraios are comparing it to the libraries listed below
Sorting:
- ☆33Feb 15, 2026Updated last week
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions☆25Aug 8, 2024Updated last year
- Training and Benchmarking LLMs for Code Preference.☆38Nov 15, 2024Updated last year
- Code for the curation of The Stack v2 and StarCoder2 training data☆126Apr 11, 2024Updated last year
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆478Feb 5, 2025Updated last year
- Making code edting up to 7.7x faster using multi-layer speculation☆24Feb 20, 2025Updated last year
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆15Jan 9, 2025Updated last year
- [ISSTA'24] A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing☆12Jan 7, 2025Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Jan 2, 2025Updated last year
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated last year
- [ICSE'25] Specialized Fuzzing for LLVM Backend Code Generation☆21Mar 26, 2025Updated 11 months ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- Multi-Agent AI App from Scratch in python without any depedency of framework☆15Jan 7, 2025Updated last year
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆14May 10, 2025Updated 9 months ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated last month
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆184Feb 17, 2025Updated last year
- Artifact for ESEC/FSE'23 paper "NeuRI: Diversifying DNN Generation via Inductive Rule Inference"☆32Nov 13, 2023Updated 2 years ago
- System for verifying the correctness of generated Copilot programs☆17May 8, 2025Updated 9 months ago
- Useful collection of Agora resources.☆19Nov 15, 2024Updated last year
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Jul 2, 2024Updated last year
- A simple library for working with Hugging Face models.☆14Dec 30, 2024Updated last year
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Jun 15, 2023Updated 2 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Aug 19, 2025Updated 6 months ago
- ☆18Jul 24, 2023Updated 2 years ago
- ☆24Dec 11, 2024Updated last year
- Simple notebooks to learn diffusion models on toy datasets☆17Feb 9, 2023Updated 3 years ago
- ☆19Mar 25, 2025Updated 11 months ago
- [ACL 2024] Progressive LLaMA with Block Expansion.☆514May 20, 2024Updated last year
- ☆16Mar 22, 2024Updated last year
- Code release of paper "ForkMerge: Mitigating Negative Transfer in Auxiliary-Task Learning" (NeurIPS 2023)☆17Dec 30, 2023Updated 2 years ago
- ☆489Aug 15, 2024Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆323Feb 24, 2025Updated last year
- ☆83May 10, 2024Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆588Dec 9, 2024Updated last year
- ☆19Jul 24, 2023Updated 2 years ago
- A Source Code Tokenizer☆14Oct 30, 2024Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago