Naman-ntc / FastCode
Utilities for efficient fine-tuning, inference and evaluation of code generation models
☆21Updated last year
Alternatives and similar repositories for FastCode:
Users that are interested in FastCode are comparing it to the libraries listed below
- NeurIPS 2024 tutorial on LLM Inference☆39Updated 3 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated 3 months ago
- Using FlexAttention to compute attention with different masking patterns☆42Updated 6 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- ☆75Updated last week
- ☆13Updated 7 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆17Updated last month
- CodeUltraFeedback: aligning large language models to coding preferences☆71Updated 9 months ago
- ☆32Updated 9 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆42Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆22Updated 7 months ago
- ☆23Updated 6 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆43Updated 8 months ago
- ☆27Updated last week
- ☆48Updated last year
- Commit0: Library Generation from Scratch☆140Updated this week
- ☆21Updated 5 months ago
- ☆18Updated 10 months ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 2 months ago
- Exploration of automated dataset selection approaches at large scales.☆34Updated 3 weeks ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year
- ☆60Updated 11 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated 11 months ago
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆24Updated last year
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆32Updated 5 months ago
- Repository for Skill Set Optimization☆12Updated 8 months ago
- ☆25Updated 11 months ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 6 months ago