VHellendoorn / Code-LMs
Guide to using pre-trained large language models of source code
☆1,807Updated 7 months ago
Alternatives and similar repositories for Code-LMs:
Users who are interested in Code-LMs are comparing it to the repositories listed below
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation☆2,891Updated last year
- CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.☆5,002Updated 2 weeks ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,466Updated 3 weeks ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,456Updated 6 months ago
- Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57☆3,300Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries☆7,101Updated this week
- Home of StarCoder: fine-tuning & inference!☆7,356Updated 11 months ago
- CodeXGLUE☆1,605Updated 9 months ago
- Code Generation using GPT-J!☆515Updated 2 years ago
- ☆1,452Updated last year
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,378Updated 2 months ago
- Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d…☆726Updated 11 months ago
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.☆4,649Updated 2 months ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,842Updated last year
- 4 bits quantization of LLaMA using GPTQ☆3,036Updated 7 months ago
- ☆626Updated 3 months ago
- ☆9,019Updated 10 months ago
- Alpaca dataset from Stanford, cleaned and curated☆1,537Updated last year
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,864Updated 8 months ago
- Generative model for code infilling and synthesis☆299Updated last year
- Quantized inference code for LLaMA models☆1,052Updated last year
- Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU☆3,820Updated last year
- An open-source implementation of Google's PaLM models☆818Updated 7 months ago
- Minimal library to train LLMs on TPU in JAX with pjit().☆280Updated last year
- Agent techniques to augment your LLM and push it beyond its limits☆1,567Updated 8 months ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,623Updated last year
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆515Updated 3 weeks ago
- A program that provides LLMs with the ability to complete complex tasks using plugins.☆1,756Updated 10 months ago
- Let ChatGPT teach your own chatbot in hours with a single GPU!☆3,164Updated 11 months ago
- Dromedary: towards helpful, ethical and reliable LLMs.☆1,137Updated last year