bigcode-project / transformers
☆26Updated 10 months ago
Alternatives and similar repositories for transformers:
Users that are interested in transformers are comparing it to the libraries listed below
- Repository for analysis and experiments in the BigCode project.☆117Updated 9 months ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated 8 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- LLM finetuning☆43Updated last year
- ☆75Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆45Updated last year
- Open Implementations of LLM Analyses☆96Updated 3 months ago
- ☆44Updated 7 months ago
- This repository contains all the code for collecting large scale amounts of code from GitHub.☆105Updated last year
- Data preparation code for CrystalCoder 7B LLM☆43Updated 8 months ago
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu…☆50Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆116Updated last year
- ☆20Updated last year
- Learning to Program with Natural Language☆4Updated last year
- Just a bunch of benchmark logs for different LLMs☆116Updated 5 months ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆44Updated 3 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆100Updated last month
- Evaluating LLMs with CommonGen-Lite☆87Updated 9 months ago
- Github repo for storing LlamaDatasets☆32Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆80Updated last year
- Downloads 2020 English Wikipedia articles as plaintext☆22Updated last year
- ☆74Updated last year
- The Next Generation Multi-Modality Superintelligence☆70Updated 4 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 7 months ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆22Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated last year