bigcode-project / transformers
☆26 Updated last year
Alternatives and similar repositories for transformers
Users who are interested in transformers are comparing it to the libraries listed below.
- Pre-training code for CrystalCoder 7B LLM ☆56 Updated last year
- This repository contains all the code for collecting large scale amounts of code from GitHub. ☆110 Updated 2 years ago
- The data processing pipeline for the Koala chatbot language model ☆118 Updated 2 years ago
- ☆128 Updated 2 years ago
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu… ☆55 Updated 2 years ago
- LLM finetuning ☆42 Updated 2 years ago
- ☆74 Updated 2 years ago
- Repository for analysis and experiments in the BigCode project. ☆128 Updated last year
- ☆44 Updated last year
- Reimplementation of the task generation part from the Alpaca paper ☆119 Updated 2 years ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast ☆149 Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆70 Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆43 Updated 2 years ago
- LLM-powered autonomous agent with hierarchical task management ☆50 Updated 2 years ago
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints" ☆27 Updated 2 years ago
- Minimal library to train LLMs on TPU in JAX with pjit(). ☆301 Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆111 Updated last year
- ☆85 Updated 2 years ago
- Open Implementations of LLM Analyses ☆107 Updated last year
- Smol but mighty language model ☆63 Updated 2 years ago
- ☆129 Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 Updated 2 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models ☆86 Updated 2 years ago
- Evaluating LLMs with CommonGen-Lite ☆93 Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code ☆45 Updated 2 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 Updated 2 years ago
- Fine-tune SantaCoder for Code/Text Generation. ☆194 Updated 2 years ago
- Script for downloading GitHub. ☆98 Updated last year
- ☆142 Updated 2 years ago
- Multi-Domain Expert Learning ☆67 Updated last year