bigcode-project / transformers
☆26 Updated last year
Alternatives and similar repositories for transformers
Users who are interested in transformers are comparing it to the libraries listed below.
- Repository for analysis and experiments in the BigCode project. ☆118 Updated last year
- Pre-training code for CrystalCoder 7B LLM ☆54 Updated last year
- ☆44 Updated last year
- LLM finetuning ☆42 Updated last year
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu… ☆54 Updated last year
- This repository contains all the code for collecting large scale amounts of code from GitHub. ☆108 Updated 2 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆65 Updated last year
- ☆128 Updated 2 years ago
- ☆149 Updated 4 years ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated) ☆23 Updated 2 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆35 Updated last year
- Open Implementations of LLM Analyses ☆103 Updated 8 months ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆46 Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated last year
- LLM-powered autonomous agent with hierarchical task management ☆49 Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 Updated last year
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated last year
- The data processing pipeline for the Koala chatbot language model ☆117 Updated 2 years ago
- 🏥 Health monitor for a Petals swarm ☆38 Updated 10 months ago
- ☆17 Updated 2 weeks ago
- Based on the tree of thoughts paper ☆48 Updated last year
- Reasoning by Communicating with Agents ☆28 Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆52 Updated last year
- Reimplementation of the task generation part from the Alpaca paper ☆119 Updated 2 years ago
- Command line tool for Deep Infra cloud ML inference service ☆31 Updated 11 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 Updated last year
- We believe the ability of an LLM to attribute the text that it generates is likely to be crucial for both system developers and users in … ☆54 Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API. ☆91 Updated last year
- ☆75 Updated 2 months ago
- Tools for formatting large language model prompts. ☆13 Updated last year