bigcode-project / transformersLinks

☆26

Alternatives and similar repositories for transformers

Users that are interested in transformers are comparing it to the libraries listed below

Sorting:

bigcode-project / bigcode-analysis
Repository for analysis and experiments in the BigCode project.
☆118Updated last year
LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆54Updated last year
NL2Code / CodeM
☆44Updated last year
jina-ai / jerboa
LLM finetuning
☆42Updated last year
fblgit / tree-of-knowledge
ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu…
☆54Updated last year
CarperAI / Code-Pile
This repository contains all the code for collecting large scale amounts of code from GitHub.
☆108Updated 2 years ago
xingyaoww / LeTI
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆65Updated last year
togethercomputer / OpenDataHub
☆128Updated 2 years ago
EleutherAI / lm_perplexity
☆149Updated 4 years ago
argilla-io / awesome-llm-datasets
👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)
☆23Updated 2 years ago
explodinggradients / Funtuner
Supervised instruction finetuning for LLM with HF trainer and Deepspeed
☆35Updated last year
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆103Updated 8 months ago
TheDuckAI / arb
Advanced Reasoning Benchmark Dataset for LLMs
☆46Updated last year
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
SuperpoweredAI / task-tree-agent
LLM-powered autonomous agent with hierarchical task management
☆49Updated 2 years ago
VITA-Group / ChainCoder
[ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …
☆40Updated last year
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆44Updated last year
young-geng / koala_data_pipeline
The data processing pipeline for the Koala chatbot language model
☆117Updated 2 years ago
petals-infra / health.petals.dev
🏥 Health monitor for a Petals swarm
☆38Updated 10 months ago
huggingface / hf-endpoints-documentation
☆17Updated 2 weeks ago
qrdlgit / graph-of-thoughts
Based on the tree of thoughts paper
☆48Updated last year
allenai / recoma
Reasoning by Communicating with Agents
☆28Updated last month
mistralai / vllm-release
A high-throughput and memory-efficient inference and serving engine for LLMs
☆52Updated last year
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆119Updated 2 years ago
deepinfra / deepctl
Command line tool for Deep Infra cloud ML inference service
☆31Updated 11 months ago
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated last year
google-research-datasets / Attributed-QA
We believe the ability of an LLM to attribute the text that it generates is likely to be crucial for both system developers and users in …
☆54Updated last year
mzbac / AutoGPTQ-API
Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.
☆91Updated last year
nyu-mll / ILF-for-code-generation
☆75Updated 2 months ago
cg123 / rathe
Tools for formatting large language model prompts.
☆13Updated last year