A curated list of awesome transformer models.
☆683Apr 12, 2023Updated 3 years ago
Alternatives and similar repositories for awesome-transformers
Users that are interested in awesome-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Understanding large language models☆120Feb 24, 2023Updated 3 years ago
- Run, build, test transformer models using docker☆32May 2, 2023Updated 3 years ago
- NeMo: a toolkit for conversational AI☆12Dec 23, 2022Updated 3 years ago
- Chatbot for The Carbon Almanac book or a climate change related topic☆16Mar 6, 2023Updated 3 years ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,748Jun 25, 2024Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,560Dec 26, 2023Updated 2 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,750Jan 8, 2024Updated 2 years ago
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints"☆27Feb 9, 2023Updated 3 years ago
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,989Jun 12, 2024Updated 2 years ago
- ☆4,490Jul 25, 2024Updated last year
- A tiny library for coding with large language models.☆1,233Jul 10, 2024Updated last year
- Let ChatGPT teach your own chatbot in hours with a single GPU!☆3,157Mar 17, 2024Updated 2 years ago
- automatic sentence highlights based on their significance to the document☆196Nov 22, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Running large language models on a single GPU for throughput-oriented scenarios.☆9,365Oct 28, 2024Updated last year
- A timeline of the latest AI models for audio generation, starting in 2023!☆1,910Jan 4, 2024Updated 2 years ago
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆30,248Jul 17, 2024Updated last year
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,921Mar 14, 2024Updated 2 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- LlamaIndex is the leading document agent and OCR platform☆50,073Updated this week
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,477May 1, 2025Updated last year
- A collection of libraries to optimise AI model performances☆8,337Jul 22, 2024Updated last year
- Instruct-tune LLaMA on consumer hardware☆18,916Jul 29, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,661Sep 15, 2023Updated 2 years ago
- Explanation to key concepts in ML☆8,568Jun 30, 2025Updated 11 months ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…☆37,399Aug 17, 2024Updated last year
- 🔥Highlighting the top ML papers every week.☆12,532Updated this week
- 📋 A list of open LLMs available for commercial use.☆12,797Feb 13, 2025Updated last year
- Run evaluation on LLMs using human-eval benchmark☆429Sep 12, 2023Updated 2 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,079Jul 1, 2025Updated 11 months ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,409Updated this week
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,019Aug 21, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Converse with book - Built with GPT-3☆596Oct 1, 2024Updated last year
- A tiny implementation of an autonomous agent powered by LLMs (OpenAI GPT-4)☆441Apr 4, 2023Updated 3 years ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆822May 6, 2023Updated 3 years ago
- Foundation Architecture for (M)LLMs☆3,132Apr 11, 2024Updated 2 years ago
- A bagel, with everything.☆326Apr 11, 2024Updated 2 years ago
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.☆589Oct 3, 2023Updated 2 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,147Jan 23, 2026Updated 4 months ago