A curated list of awesome transformer models.
☆682 · Apr 12, 2023 · Updated 3 years ago
Alternatives and similar repositories for awesome-transformers
Users that are interested in awesome-transformers are comparing it to the libraries listed below.
- Understanding large language models ☆120 · Feb 24, 2023 · Updated 3 years ago
- Run, build, test transformer models using docker ☆32 · May 2, 2023 · Updated 3 years ago
- ☆13 · Mar 22, 2023 · Updated 3 years ago
- NeMo: a toolkit for conversational AI ☆12 · Dec 23, 2022 · Updated 3 years ago
- Chatbot for The Carbon Almanac book or a climate change related topic ☆16 · Mar 6, 2023 · Updated 3 years ago
- Full finetuning of large language models without large memory requirements ☆94 · Sep 22, 2025 · Updated 7 months ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads ☆2,730 · Jun 25, 2024 · Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions" ☆1,558 · Dec 26, 2023 · Updated 2 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,745 · Jan 8, 2024 · Updated 2 years ago
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints" ☆27 · Feb 9, 2023 · Updated 3 years ago
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated) ☆3,988 · Jun 12, 2024 · Updated last year
- Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation" ☆21 · Jun 23, 2020 · Updated 5 years ago
- ☆4,485 · Jul 25, 2024 · Updated last year
- A tiny library for coding with large language models. ☆1,233 · Jul 10, 2024 · Updated last year
- Let ChatGPT teach your own chatbot in hours with a single GPU! ☆3,162 · Mar 17, 2024 · Updated 2 years ago
- automatic sentence highlights based on their significance to the document ☆197 · Nov 22, 2023 · Updated 2 years ago
- A timeline of the latest AI models for audio generation, starting in 2023! ☆1,911 · Jan 4, 2024 · Updated 2 years ago
- Running large language models on a single GPU for throughput-oriented scenarios. ☆9,366 · Oct 28, 2024 · Updated last year
- Code and documentation to train Stanford's Alpaca models, and generate the data. ☆30,264 · Jul 17, 2024 · Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 · Sep 26, 2023 · Updated 2 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters ☆5,928 · Mar 14, 2024 · Updated 2 years ago
- ☆12 · Sep 1, 2023 · Updated 2 years ago
- LlamaIndex is the leading document agent and OCR platform ☆48,997 · Updated this week
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM ☆1,482 · May 1, 2025 · Updated last year
- A collection of libraries to optimise AI model performances ☆8,348 · Jul 22, 2024 · Updated last year
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer ☆1,645 · Sep 15, 2023 · Updated 2 years ago
- Instruct-tune LLaMA on consumer hardware ☆18,945 · Jul 29, 2024 · Updated last year
- Explanation to key concepts in ML ☆8,553 · Jun 30, 2025 · Updated 10 months ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical… ☆37,417 · Aug 17, 2024 · Updated last year
- 🔥Highlighting the top ML papers every week. ☆12,347 · Apr 21, 2026 · Updated last week
- 📋 A list of open LLMs available for commercial use. ☆12,739 · Feb 13, 2025 · Updated last year
- Run evaluation on LLMs using human-eval benchmark ☆430 · Sep 12, 2023 · Updated 2 years ago
- Like picoGPT but for BERT. ☆51 · Mar 12, 2023 · Updated 3 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,084 · Jul 1, 2025 · Updated 10 months ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs ☆4,362 · Updated this week
- Fast & Simple repository for pre-training and fine-tuning T5-style models ☆1,018 · Aug 21, 2024 · Updated last year
- Converse with book - Built with GPT-3 ☆596 · Oct 1, 2024 · Updated last year
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset" ☆12 · Dec 8, 2024 · Updated last year
- A tiny implementation of an autonomous agent powered by LLMs (OpenAI GPT-4) ☆441 · Apr 4, 2023 · Updated 3 years ago