A curated list of awesome transformer models.
☆680Apr 12, 2023Updated 2 years ago
Alternatives and similar repositories for awesome-transformers
Users that are interested in awesome-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Understanding large language models☆120Feb 24, 2023Updated 3 years ago
- Run, build, test transformer models using docker☆32May 2, 2023Updated 2 years ago
- ☆13Mar 22, 2023Updated 3 years ago
- NeMo: a toolkit for conversational AI☆12Dec 23, 2022Updated 3 years ago
- Chatbot for The Carbon Almanac book or a climate change related topic☆16Mar 6, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 6 months ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,720Jun 25, 2024Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,559Dec 26, 2023Updated 2 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,743Jan 8, 2024Updated 2 years ago
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints"☆27Feb 9, 2023Updated 3 years ago
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,985Jun 12, 2024Updated last year
- ☆4,487Jul 25, 2024Updated last year
- A tiny library for coding with large language models.☆1,233Jul 10, 2024Updated last year
- Let ChatGPT teach your own chatbot in hours with a single GPU!☆3,163Mar 17, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- automatic sentence highlights based on their significance to the document☆197Nov 22, 2023Updated 2 years ago
- Running large language models on a single GPU for throughput-oriented scenarios.☆9,376Oct 28, 2024Updated last year
- A timeline of the latest AI models for audio generation, starting in 2023!☆1,911Jan 4, 2024Updated 2 years ago
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆30,253Jul 17, 2024Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,928Mar 14, 2024Updated 2 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- LlamaIndex is the leading document agent and OCR platform☆48,389Updated this week
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,480May 1, 2025Updated 11 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A collection of libraries to optimise AI model performances☆8,349Jul 22, 2024Updated last year
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,636Sep 15, 2023Updated 2 years ago
- Instruct-tune LLaMA on consumer hardware☆18,950Jul 29, 2024Updated last year
- Explanation to key concepts in ML☆8,541Jun 30, 2025Updated 9 months ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…☆37,425Aug 17, 2024Updated last year
- 🔥Highlighting the top ML papers every week.☆12,272Jul 20, 2025Updated 8 months ago
- 📋 A list of open LLMs available for commercial use.☆12,705Feb 13, 2025Updated last year
- Run evaluation on LLMs using human-eval benchmark☆430Sep 12, 2023Updated 2 years ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,329Updated this week
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,080Jul 1, 2025Updated 9 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,017Aug 21, 2024Updated last year
- Converse with book - Built with GPT-3☆596Oct 1, 2024Updated last year
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"☆12Dec 8, 2024Updated last year
- A tiny implementation of an autonomous agent powered by LLMs (OpenAI GPT-4)☆443Apr 4, 2023Updated 3 years ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆822May 6, 2023Updated 2 years ago
- Foundation Architecture for (M)LLMs☆3,133Apr 11, 2024Updated last year