A curated list of awesome transformer models.
☆681Apr 12, 2023Updated 3 years ago
Alternatives and similar repositories for awesome-transformers
Users that are interested in awesome-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Understanding large language models☆120Feb 24, 2023Updated 3 years ago
- Run, build, test transformer models using docker☆32May 2, 2023Updated 3 years ago
- ☆13Mar 22, 2023Updated 3 years ago
- NeMo: a toolkit for conversational AI☆12Dec 23, 2022Updated 3 years ago
- Chatbot for The Carbon Almanac book or a climate change related topic☆16Mar 6, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 8 months ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,741Jun 25, 2024Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,560Dec 26, 2023Updated 2 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,748Jan 8, 2024Updated 2 years ago
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints"☆27Feb 9, 2023Updated 3 years ago
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,991Jun 12, 2024Updated last year
- Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"☆21Jun 23, 2020Updated 5 years ago
- ☆4,487Jul 25, 2024Updated last year
- A tiny library for coding with large language models.☆1,234Jul 10, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Let ChatGPT teach your own chatbot in hours with a single GPU!☆3,161Mar 17, 2024Updated 2 years ago
- automatic sentence highlights based on their significance to the document☆196Nov 22, 2023Updated 2 years ago
- Running large language models on a single GPU for throughput-oriented scenarios.☆9,368Oct 28, 2024Updated last year
- A timeline of the latest AI models for audio generation, starting in 2023!☆1,910Jan 4, 2024Updated 2 years ago
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆30,248Jul 17, 2024Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,921Mar 14, 2024Updated 2 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- LlamaIndex is the leading document agent and OCR platform☆49,501May 15, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,480May 1, 2025Updated last year
- A collection of libraries to optimise AI model performances☆8,345Jul 22, 2024Updated last year
- Instruct-tune LLaMA on consumer hardware☆18,925Jul 29, 2024Updated last year
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,652Sep 15, 2023Updated 2 years ago
- Explanation to key concepts in ML☆8,561Jun 30, 2025Updated 10 months ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…☆37,409Aug 17, 2024Updated last year
- 🔥Highlighting the top ML papers every week.☆12,431May 11, 2026Updated last week
- 📋 A list of open LLMs available for commercial use.☆12,763Feb 13, 2025Updated last year
- Run evaluation on LLMs using human-eval benchmark☆431Sep 12, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Like picoGPT but for BERT.☆51Mar 12, 2023Updated 3 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,079Jul 1, 2025Updated 10 months ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,388Updated this week
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,019Aug 21, 2024Updated last year
- Converse with book - Built with GPT-3☆596Oct 1, 2024Updated last year
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"☆12Dec 8, 2024Updated last year
- A tiny implementation of an autonomous agent powered by LLMs (OpenAI GPT-4)☆441Apr 4, 2023Updated 3 years ago