microsoft / unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
★ 21,875 · Updated 5 months ago
Alternatives and similar repositories for unilm
Users interested in unilm often compare it to the libraries listed below.
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning; see the PEFT sketch after this list. (★ 20,215 · Updated last week)
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…); see the Accelerate sketch after this list. (★ 9,348 · Updated last week)
- Fast and memory-efficient exact attention. (★ 20,904 · Updated last week)
- Train transformer language models with reinforcement learning. (★ 16,552 · Updated this week)
- LAVIS: A One-stop Library for Language-Vision Intelligence. (★ 11,061 · Updated last year)
- Latest Advances on Multimodal Large Language Models. (★ 16,945 · Updated this week)
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. (★ 40,961 · Updated this week)
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…; see the Transformers pipeline sketch after this list. (★ 153,571 · Updated this week)
- Foundation Architecture for (M)LLMs. (★ 3,125 · Updated last year)
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)… (★ 14,203 · Updated 3 weeks ago)
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models". (★ 13,032 · Updated 11 months ago)
- QLoRA: Efficient Finetuning of Quantized LLMs. (★ 10,785 · Updated last year)
- Ongoing research training transformer models at scale. (★ 14,493 · Updated this week)
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters. (★ 5,924 · Updated last year)
- State-of-the-Art Text Embeddings; see the Sentence-Transformers sketch after this list. (★ 17,985 · Updated this week)
- CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image; see the CLIP sketch after this list. (★ 31,860 · Updated last year)
- An open source implementation of CLIP. (★ 13,089 · Updated last month)
- Instruct-tune LLaMA on consumer hardware. (★ 18,983 · Updated last year)
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes. (★ 30,561 · Updated this week)
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. (★ 24,108 · Updated last year)
- Code and documentation to train Stanford's Alpaca models, and generate the data. (★ 30,242 · Updated last year)
- Accessible large language models via k-bit quantization for PyTorch; see the bitsandbytes sketch after this list. (★ 7,801 · Updated last week)
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated). (★ 3,985 · Updated last year)
- Repo for external large-scale work. (★ 6,548 · Updated last year)
- General technology for enabling AI capabilities w/ LLMs and MLLMs. (★ 4,217 · Updated last week)
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/). (★ 25,762 · Updated last year)
- Hackable and optimized Transformers building blocks, supporting a composable construction. (★ 10,157 · Updated last week)
- Unsupervised text tokenizer for Neural Network-based text generation; see the SentencePiece sketch after this list. (★ 11,490 · Updated last week)
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. (★ 32,020 · Updated 2 months ago)
- Inference code for Llama models. (★ 58,983 · Updated 10 months ago)
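Usage sketches for selected libraries
The sketches below illustrate typical entry points for a few of the libraries listed above. They are minimal, hedged examples rather than canonical usage: model names, file paths, and hyperparameters are placeholders chosen for illustration.

PEFT: a minimal LoRA sketch, assuming the `peft` and `transformers` packages are installed. The base model (GPT-2) and the target module name (`c_attn`, GPT-2's fused query/key/value projection) are illustrative choices, not requirements.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Small base model chosen purely for illustration.
base = AutoModelForCausalLM.from_pretrained("gpt2")

config = LoraConfig(
    r=8,                        # rank of the low-rank update matrices
    lora_alpha=16,              # scaling factor applied to the update
    target_modules=["c_attn"],  # GPT-2's fused query/key/value projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()  # only the adapter weights require grad
```
The wrapped model trains like any other PyTorch module; only the injected low-rank matrices receive gradients, which is what makes the fine-tuning parameter-efficient.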
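Accelerate: a minimal training-loop sketch. The model, optimizer, and synthetic dataset are stand-ins; the point is the `prepare()`/`backward()` pattern, which handles device placement, distributed wrapping, and mixed precision according to the launch configuration.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

accelerator = Accelerator()  # reads device/distributed/AMP settings from the environment

# Stand-in model and synthetic data for illustration only.
model = torch.nn.Linear(10, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
loader = DataLoader(
    TensorDataset(torch.randn(256, 10), torch.randint(0, 2, (256,))),
    batch_size=32,
)

# prepare() moves everything to the right device(s) and wraps for DDP/AMP as needed.
model, optimizer, loader = accelerator.prepare(model, optimizer, loader)

for inputs, targets in loader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(inputs), targets)
    accelerator.backward(loss)  # replaces loss.backward()
    optimizer.step()
```
Launched with `accelerate launch script.py`, the same loop runs unchanged on one GPU, many GPUs, or under mixed precision.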
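Transformers: the `pipeline` API is the shortest path from the library to a prediction. A minimal sketch; the task choice and the default checkpoint download are illustrative.

```python
from transformers import pipeline

# Downloads a default sentiment checkpoint on first use.
classifier = pipeline("sentiment-analysis")
print(classifier("Large-scale pre-training transfers remarkably well across tasks."))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```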
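Sentence-Transformers: a minimal embedding-and-similarity sketch. The checkpoint name is a common public model, assumed here for illustration.

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = model.encode([
    "self-supervised pre-training across languages",
    "contrastive image-text learning",
])
print(util.cos_sim(embeddings[0], embeddings[1]))  # cosine similarity of the two sentences
```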
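CLIP: a zero-shot image-text matching sketch following the pattern in the repository's README; `example.jpg` and the candidate captions are placeholders.

```python
import clip
import torch
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

image = preprocess(Image.open("example.jpg")).unsqueeze(0).to(device)
text = clip.tokenize(["a photo of a cat", "a photo of a dog"]).to(device)

with torch.no_grad():
    logits_per_image, _ = model(image, text)
    probs = logits_per_image.softmax(dim=-1)

print(probs)  # how strongly each caption matches the image
```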
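bitsandbytes: quantization is most often consumed through the Transformers integration. A minimal 8-bit loading sketch, assuming a CUDA GPU; the checkpoint name is illustrative.

```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

quant_config = BitsAndBytesConfig(load_in_8bit=True)
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-1.3b",              # illustrative checkpoint
    quantization_config=quant_config,
    device_map="auto",                # let Accelerate place the quantized weights
)
```
The same pattern with `load_in_4bit=True` is the loading step that QLoRA-style fine-tuning builds on.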
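SentencePiece: a minimal train-then-encode sketch; `corpus.txt` is a placeholder plain-text file with one sentence per line, and the tiny vocabulary size is for illustration only.

```python
import sentencepiece as spm

# Train an unsupervised subword model directly from raw text.
spm.SentencePieceTrainer.train(
    input="corpus.txt", model_prefix="toy", vocab_size=1000, model_type="unigram"
)

sp = spm.SentencePieceProcessor(model_file="toy.model")
print(sp.encode("Pre-training across tasks and languages.", out_type=str))
# prints the subword pieces the trained model produces for the sentence
```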