Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face π€ Transformers.
β580Mar 11, 2026Updated 2 weeks ago
Alternatives and similar repositories for mistral
Users that are interested in mistral are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhiβ¦β49Apr 26, 2021Updated 4 years ago
- β22Aug 31, 2021Updated 4 years ago
- β11Jan 2, 2022Updated 4 years ago
- Exploring Few-Shot Adaptation of Language Models with Tablesβ24Aug 22, 2022Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixingβ49Jan 27, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- jiant is an nlp toolkitβ1,674Jul 6, 2023Updated 2 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deploymentβ791Apr 24, 2023Updated 2 years ago
- Fast, general, and tested differentiable structured prediction in PyTorchβ1,124Apr 20, 2022Updated 3 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)β102Nov 2, 2020Updated 5 years ago
- Library for Knowledge Intensive Language Tasksβ971Mar 31, 2022Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed librariesβ7,404Feb 3, 2026Updated last month
- Tutorial to pretrain & fine-tune a π€ Flax T5 model on a TPUv3-8 with GCPβ58Jul 28, 2022Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weightsβ19Oct 9, 2022Updated 3 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.β48Nov 30, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- β221Jun 8, 2020Updated 5 years ago
- β2,956Mar 9, 2026Updated 2 weeks ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"β456Sep 6, 2023Updated 2 years ago
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language modelsβ3,220Jul 19, 2024Updated last year
- OSLO: Open Source framework for Large-scale model Optimizationβ309Aug 25, 2022Updated 3 years ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckListβ2,049Jan 9, 2024Updated 2 years ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)β464Nov 5, 2022Updated 3 years ago
- NL-Augmenter π¦ β π A Collaborative Repository of Natural Language Transformationsβ786May 19, 2024Updated last year
- β92Sep 29, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Toolkit for creating, sharing and using natural language prompts.β3,007Oct 23, 2023Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.oβ¦β606Jun 15, 2022Updated 3 years ago
- Silly twitter torch implementations.β46Oct 14, 2022Updated 3 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"β1,626Jun 12, 2023Updated 2 years ago
- β99Jul 25, 2023Updated 2 years ago
- FastFormers - highly efficient transformer models for NLUβ709Mar 21, 2025Updated last year
- β75Jul 2, 2021Updated 4 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".β80Jun 3, 2021Updated 4 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/pβ¦β433Aug 17, 2022Updated 3 years ago
- NordVPN Special Discount Offer β’ AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answeringβ175Jun 6, 2021Updated 4 years ago
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,580Mar 23, 2026Updated last week
- Meta Representation Transformation for Low-resource Cross-lingual Learningβ41May 5, 2021Updated 4 years ago
- Expanding natural instructionsβ1,038Dec 11, 2023Updated 2 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for π€ Hugging Face transformer models πβ1,688Oct 23, 2024Updated last year
- LAnguage Model Analysisβ1,389Jul 7, 2024Updated last year
- Task-based datasets, preprocessing, and evaluation for sequence models.β594Updated this week