argilla-io/notus

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/argilla-io/notus)

argilla-io / notus

Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach

☆168

Alternatives and similar repositories for notus

Users that are interested in notus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alvarobartt / vertex-ai-huggingface-inference-toolkit
View on GitHub
🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)
☆17Mar 20, 2024Updated 2 years ago
gabrielmbmb / candle-holder
View on GitHub
A Rust crate offering similar functionality to the Python transformers package using Candle.
☆15Nov 19, 2024Updated last year
huggingface / data-is-better-together
View on GitHub
Let's build better datasets, together!
☆274Jun 9, 2026Updated last month
mlabonne / llm-autoeval
View on GitHub
Automatically evaluate your LLMs in Google Colab
☆695May 7, 2024Updated 2 years ago
argilla-io / distilabel
View on GitHub
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆3,344Updated this week
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
nbroad1881 / strideformer
View on GitHub
Using short models to classify long texts
☆21Mar 8, 2023Updated 3 years ago
huggingface / alignment-handbook
View on GitHub
Robust recipes to align language models with human and AI preferences
☆5,643May 26, 2026Updated last month
argilla-io / distilabel-spin-dibt
View on GitHub
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Mar 12, 2024Updated 2 years ago
Helw150 / levanter
View on GitHub
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆16Jun 16, 2024Updated 2 years ago
aigeek0x0 / radiantloom-email-assist-7b
View on GitHub
Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…
☆14Jan 19, 2024Updated 2 years ago
austinsilveria / tricksy
View on GitHub
Fast approximate inference on a single GPU with sparsity aware offloading
☆38Jan 4, 2024Updated 2 years ago
argilla-io / argilla
View on GitHub
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
☆5,048Updated this week
OpenBMB / UltraFeedback
View on GitHub
A large-scale, fine-grained, diverse preference dataset (and models).
☆368Dec 29, 2023Updated 2 years ago
camenduru / champ-jupyter
View on GitHub
☆12Mar 25, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
xaiguy / chippy
View on GitHub
☆13Feb 26, 2023Updated 3 years ago
alvarobartt / opentrain
View on GitHub
🚂 Fine-tune OpenAI models for text classification, question answering, and more
☆17May 1, 2023Updated 3 years ago
hamishivi / EasyLM
View on GitHub
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆78Aug 17, 2024Updated last year
alvarobartt / understanding-resnet
View on GitHub
🧠 ResNet: Deep Residual Learning for Image Recognition
☆10Sep 18, 2021Updated 4 years ago
kevinwu23 / StanfordFineTuneBench
View on GitHub
☆32Nov 14, 2024Updated last year
togethercomputer / Llama-2-7B-32K-Instruct
View on GitHub
☆84Aug 18, 2023Updated 2 years ago
myshell-ai / JetMoE
View on GitHub
Reaching LLaMA2 Performance with 0.1M Dollars
☆985Jul 23, 2024Updated 2 years ago
neavo / KeywordGachaModel
View on GitHub
☆17Jan 31, 2025Updated last year
alvarobartt / ml-monitoring-with-wandb
View on GitHub
Monitoring a PyTorch Lightning CNN with Weights & Biases
☆15Jul 26, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
satpalsr / personalgpt
View on GitHub
Opensource, personal & local chat interface for language models.
☆13Jun 24, 2024Updated 2 years ago
databricks / megablocks
View on GitHub
☆1,582Mar 25, 2026Updated 4 months ago
camenduru / video-dubbing-colab
View on GitHub
☆16Sep 30, 2023Updated 2 years ago
bespokelabsai / verifiers
View on GitHub
Verifiers for LLM Reinforcement Learning
☆81Jul 17, 2026Updated last week
lm-sys / llm-decontaminator
View on GitHub
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆325Dec 20, 2023Updated 2 years ago
huggingface / datatrove
View on GitHub
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
☆3,220Updated this week
SkyworkAI / Skywork-MoE
View on GitHub
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
☆140Jun 12, 2024Updated 2 years ago
leonjovanovic / keywords-extraction
View on GitHub
Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT.
☆12May 22, 2023Updated 3 years ago
predibase / lorax
View on GitHub
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,820May 28, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
hrishioa / tough-llm-tests
View on GitHub
Some tough questions to test new models.
☆28Apr 20, 2024Updated 2 years ago
philschmid / terraform-aws-llm-sagemaker
View on GitHub
☆20Aug 5, 2024Updated last year
alvarobartt / simpsons-mnist
View on GitHub
A small MNIST-like The Simpsons character database to at least have some fun while training neural networks.
☆12May 12, 2021Updated 5 years ago
Yongtae723 / chat-your-data
View on GitHub
☆11May 21, 2023Updated 3 years ago
huggingface / nanotron
View on GitHub
Minimalistic large language model 3D-parallelism training
☆2,764May 26, 2026Updated last month
alvarobartt / covid-daily
View on GitHub
🦠 COVID-19 Daily Data from Worldometers with Python
☆13Feb 28, 2021Updated 5 years ago
AI-ANK / c3-python-nostream
View on GitHub
Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…
☆24Jan 7, 2024Updated 2 years ago