Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
☆169 · Jan 15, 2024 · Updated 2 years ago
Alternatives and similar repositories for notus
Users who are interested in notus are comparing it to the libraries listed below
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 · Mar 20, 2024 · Updated last year
- A Rust crate offering similar functionality to the Python transformers package using Candle. ☆14 · Nov 19, 2024 · Updated last year
- Let's build better datasets, together! ☆271 · Dec 20, 2024 · Updated last year
- Data extraction with LLM on CPU ☆271 · Mar 26, 2024 · Updated last year
- Using short models to classify long texts ☆21 · Mar 8, 2023 · Updated 3 years ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,114 · Mar 2, 2026 · Updated last week
- Automatically evaluate your LLMs in Google Colab ☆687 · May 7, 2024 · Updated last year
- Robust recipes to align language models with human and AI preferences ☆5,510 · Sep 8, 2025 · Updated 6 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆78 · Aug 17, 2024 · Updated last year
- Verifiers for LLM Reinforcement Learning ☆80 · Apr 15, 2025 · Updated 10 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars ☆989 · Jul 23, 2024 · Updated last year
- Official repository for ORPO ☆472 · May 31, 2024 · Updated last year
- ☆19 · Jan 11, 2024 · Updated 2 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,884 · Mar 2, 2026 · Updated last week
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆24 · Jan 7, 2024 · Updated 2 years ago
- ☆11 · Oct 11, 2023 · Updated 2 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 · Jan 19, 2024 · Updated 2 years ago
- Long Context Research ☆29 · Jan 26, 2026 · Updated last month
- ☆12 · Mar 25, 2024 · Updated last year
- 🧠 ResNet: Deep Residual Learning for Image Recognition ☆10 · Sep 18, 2021 · Updated 4 years ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆139 · Jun 12, 2024 · Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models). ☆364 · Dec 29, 2023 · Updated 2 years ago
- Opensource, personal & local chat interface for language models. ☆13 · Jun 24, 2024 · Updated last year
- ☆15 · Jun 2, 2025 · Updated 9 months ago
- A local search system implementation using Elasticsearch for Wikipedia data indexing and retrieval. ☆12 · May 17, 2025 · Updated 9 months ago
- A simple vector DB built on top of SQLite and Numpy ☆14 · Aug 26, 2023 · Updated 2 years ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,732 · May 21, 2025 · Updated 9 months ago
- ☆31 · Nov 14, 2024 · Updated last year
- Official repository for LongChat and LongEval ☆533 · May 24, 2024 · Updated last year
- ☆50 · Sep 8, 2025 · Updated 6 months ago
- Codebase and project page for EDMSound ☆35 · Nov 20, 2023 · Updated 2 years ago
- Learn & build: Always available expertise powered by AI ☆12 · Jul 10, 2023 · Updated 2 years ago
- 🦠 COVID-19 Daily Data from Worldometers with Python ☆13 · Feb 28, 2021 · Updated 5 years ago
- Chunk Dedupe Estimation ☆20 · Nov 5, 2024 · Updated last year
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆946 · Feb 16, 2025 · Updated last year
- ☆85 · Aug 18, 2023 · Updated 2 years ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil… ☆30 · Dec 19, 2023 · Updated 2 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,915 · Mar 3, 2026 · Updated last week
- Simple examples using Argilla tools to build AI ☆57 · Nov 18, 2024 · Updated last year