neuml/txtinstruct

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/neuml/txtinstruct)

neuml / txtinstruct

📚 Datasets and models for instruction-tuning

☆238

Alternatives and similar repositories for txtinstruct

Users that are interested in txtinstruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

neuml / txtchat
View on GitHub
⚡ Local chat assistants with AI superpowers
☆336Feb 13, 2026Updated 5 months ago
neuml / txtai.py
View on GitHub
Python client for txtai
☆15Jul 1, 2026Updated 2 weeks ago
deep-diver / complete-mlops-system-workflow
View on GitHub
☆17Sep 9, 2022Updated 3 years ago
neuml / txtai
View on GitHub
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
☆12,733Updated this week
prohandler / GS-Bulk-Emails
View on GitHub
Google App Scripts that sends a number of emails from the specific number and that tracks the open status of each email
☆17Dec 11, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ConiferLabsWA / flan-ul2-alpaca
View on GitHub
☆33Apr 23, 2023Updated 3 years ago
neuml / txtmarker
View on GitHub
🖍️ Highlight text in documents
☆113Feb 13, 2026Updated 5 months ago
neuml / magnitude
View on GitHub
Magnitude fork that only supports Word2Vec, GloVe and fastText embeddings
☆13Aug 11, 2020Updated 5 years ago
chainyo / picaisso
View on GitHub
🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.
☆50May 8, 2023Updated 3 years ago
bentoml / sentence-embedding-bento
View on GitHub
Sentence Embedding as a Service
☆15Jun 30, 2025Updated last year
openlm-research / open_llama
View on GitHub
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,530Jul 16, 2023Updated 3 years ago
yaodongC / awesome-instruction-dataset
View on GitHub
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
☆1,152Jan 4, 2024Updated 2 years ago
mscarey / legislice
View on GitHub
API client for fetching and comparing passages from legislation
☆14Jun 29, 2026Updated 3 weeks ago
AgeOfMarcus / 1337GPT
View on GitHub
My attempt at making a GPT agent for pentesting
☆44May 10, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
JSv4 / AtticusClassifier
View on GitHub
Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus
☆14Jan 2, 2021Updated 5 years ago
BatsResearch / bonito
View on GitHub
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
☆831Jul 15, 2025Updated last year
neuml / py27hash
View on GitHub
Python 2.7 hashing and iteration in Python 3+
☆19Nov 20, 2022Updated 3 years ago
stochasticai / xTuring
View on GitHub
Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-s…
☆2,671Mar 4, 2026Updated 4 months ago
salesforce / xgen
View on GitHub
Salesforce open-source LLMs with 8k sequence length.
☆727Jun 2, 2026Updated last month
lhenault / simpleAI
View on GitHub
An easy way to host your own AI API and expose alternative models, while being compatible with "open" AI clients.
☆328Jul 16, 2024Updated 2 years ago
neuml / tldrstory
View on GitHub
📊 Semantic search for headlines and story text
☆359Sep 23, 2023Updated 2 years ago
TobiasM95 / CHARLIE
View on GitHub
Github repo of the CHARLIE AI interaction project
☆14Aug 2, 2023Updated 2 years ago
srush / MiniChain
View on GitHub
A tiny library for coding with large language models.
☆1,233Jul 10, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
hxu296 / gt-chat
View on GitHub
Buzz AI, aka gt-chat, is a fast and intuitive question-answering chatbot for Georgia Tech. Powered by Next.js, FastAPI, and OpenAI, it so…
☆30Apr 13, 2023Updated 3 years ago
davidberenstein1957 / classy-classification
View on GitHub
This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…
☆221Jan 20, 2025Updated last year
TheAtticusProject / maud
View on GitHub
☆98Feb 15, 2023Updated 3 years ago
philschmid / easyllm
View on GitHub
☆476Dec 27, 2023Updated 2 years ago
deshwalmahesh / PHUDGE
View on GitHub
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆53Jul 10, 2024Updated 2 years ago
hegelai / prompttools
View on GitHub
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…
☆3,042Feb 11, 2026Updated 5 months ago
neuml / codequestion
View on GitHub
🔎 Semantic search for developers
☆541Sep 23, 2023Updated 2 years ago
MantisAI / hugie
View on GitHub
Command Line Interface for Hugging Face Inference Endpoints
☆65Apr 10, 2024Updated 2 years ago
declare-lab / flan-alpaca
View on GitHub
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…
☆356Jul 4, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nileshkhetrapal / YassQueenDB
View on GitHub
Graph database library that allows you to store, analyze, and search through your data in a graph format. By using the Universal Sentence…
☆16May 26, 2023Updated 3 years ago
momegas / megabots
View on GitHub
🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵
☆349Jun 10, 2023Updated 3 years ago
jina-ai / product-recommendation-redis-docarray
View on GitHub
☆26Dec 28, 2022Updated 3 years ago
raunak-agarwal / instruction-datasets
View on GitHub
Datasets for Instruction Tuning of Large Language Models
☆261Nov 30, 2023Updated 2 years ago
microsoft / xtreme-distil-transformers
View on GitHub
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
☆157Dec 20, 2023Updated 2 years ago
huggingface / setfit
View on GitHub
Efficient few-shot learning with Sentence Transformers
☆2,772May 26, 2026Updated last month
taylorai / galactic
View on GitHub
data cleaning and curation for unstructured text
☆329Aug 6, 2024Updated last year