π Datasets and models for instruction-tuning
β238Sep 23, 2023Updated 2 years ago
Alternatives and similar repositories for txtinstruct
Users that are interested in txtinstruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β‘ Local chat assistants with AI superpowersβ336Feb 13, 2026Updated 3 months ago
- Python client for txtaiβ15May 12, 2026Updated last week
- Tokenizer for Text to Speech (TTS) modelsβ13Jan 16, 2025Updated last year
- β17Sep 9, 2022Updated 3 years ago
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,577May 12, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ποΈ Highlight text in documentsβ113Feb 13, 2026Updated 3 months ago
- β33Apr 23, 2023Updated 3 years ago
- Viewer for text datasets in formats like HuggingFace, JSONL, etc.β15Feb 25, 2025Updated last year
- π¨ Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.β50May 8, 2023Updated 3 years ago
- KL3M training data collection and preprocessingβ22Apr 14, 2025Updated last year
- Magnitude fork that only supports Word2Vec, GloVe and fastText embeddingsβ13Aug 11, 2020Updated 5 years ago
- API client for fetching and comparing passages from legislationβ14Jan 26, 2025Updated last year
- Sentence Embedding as a Serviceβ15Jun 30, 2025Updated 10 months ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AIβs LLaMA 7B trained on the RedPajama datasetβ7,528Jul 16, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpusβ13Jan 2, 2021Updated 5 years ago
- My attempt at making a GPT agent for pentestingβ44May 10, 2023Updated 3 years ago
- A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)β1,148Jan 4, 2024Updated 2 years ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.β826Jul 15, 2025Updated 10 months ago
- Python 2.7 hashing and iteration in Python 3+β18Nov 20, 2022Updated 3 years ago
- π βοΈ ETL processes for medical and scientific papersβ681Dec 7, 2025Updated 5 months ago
- π Semantic search for headlines and story textβ359Sep 23, 2023Updated 2 years ago
- An easy way to host your own AI API and expose alternative models, while being compatible with "open" AI clients.β328Jul 16, 2024Updated last year
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"β17Feb 11, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sβ¦β2,666Mar 4, 2026Updated 2 months ago
- Github repo of the CHARLIE AI interaction projectβ14Aug 2, 2023Updated 2 years ago
- A tiny library for coding with large language models.β1,234Jul 10, 2024Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-sβ¦β221Jan 20, 2025Updated last year
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroβ¦β3,037Feb 11, 2026Updated 3 months ago
- β475Dec 27, 2023Updated 2 years ago
- Buzz AI, aka gt-chat, is a fast and intuitive question-answering chatbot for Georgia Tech. Powered by Next.js, FastAPI, and OpenAI, it soβ¦β30Apr 13, 2023Updated 3 years ago
- π Semantic search for developersβ542Sep 23, 2023Updated 2 years ago
- Command Line Interface for Hugging Face Inference Endpointsβ65Apr 10, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such asβ¦β358Jul 4, 2023Updated 2 years ago
- Datasets for Instruction Tuning of Large Language Modelsβ260Nov 30, 2023Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β52Jul 10, 2024Updated last year
- Efficient few-shot learning with Sentence Transformersβ2,735Apr 17, 2026Updated last month
- data cleaning and curation for unstructured textβ329Aug 6, 2024Updated last year
- Graph database library that allows you to store, analyze, and search through your data in a graph format. By using the Universal Sentenceβ¦β16May 26, 2023Updated 2 years ago
- β‘ Langchain apps in production using Jina & FastAPIβ1,640Sep 20, 2023Updated 2 years ago