π Datasets and models for instruction-tuning
β238Sep 23, 2023Updated 2 years ago
Alternatives and similar repositories for txtinstruct
Users that are interested in txtinstruct are comparing it to the libraries listed below
Sorting:
- β‘ Local chat assistants with AI superpowersβ337Feb 13, 2026Updated 2 weeks ago
- Python client for txtaiβ15Updated this week
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,210Feb 22, 2026Updated last week
- β17Sep 9, 2022Updated 3 years ago
- API client for fetching and comparing passages from legislationβ14Jan 26, 2025Updated last year
- Viewer for text datasets in formats like HuggingFace, JSONL, etc.β15Feb 25, 2025Updated last year
- Tokenizer for Text to Speech (TTS) modelsβ13Jan 16, 2025Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpusβ13Jan 2, 2021Updated 5 years ago
- KL3M training data collection and preprocessingβ20Apr 14, 2025Updated 10 months ago
- β25Dec 28, 2022Updated 3 years ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.β823Jul 15, 2025Updated 7 months ago
- Github repo of the CHARLIE AI interaction projectβ14Aug 2, 2023Updated 2 years ago
- Graph database library that allows you to store, analyze, and search through your data in a graph format. By using the Universal Sentenceβ¦β16May 26, 2023Updated 2 years ago
- β33Apr 23, 2023Updated 2 years ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AIβs LLaMA 7B trained on the RedPajama datasetβ7,533Jul 16, 2023Updated 2 years ago
- π¨ Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.β50May 8, 2023Updated 2 years ago
- Sentence Embedding as a Serviceβ15Jun 30, 2025Updated 8 months ago
- A simple library for segmenting legal textsβ17Apr 22, 2023Updated 2 years ago
- Buzz AI, aka gt-chat, is a fast and intuitive question-answering chatbot for Georgia Tech. Powered by Next.js, FastAPI, and OpenAI, it soβ¦β30Apr 13, 2023Updated 2 years ago
- π€ State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch π€― Create a bot, now π«΅β349Jun 10, 2023Updated 2 years ago
- A tiny library for coding with large language models.β1,233Jul 10, 2024Updated last year
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroβ¦β3,019Feb 11, 2026Updated 2 weeks ago
- Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sβ¦β2,665Feb 20, 2026Updated last week
- π Semantic search for headlines and story textβ359Sep 23, 2023Updated 2 years ago
- A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)β1,143Jan 4, 2024Updated 2 years ago
- π βοΈ ETL processes for medical and scientific papersβ671Dec 7, 2025Updated 2 months ago
- data cleaning and curation for unstructured textβ329Aug 6, 2024Updated last year
- AI Data Management & Evaluation Platformβ215Oct 5, 2023Updated 2 years ago
- π¦ Explore multimedia datasets at scaleβ1,062Dec 7, 2024Updated last year
- β‘ Langchain apps in production using Jina & FastAPIβ1,633Sep 20, 2023Updated 2 years ago
- A language for constraint-guided and efficient LLM programming.β4,154May 22, 2025Updated 9 months ago
- β37May 31, 2023Updated 2 years ago
- create workflows with LLMsβ55Aug 2, 2024Updated last year
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning modβ¦β20Dec 9, 2022Updated 3 years ago
- Efficient few-shot learning with Sentence Transformersβ2,688Dec 11, 2025Updated 2 months ago
- Salesforce open-source LLMs with 8k sequence length.β725Jan 31, 2025Updated last year
- Mine-tuning is a methodology for synchronizing human and AI attention.β19Jun 16, 2024Updated last year
- NLP Web API for Legal Textβ18Dec 23, 2022Updated 3 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adβ¦β6,081Jul 1, 2025Updated 8 months ago