A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs and agents.
☆65Feb 6, 2025Updated last year
Alternatives and similar repositories for ai-blueprint
Users that are interested in ai-blueprint are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Plug-and-play document AI with zero-shot models.☆125May 11, 2026Updated 2 weeks ago
- Synthetic Text Dataset Generation for LLM projects☆58Apr 17, 2026Updated last month
- Load embeddings and featurize your sentences.☆31Oct 23, 2024Updated last year
- SynthGenAI - Package for Generating Synthetic Datasets using LLMs.☆56Nov 24, 2025Updated 6 months ago
- Nearly Inference Free Embeddings: make your RAG queries 500x faster☆77Apr 27, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- synthetic data for ml☆25Jan 30, 2025Updated last year
- Efficiently find the best-suited language model (LM) for your NLP task☆135Jul 26, 2025Updated 10 months ago
- ☆10Sep 29, 2024Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated 2 years ago
- Python library to use Pleias-RAG models☆71May 8, 2026Updated 2 weeks ago
- Build datasets using natural language☆575Sep 19, 2025Updated 8 months ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated 2 years ago
- 🤗 Collection of examples on how to train, deploy and monitor HuggingFace models in Google Cloud Vertex AI☆23Feb 26, 2024Updated 2 years ago
- Datamodels for hugging face tokenizers☆107Apr 28, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago
- ☆10Oct 2, 2024Updated last year
- ☆15Apr 14, 2026Updated last month
- ☆17Apr 30, 2025Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 7 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆60May 11, 2023Updated 3 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 8 months ago
- A Structured Output Benchmark whose 'ground-truth' is actually right☆19Dec 5, 2025Updated 5 months ago
- AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…☆30Jan 14, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- awesome synthetic (text) datasets☆332Jan 8, 2026Updated 4 months ago
- chrome & firefox extension to chat with webpages: local llms☆130Dec 20, 2024Updated last year
- 🤖 Telegram chatbot frontend for Searx.☆16Nov 25, 2018Updated 7 years ago
- Generalist and Lightweight Model for Text Classification☆217May 19, 2026Updated last week
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i…☆29Mar 8, 2026Updated 2 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 10 months ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 3 years ago
- A trivial raycaster using minifb for rendering/input☆14Jan 2, 2023Updated 3 years ago
- Implement FlashAttention v2 with minimal code to learn.☆16Jun 12, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆15Jul 19, 2024Updated last year
- ☆14Oct 28, 2024Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆21Apr 27, 2026Updated last month
- Unattended Lightweight Text Classifiers with LLM Embeddings☆186Sep 6, 2024Updated last year
- Feature Selection using Simulated Annealing☆11Aug 10, 2022Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156May 24, 2024Updated 2 years ago
- Fact checking baseline combining dense retrieval and textual entailment☆30Aug 10, 2025Updated 9 months ago