A deep dive into embeddings starting from fundamentals
☆1,055Jan 17, 2026Updated last month
Alternatives and similar repositories for what_are_embeddings
Users that are interested in what_are_embeddings are comparing it to the libraries listed below
Sorting:
- Good books, good vibes☆431Jan 6, 2024Updated 2 years ago
- Machine Learning Engineering Open Book☆17,286Feb 21, 2026Updated 2 weeks ago
- DSPy: The framework for programming—not prompting—language models☆32,519Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,358Jul 1, 2024Updated last year
- Structured Outputs☆13,488Mar 2, 2026Updated last week
- Toolkit to forge scikit-learn compatible estimators☆19Mar 1, 2026Updated last week
- A guidance language for controlling large language models.☆21,333Feb 13, 2026Updated 3 weeks ago
- just a bunch of useful embeddings for scikit-learn pipelines☆522Feb 12, 2026Updated 3 weeks ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆739Updated this week
- Go ahead and axolotl questions☆11,395Updated this week
- structured outputs for llms☆12,468Feb 25, 2026Updated last week
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)☆773Jul 12, 2023Updated 2 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆894Apr 17, 2024Updated last year
- LlamaIndex is the leading document agent and OCR platform☆47,374Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆54,071Nov 12, 2025Updated 3 months ago
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Aug 15, 2020Updated 5 years ago
- Explanation to key concepts in ML☆8,530Jun 30, 2025Updated 8 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,868May 17, 2025Updated 9 months ago
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆87,151Updated this week
- LLM101n: Let's build a Storyteller☆36,432Aug 1, 2024Updated last year
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆76,159Feb 5, 2026Updated last month
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,206Mar 1, 2026Updated last week
- Lightning ⚡️ fast forecasting with statistical and econometric models.☆4,708Updated this week
- llama3 implementation one matrix multiplication at a time☆15,242May 23, 2024Updated last year
- Clarity in the current fast-paced mess of Open Source innovation☆1,620Jan 20, 2025Updated last year
- A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with gi…☆19,550Updated this week
- Hackers' Guide to Language Models☆1,863Dec 13, 2024Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,185Aug 22, 2025Updated 6 months ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,905Feb 24, 2024Updated 2 years ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,602Dec 20, 2025Updated 2 months ago
- Creating beautiful plots of data maps☆982Mar 2, 2026Updated last week
- Numbers every LLM developer should know☆4,287Jan 16, 2024Updated 2 years ago
- Understanding Deep Learning - Simon J.D. Prince☆9,145Feb 24, 2026Updated last week
- Exporting python functions in R packages☆20Oct 27, 2023Updated 2 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,884Mar 2, 2026Updated last week
- It's a cooler way to store simple linear models.☆27Jul 15, 2024Updated last year
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- data cleaning and curation for unstructured text☆329Aug 6, 2024Updated last year
- A playbook for systematically maximizing the performance of deep learning models.☆29,879Jun 18, 2024Updated last year