A deep dive into embeddings starting from fundamentals
☆1,068Jan 17, 2026Updated 3 months ago
Alternatives and similar repositories for what_are_embeddings
Users that are interested in what_are_embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Good books, good vibes☆434Jan 6, 2024Updated 2 years ago
- Toolkit to forge scikit-learn compatible estimators☆19Updated this week
- Data Visualizations for the #30DayChartChallenge☆11Apr 15, 2024Updated 2 years ago
- Machine Learning Engineering Open Book☆17,820Mar 16, 2026Updated last month
- Updating list of favorite internet essays☆52Mar 7, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- DSPy: The framework for programming—not prompting—language models☆34,016Apr 24, 2026Updated last week
- ☆21May 13, 2025Updated 11 months ago
- Write Datasette canned queries as plain SQL files☆14Jul 2, 2022Updated 3 years ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,454Jul 1, 2024Updated last year
- Structured Outputs☆13,741Apr 16, 2026Updated 2 weeks ago
- The release of the Twitter algorithm, annotated for recsys☆498Apr 15, 2023Updated 3 years ago
- A guidance language for controlling large language models.☆21,408Apr 10, 2026Updated 3 weeks ago
- Boring ML Generated Site☆19Oct 1, 2022Updated 3 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆895Apr 17, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Go ahead and axolotl questions☆11,779Updated this week
- It's a cooler way to store simple linear models.☆26Jul 15, 2024Updated last year
- structured outputs for llms☆12,840Apr 22, 2026Updated last week
- ☆10Feb 12, 2024Updated 2 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆524Feb 12, 2026Updated 2 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,914May 17, 2025Updated 11 months ago
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)☆774Jul 12, 2023Updated 2 years ago
- LlamaIndex is the leading document agent and OCR platform☆48,997Updated this week
- (K3IM) Keras 3 Image Models☆22Feb 22, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Hackers' Guide to Language Models☆1,869Dec 13, 2024Updated last year
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆744Updated this week
- Explanation to key concepts in ML☆8,553Jun 30, 2025Updated 10 months ago
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆78,872Feb 5, 2026Updated 2 months ago
- ShellSage saves sysadmins’ sanity by solving shell script snafus super swiftly☆399Updated this week
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆91,680Apr 16, 2026Updated 2 weeks ago
- Notes from the Latent Space paper club. Follow along or start your own!☆247Jul 31, 2024Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆57,242Nov 12, 2025Updated 5 months ago
- data cleaning and curation for unstructured text☆329Aug 6, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆170Jun 3, 2024Updated last year
- LLM101n: Let's build a Storyteller☆36,854Aug 1, 2024Updated last year
- Understanding Deep Learning - Simon J.D. Prince☆9,405Feb 24, 2026Updated 2 months ago
- llama3 implementation one matrix multiplication at a time☆15,243May 23, 2024Updated last year
- SQL functions for calling OpenAI APIs☆22Jan 14, 2023Updated 3 years ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,326Apr 25, 2026Updated last week
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper☆16Dec 16, 2024Updated last year