A deep dive into embeddings starting from fundamentals
☆1,079Jan 17, 2026Updated 4 months ago
Alternatives and similar repositories for what_are_embeddings
Users that are interested in what_are_embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Good books, good vibes☆434Jan 6, 2024Updated 2 years ago
- Toolkit to forge scikit-learn compatible estimators☆19Jun 1, 2026Updated last week
- Data Visualizations for the #30DayChartChallenge☆11Apr 15, 2024Updated 2 years ago
- Machine Learning Engineering Open Book☆18,056May 18, 2026Updated 3 weeks ago
- DSPy: The framework for programming—not prompting—language models☆34,958Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆21May 13, 2025Updated last year
- Write Datasette canned queries as plain SQL files☆14Jul 2, 2022Updated 3 years ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,549Jul 1, 2024Updated last year
- Structured Outputs☆13,947May 18, 2026Updated 3 weeks ago
- The release of the Twitter algorithm, annotated for recsys☆497Apr 15, 2023Updated 3 years ago
- A guidance language for controlling large language models.☆21,488May 21, 2026Updated 3 weeks ago
- Boring ML Generated Site☆19Oct 1, 2022Updated 3 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆895Apr 17, 2024Updated 2 years ago
- Go ahead and axolotl questions☆12,032Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- It's a cooler way to store simple linear models.☆26Jul 15, 2024Updated last year
- structured outputs for llms☆13,135Updated this week
- ☆10Feb 12, 2024Updated 2 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆526Feb 12, 2026Updated 4 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,935May 17, 2025Updated last year
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)☆773Jul 12, 2023Updated 2 years ago
- LlamaIndex is the leading document agent and OCR platform☆50,073Updated this week
- Hackers' Guide to Language Models☆1,869Dec 13, 2024Updated last year
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆747Jun 2, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Explanation to key concepts in ML☆8,565Jun 30, 2025Updated 11 months ago
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆79,918Feb 5, 2026Updated 4 months ago
- ShellSage saves sysadmins’ sanity by solving shell script snafus super swiftly☆402Jun 1, 2026Updated last week
- Notes from the Latent Space paper club. Follow along or start your own!☆249Jul 31, 2024Updated last year
- data cleaning and curation for unstructured text☆329Aug 6, 2024Updated last year
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆96,979Jun 2, 2026Updated last week
- ☆170Jun 3, 2024Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆59,420Nov 12, 2025Updated 7 months ago
- LLM101n: Let's build a Storyteller☆37,259Aug 1, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Understanding Deep Learning - Simon J.D. Prince☆9,541Feb 24, 2026Updated 3 months ago
- llama3 implementation one matrix multiplication at a time☆15,230May 23, 2024Updated 2 years ago
- SQL functions for calling OpenAI APIs☆22Jan 14, 2023Updated 3 years ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,414Updated this week
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Jul 15, 2022Updated 3 years ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,924Feb 24, 2024Updated 2 years ago
- LLM plugin for models hosted by Anyscale Endpoints☆35Apr 22, 2024Updated 2 years ago