A deep dive into embeddings starting from fundamentals
☆1,085Jan 17, 2026Updated 5 months ago
Alternatives and similar repositories for what_are_embeddings
Users that are interested in what_are_embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Good books, good vibes☆435Jan 6, 2024Updated 2 years ago
- Toolkit to forge scikit-learn compatible estimators☆19Jun 1, 2026Updated last month
- Data Visualizations for the #30DayChartChallenge☆11Apr 15, 2024Updated 2 years ago
- Machine Learning Engineering Open Book☆18,203Updated this week
- DSPy: The framework for programming—not prompting—language models☆35,605Jun 25, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆21May 13, 2025Updated last year
- Write Datasette canned queries as plain SQL files☆14Jul 2, 2022Updated 4 years ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,598Jul 1, 2024Updated 2 years ago
- Structured Outputs☆14,273Updated this week
- The release of the Twitter algorithm, annotated for recsys☆498Apr 15, 2023Updated 3 years ago
- A guidance language for controlling large language models.☆21,519May 21, 2026Updated last month
- 🤖 A PyTorch library of curated Transformer models and their composable components☆895Apr 17, 2024Updated 2 years ago
- Go ahead and axolotl questions☆12,121Updated this week
- It's a cooler way to store simple linear models.☆26Jul 15, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- structured outputs for llms☆13,328Updated this week
- ☆10Feb 12, 2024Updated 2 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆527Feb 12, 2026Updated 4 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,938May 17, 2025Updated last year
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)☆772Jul 12, 2023Updated 2 years ago
- LlamaIndex is the leading document agent and OCR platform☆50,533Updated this week
- Hackers' Guide to Language Models☆1,869Dec 13, 2024Updated last year
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆749Jun 24, 2026Updated last week
- Explanation to key concepts in ML☆8,574Jun 30, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆80,393Feb 5, 2026Updated 4 months ago
- ShellSage saves sysadmins’ sanity by solving shell script snafus super swiftly☆404Jun 1, 2026Updated last month
- Notes from the Latent Space paper club. Follow along or start your own!☆250Jul 31, 2024Updated last year
- data cleaning and curation for unstructured text☆330Aug 6, 2024Updated last year
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆98,270Jun 2, 2026Updated last month
- ☆170Jun 3, 2024Updated 2 years ago
- LLM101n: Let's build a Storyteller☆37,383Aug 1, 2024Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆60,304Nov 12, 2025Updated 7 months ago
- Understanding Deep Learning - Simon J.D. Prince☆9,581Feb 24, 2026Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- llama3 implementation one matrix multiplication at a time☆15,222May 23, 2024Updated 2 years ago
- SQL functions for calling OpenAI APIs☆22Jan 14, 2023Updated 3 years ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,449Updated this week
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Jul 15, 2022Updated 3 years ago
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper☆16Dec 16, 2024Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,930Feb 24, 2024Updated 2 years ago
- LLM plugin for models hosted by Anyscale Endpoints☆35Apr 22, 2024Updated 2 years ago