Marqo's Course on 'Fine-Tuning Embedding Models for Semantic Search'.
☆57Jul 29, 2024Updated last year
Alternatives and similar repositories for fine-tuning-embedding-models-course
Users that are interested in fine-tuning-embedding-models-course are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jun 16, 2026Updated 2 weeks ago
- EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction☆26May 22, 2024Updated 2 years ago
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated 2 years ago
- Automated testing and benchmarking for code generation agents.☆18Jun 27, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code used for articles published at Nvidia's Developer Blog☆12Jun 16, 2022Updated 4 years ago
- State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP.☆46Nov 13, 2024Updated last year
- Python client for Marqo☆31Jun 25, 2026Updated last week
- This is the repo where I save #Terraform recipes, mostly posted in cduser.com☆10May 17, 2021Updated 5 years ago
- ☆17Jan 5, 2023Updated 3 years ago
- Machine Learning Engineer interview preparation. Brushing up Data Structures & Algorithms, System Design and SQL☆25Jun 10, 2021Updated 5 years ago
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated 2 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Mar 20, 2024Updated 2 years ago
- organize data visualization output, automatically picking meaningful names based on semantic plotting variables☆12Apr 21, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Semantically Search Emojis From the Command Line!☆13Nov 26, 2023Updated 2 years ago
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i…☆29Mar 8, 2026Updated 3 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆24Jun 30, 2025Updated last year
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 3 years ago
- [NeurIPS 2024] 🕸 GlotCC Dataset and Pipline☆20Apr 6, 2025Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆21Apr 27, 2026Updated 2 months ago
- This is the project for IRM methods☆12Sep 13, 2021Updated 4 years ago
- Python implementation of machine learning and Ai algorithms from scratch☆11Feb 12, 2025Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆83Mar 18, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This project contains my solution for all the data structures and algorithms on Algo Expert, Hackerrank and Leetcode. This repository is …☆10Jan 24, 2021Updated 5 years ago
- AAAI23-Directed Acyclic Graph Structure Learning from Dynamic Graphs☆12Nov 25, 2022Updated 3 years ago
- Code and dataset for the EMNLP 2024 paper: GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory☆51Sep 26, 2024Updated last year
- In this project, we have to create a predictive model which allows the company to maximize the profit of the next marketing campaign☆17Oct 18, 2025Updated 8 months ago
- How to install and use tesseract OCR on Centos7 - without root access☆14Jan 21, 2021Updated 5 years ago
- InfluxDB 2.0 Dashboards Renderer as images☆15Dec 16, 2022Updated 3 years ago
- 北语 246 实验室新生简明指南☆10May 30, 2022Updated 4 years ago
- Source code for Jordan Boyd-Graber's academic webpage.☆12Jun 19, 2026Updated last week
- A massively multilingual modern encoder language model☆143Jan 20, 2026Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A buffered output plugin for Fluentd and InfluxDB 2☆21Sep 22, 2025Updated 9 months ago
- ☆11Apr 10, 2026Updated 2 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Apr 11, 2024Updated 2 years ago
- Collection script and Grafana Dashboard for tracking Progress of Covid-19☆18Dec 1, 2020Updated 5 years ago
- GraalVM GitHub action☆13Jun 25, 2022Updated 4 years ago
- the full pipeline for model retraining with fastapi and github actions☆16Jul 5, 2024Updated last year
- Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"☆13Jul 23, 2023Updated 2 years ago