marqo-ai / GCL
Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained contrastive learning framework.
β56Updated 2 weeks ago
Alternatives and similar repositories for GCL:
Users that are interested in GCL are comparing it to the libraries listed below
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.β156Updated 10 months ago
- Code for NeurIPS LLM Efficiency Challengeβ55Updated 10 months ago
- NLP with Rust for Python π¦πβ61Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 7 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β119Updated last month
- Index of URLs to pdf files all over the internet and scriptsβ21Updated last year
- Set of scripts to finetune LLMsβ36Updated 10 months ago
- β27Updated 3 months ago
- β58Updated 11 months ago
- β31Updated 7 months ago
- β24Updated last year
- FastFit β‘ When LLMs are Unfit Use FastFit β‘ Fast and Effective Text Classification with Many Classesβ184Updated 4 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and teβ¦β42Updated last year
- Experiments for efforts to train a new and improved t5β77Updated 10 months ago
- Python API for https://vespa.ai, the open big data serving engineβ113Updated this week
- QLoRA for Masked Language Modelingβ21Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β79Updated 10 months ago
- Generalist and Lightweight Model for Text Classificationβ65Updated 3 weeks ago
- π€ Trade any tensors over the networkβ30Updated last year
- Fine-tuning OpenAI CLIP Model for Image Search on medical imagesβ76Updated 2 years ago
- Chunk your text using gpt4o-mini more accuratelyβ43Updated 6 months ago
- State-of-the-art CLIP/SigLIP embedding models finetuned for the fashion domain. +57% increase in evaluation metrics vs FashionCLIP 2.0.β77Updated 4 months ago
- M4 experiment logbookβ56Updated last year
- β46Updated last year
- β40Updated 9 months ago
- Repository containing awesome resources regarding Hugging Face tooling.β46Updated last year
- β41Updated 2 weeks ago
- Supercharge huggingface transformers with model parallelism.β76Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ54Updated 5 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMsβ81Updated 2 months ago