nomic-ai / contrastors
Train Models Contrastively in PyTorch
☆754 Updated 8 months ago
Alternatives and similar repositories for contrastors
Users who are interested in contrastors are comparing it to the libraries listed below.
- Generative Representational Instruction Tuning ☆679 Updated 5 months ago
- Easily embed, cluster and semantically label text datasets ☆584 Updated last year
- ☆556 Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆732 Updated last year
- Official repository for ORPO ☆467 Updated last year
- Evaluation suite for LLMs ☆370 Updated 4 months ago
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤 ☆1,081 Updated 10 months ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT. ☆802 Updated 4 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ☆555 Updated last week
- Data and tools for generating and inspecting OLMo pre-training data. ☆1,355 Updated last month
- ☆446 Updated last year
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ☆662 Updated last year
- Evaluate your LLM's response with Prometheus and GPT4 💯 ☆1,017 Updated 7 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,406 Updated last year
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels ☆345 Updated 11 months ago
- Code for fine-tuning Platypus fam LLMs using LoRA ☆630 Updated last year
- Official inference library for pre-processing of Mistral models ☆818 Updated this week
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆750 Updated last year
- ☆581 Updated last year
- Bringing BERT into modernity via both architecture changes and scaling ☆1,572 Updated 5 months ago
- awesome synthetic (text) datasets ☆310 Updated 2 weeks ago
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,643 Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,222 Updated last year
- A repository for research on medium sized language models. ☆520 Updated 6 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars ☆988 Updated last year
- [ACL 2024] Progressive LLaMA with Block Expansion. ☆513 Updated last year
- An Open Source Toolkit For LLM Distillation ☆785 Updated 4 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch. ☆686 Updated last year
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture" ☆560 Updated 11 months ago
- Code repository for the paper - "Matryoshka Representation Learning" ☆584 Updated last year