trapoom555 / Language-Model-STS-CFTLinks

Improving Text Embedding of Language Models Using Contrastive Fine-tuning

☆65

Alternatives and similar repositories for Language-Model-STS-CFT

Users that are interested in Language-Model-STS-CFT are comparing it to the libraries listed below

Sorting:

TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79Updated last year
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆49Updated 2 years ago
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆79Updated last year
arcee-ai / DAM
☆55Updated last year
JHU-CLSP / RATIONALYST
Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044
☆35Updated last year
nlp-uoregon / ullme
☆20Updated 7 months ago
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77Updated 4 months ago
icip-cas / SelfRetrieval
☆36Updated last year
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆31Updated 10 months ago
ielab / Starbucks
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆22Updated 5 months ago
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated last year
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29Updated 9 months ago
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆80Updated 7 months ago
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆57Updated last week
Zyphra / Zyda_processing
☆39Updated last year
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆115Updated 9 months ago
tigerchen52 / awesome_role_of_small_models
a curated list of the role of small models in the LLM era
☆110Updated last year
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆27Updated 11 months ago
hamishivi / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆76Updated last year
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38Updated 2 years ago
princeton-nlp / LitSearch
[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search
☆101Updated last year
gersteinlab / Struc-Bench
[NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naa…
☆55Updated 4 months ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆91Updated last year
facebookresearch / mexma
MEXMA: Token-level objectives improve sentence representations
☆42Updated 10 months ago
mungg / FABLES
☆58Updated last year
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆45Updated 2 months ago
Aleph-Alpha-Research / trigrams
☆58Updated 2 weeks ago