Gleghorn-Lab / Mixture-of-Experts-Sentence-SimilarityLinks

☆15

Alternatives and similar repositories for Mixture-of-Experts-Sentence-Similarity

Users that are interested in Mixture-of-Experts-Sentence-Similarity are comparing it to the libraries listed below

Sorting:

Alab-NII / Awesome-SciLM
Pre-trained Language Model for Scientific Text
☆45Updated last year
alon-albalak / FLAD
Few-shot Learning with Auxiliary Data
☆28Updated last year
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆39Updated 2 years ago
debjitpaul / Causal_CoT
About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…
☆11Updated 10 months ago
EagleW / Stage-wise-Fine-tuning
Code for Stage-wise Fine-tuning for Graph-to-Text Generation
☆26Updated 2 years ago
kyegomez / EAOT
The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
☆20Updated last year
wangskyGit / passage-sieve
official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization
☆13Updated last year
gersteinlab / BioCoder
☆43Updated last year
princeton-nlp / LitSearch
[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search
☆90Updated 7 months ago
cambridgeltl / mop
Codes for paper: Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT
☆34Updated 3 years ago
feyzaakyurek / rl4f
Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.
☆64Updated 7 months ago
sanyalsunny111 / Early_Weight_Avg
[COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training
☆16Updated 9 months ago
JeremyAlain / imitation_learning_from_language_feedback
This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"
☆27Updated 2 years ago
instance-wise-ordered-transformer / IOT
☆20Updated 4 years ago
ielab / PromptReps
Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval
☆50Updated 3 weeks ago
yifeiwang77 / Self-Correction
☆20Updated 8 months ago
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40Updated 8 months ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30Updated 2 weeks ago
stonkgs / stonkgs
Multimodal Transformers for biomedical text and Knowledge Graph data
☆34Updated 2 years ago
thoppe / The-Pile-PubMed
Download, parse, and filter data PubMed, data-ready for The-Pile
☆23Updated 3 years ago
yumeng5 / FewGen
[ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
☆42Updated 2 years ago
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆56Updated last year
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆48Updated last year
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆58Updated 2 years ago
Leezekun / MacRAG
☆16Updated 2 weeks ago
CAMTL / CA-MTL
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
☆57Updated 3 years ago
Yuanhy1997 / SeqDiffuSeq
Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]
☆96Updated last year
RUCAIBox / BAMBOO
☆35Updated last year
EleutherAI / semantic-memorization
☆44Updated 8 months ago
xxxiaol / QRData
Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
☆41Updated 4 months ago