Gleghorn-Lab / Mixture-of-Experts-Sentence-Similarity
☆15Updated 4 months ago
Alternatives and similar repositories for Mixture-of-Experts-Sentence-Similarity
Users who are interested in Mixture-of-Experts-Sentence-Similarity are comparing it to the libraries listed below.
- Pre-trained Language Model for Scientific Text☆45Updated last year
- Few-shot Learning with Auxiliary Data☆28Updated last year
- Embedding Recycling for Language models☆39Updated 2 years ago
- The corresponding code from our paper "Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…☆11Updated 10 months ago
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated 2 years ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆20Updated last year
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization☆13Updated last year
- ☆43Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆90Updated 7 months ago
- Codes for paper: Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT☆34Updated 3 years ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Updated 7 months ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆16Updated 9 months ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"☆27Updated 2 years ago
- ☆20Updated 4 years ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆50Updated 3 weeks ago
- ☆20Updated 8 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Updated 8 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated 2 weeks ago
- Multimodal Transformers for biomedical text and Knowledge Graph data☆34Updated 2 years ago
- Download, parse, and filter data from PubMed, data-ready for The-Pile☆23Updated 3 years ago
- [ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning☆42Updated 2 years ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆56Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆58Updated 2 years ago
- ☆16Updated 2 weeks ago
- Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data☆57Updated 3 years ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆96Updated last year
- ☆35Updated last year
- ☆44Updated 8 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆41Updated 4 months ago