krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆18 Updated 8 months ago
Related projects
Alternatives and complementary repositories for matryoshka-representation-learning
- Embedding Recycling for Language models ☆38 Updated last year
- ☆40 Updated 2 weeks ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆27 Updated 4 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆17 Updated last month
- Using short models to classify long texts ☆20 Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆23 Updated 3 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models ☆22 Updated 9 months ago
- Code for NeurIPS LLM Efficiency Challenge ☆54 Updated 7 months ago
- ☆31 Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆46 Updated 2 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆61 Updated 4 months ago
- ☆27 Updated 5 months ago
- LLM training in simple, raw C/CUDA ☆12 Updated last month
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆44 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆72 Updated 2 months ago
- ☆41 Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆30 Updated 9 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts? ☆35 Updated last month
- SCREWS: A Modular Framework for Reasoning with Revisions ☆26 Updated last year
- ☆24 Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆68 Updated last month
- Minimum Description Length probing for neural network representations ☆16 Updated last week
- ☆46 Updated this week
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 4 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆22 Updated 2 years ago
- Few-shot Learning with Auxiliary Data ☆26 Updated 11 months ago
- Aioli: A unified optimization framework for language model data mixing ☆13 Updated last week
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆38 Updated 3 weeks ago