microsoft / REBEL
☆34Updated 2 weeks ago
Alternatives and similar repositories for REBEL:
Users that are interested in REBEL are comparing it to the libraries listed below
- Creating Generative AI Apps which work☆17Updated 3 weeks ago
- ☆41Updated 2 years ago
- PyTorch implementation for MRL☆18Updated last year
- A sample pattern for running CI tests on Modal☆17Updated 3 weeks ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆18Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 7 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆22Updated last month
- ☆20Updated this week
- Pre-train Static Word Embeddings☆58Updated 3 weeks ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated last month
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆10Updated 4 months ago
- ☆43Updated 2 months ago
- ☆28Updated 5 months ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆24Updated 2 months ago
- ☆8Updated 9 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆30Updated 8 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆24Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Updated 2 months ago
- Embedding Recycling for Language models☆38Updated last year
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆19Updated 3 months ago
- ☆19Updated 6 months ago
- ☆15Updated last year
- ☆16Updated last year
- ☆39Updated this week
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year