microsoft / REBEL
☆24Updated this week
Alternatives and similar repositories for REBEL:
Users that are interested in REBEL are comparing it to the libraries listed below
- ☆40Updated 2 months ago
- ☆8Updated 9 months ago
- A sample pattern for running CI tests on Modal☆17Updated this week
- ☆28Updated last week
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆23Updated last year
- Creating Generative AI Apps which work☆17Updated this week
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆34Updated this week
- ☆15Updated last year
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆19Updated 3 weeks ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- QLoRA for Masked Language Modeling☆22Updated last year
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆23Updated last month
- ☆14Updated 9 months ago
- PyTorch implementation for MRL☆18Updated last year
- ☆28Updated 5 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 6 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆42Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆19Updated 2 months ago
- Pre-train Static Word Embeddings☆53Updated this week
- ☆21Updated this week
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆15Updated last week
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 2 weeks ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆14Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆17Updated last week
- LLM training in simple, raw C/CUDA☆14Updated 4 months ago
- ☆21Updated this week