andrewgcodes / vec2vecLinks
☆17Updated 2 years ago
Alternatives and similar repositories for vec2vec
Users that are interested in vec2vec are comparing it to the libraries listed below
Sorting:
- Entailment self-training☆26Updated 2 years ago
- One stop shop for all things carp☆59Updated 3 years ago
- Based on the tree of thoughts paper☆48Updated 2 years ago
- ☆29Updated 2 years ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain…☆52Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- ☆44Updated last year
- ☆19Updated 2 years ago
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.☆49Updated 2 years ago
- Training code for Sparse Autoencoders on Embedding models☆39Updated 11 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated 2 years ago
- LLM sampling method for enforcing syntax adherence in generated output☆25Updated 2 years ago
- Efficiently computing & storing token n-grams from large corpora☆26Updated last year
- Elevate your language models with insightful diversity metrics.☆11Updated 2 years ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"☆19Updated 3 years ago
- A sample pattern for running CI tests on Modal☆19Updated 9 months ago
- ☆19Updated 2 years ago
- Efficient few-shot learning with cross-encoders.☆62Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 4 years ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- Smol but mighty language model☆65Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated 2 years ago
- Detecting gibberish as a type of sentiment analysis with GPT2☆25Updated 5 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Updated 2 years ago
- Embedding Recycling for Language models☆38Updated 2 years ago
- Causal DAG Extraction from Text (DEFT)☆66Updated last year
- Latent Diffusion Language Models☆70Updated 2 years ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated 2 years ago