andrewgcodes / vec2vecLinks
☆16Updated 2 years ago
Alternatives and similar repositories for vec2vec
Users that are interested in vec2vec are comparing it to the libraries listed below
Sorting:
- Training code for Sparse Autoencoders on Embedding models☆38Updated 7 months ago
- Text-writing denoising diffusion (and much more)☆30Updated 2 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Updated 2 years ago
- One stop shop for all things carp☆59Updated 3 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆41Updated last year
- Efficient few-shot learning with cross-encoders.☆58Updated last year
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain…☆52Updated 2 years ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated 2 years ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆40Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated 2 years ago
- ☆29Updated 2 years ago
- ☆44Updated last year
- Entailment self-training☆25Updated 2 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- Local emulator for Hugging Face Inference Endpoints customer handlers☆26Updated 2 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- An open-source replication and extension of the Meta AI's LLAMA dataset☆24Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Smol but mighty language model☆63Updated 2 years ago
- ☆43Updated 2 years ago
- ☆49Updated last year
- Rust bindings for CTranslate2☆14Updated 2 years ago
- ☆63Updated last year
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training☆44Updated 5 years ago