marepilc / pink-parquet
User-friendly viewer for Parquet files
☆9 Updated 7 months ago
Alternatives and similar repositories for pink-parquet
Users who are interested in pink-parquet are comparing it to the libraries listed below
- Lightweight Hybrid Search and Reranking ☆10 Updated 2 months ago
- Training code for Sparse Autoencoders on Embedding models ☆38 Updated 3 months ago
- ANE accelerated embedding models! ☆17 Updated 5 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 Updated last year
- My NER Experiments with ModernBERT ☆21 Updated 3 weeks ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings ☆10 Updated 2 weeks ago
- ☆15 Updated last year
- Nodejs script to run an LLM prompt across a bunch of models. ☆9 Updated 5 months ago
- ☆38 Updated last year
- ☆21 Updated last month
- Efficiently computing & storing token n-grams from large corpora ☆23 Updated 8 months ago
- ☆11 Updated 2 months ago
- ☆18 Updated last year
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri… ☆15 Updated 2 months ago
- ☆12 Updated 6 months ago
- ☆12 Updated 6 months ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables ☆20 Updated 2 weeks ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆32 Updated last month
- Tree-based indexes for neural-search ☆32 Updated last year
- Model implementation for the contextual embeddings project ☆26 Updated this week
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated last week
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- Pre-train Static Word Embeddings ☆76 Updated this week
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10 Updated last year
- Use sync mode Playwright interactively, inside a Jupyter notebook ☆14 Updated 2 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆50 Updated 8 months ago
- A sample pattern for running CI tests on Modal ☆18 Updated last month
- llama.cpp gguf file parser for javascript ☆42 Updated 5 months ago
- Tensor library for Zig ☆11 Updated 6 months ago
- ☆13 Updated 9 months ago