marepilc / pink-parquetLinks
User-friendly viewer for Parquet files
☆9Updated 8 months ago
Alternatives and similar repositories for pink-parquet
Users that are interested in pink-parquet are comparing it to the libraries listed below
Sorting:
- Lightweight Hybrid Search and Reranking☆10Updated 3 months ago
- My NER Experiments with ModernBERT☆21Updated last month
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- ANE accelerated embedding models!☆18Updated 6 months ago
- ☆15Updated last year
- efficient query encoding for dense retrieval☆11Updated 10 months ago
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri…☆15Updated 3 months ago
- ☆18Updated last year
- Training code for Sparse Autoencoders on Embedding models☆38Updated 4 months ago
- ☆23Updated last month
- Code for SaGe subword tokenizer (EACL 2023)☆25Updated 6 months ago
- ☆11Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆20Updated last week
- ☆10Updated 2 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆50Updated 9 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 10 months ago
- ☆15Updated 2 months ago
- code for training and using chess embeddings models☆12Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated 3 months ago
- 🦖 X—LLM: Simple & Cutting Edge LLM Finetuning☆11Updated last year
- ☆12Updated 6 months ago
- ☆23Updated 6 months ago
- ☆13Updated 10 months ago
- ☆39Updated 2 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆59Updated last month
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- MEXMA: Token-level objectives improve sentence representations☆41Updated 5 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆20Updated 6 months ago