marepilc / pink-parquetLinks
User-friendly viewer for Parquet files
☆9Updated 8 months ago
Alternatives and similar repositories for pink-parquet
Users that are interested in pink-parquet are comparing it to the libraries listed below
Sorting:
- Lightweight Hybrid Search and Reranking☆10Updated 4 months ago
- ☆12Updated 3 months ago
- Nodejs script to run an LLM prompt across a bunch of models.☆9Updated 6 months ago
- My NER Experiments with ModernBERT and Ettin☆21Updated this week
- ☆21Updated 2 months ago
- Training code for Sparse Autoencoders on Embedding models☆38Updated 4 months ago
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri…☆15Updated 4 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆24Updated 7 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Updated 5 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models☆12Updated 4 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Updated 4 months ago
- Code for SaGe subword tokenizer (EACL 2023)☆25Updated 7 months ago
- ANE accelerated embedding models!☆18Updated 7 months ago
- ☆11Updated 9 months ago
- Friday Agents. App: https://chat.toolstack.run/☆11Updated 7 months ago
- ☆38Updated last year
- Forecastbench Datasets, updated nightly☆12Updated this week
- ☆56Updated 2 months ago
- LangChain-Kuzu integration☆10Updated 3 months ago
- ☆40Updated 2 months ago
- ☆10Updated 2 weeks ago
- a tool for gerenate dataset from doc☆12Updated 3 months ago
- ☆38Updated this week
- ☆15Updated last year
- https://footprints.baulab.info☆17Updated 9 months ago
- PathPiece tokenizer☆12Updated 8 months ago
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆14Updated 7 months ago
- ☆12Updated 7 months ago