User-friendly viewer for Parquet files
☆10Jan 10, 2026Updated last month
Alternatives and similar repositories for pink-parquet
Users that are interested in pink-parquet are comparing it to the libraries listed below
Sorting:
- My NER Experiments with ModernBERT and Ettin☆26Jul 17, 2025Updated 7 months ago
- FlexiTokens☆18Dec 27, 2025Updated 2 months ago
- Python Module implementing SRP☆12Jul 29, 2022Updated 3 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 9 months ago
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"☆18May 15, 2025Updated 9 months ago
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- DImensionality REduction in JAX☆25Nov 21, 2025Updated 3 months ago
- Korean Nested Named Entity Corpus☆20May 13, 2023Updated 2 years ago
- Contextualized per-token embeddings☆34May 11, 2025Updated 9 months ago
- ☆19Oct 24, 2023Updated 2 years ago
- Snappy decompression with WebAssembly☆28Feb 22, 2026Updated 2 weeks ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Notebooks and other course materials for Emory QTM 340 (Fall 2022)☆12Dec 13, 2022Updated 3 years ago
- fine-tuning tutorial☆18Feb 20, 2026Updated 2 weeks ago
- Mathematical foundations of data analysis, Winter semester 22-23☆13Jan 31, 2023Updated 3 years ago
- DOS Program Development☆13Nov 9, 2022Updated 3 years ago
- A library for probing Stockfish's NNUEs. The code for reading parameters and forward propagation is taken from Stockfish☆12Nov 18, 2025Updated 3 months ago
- A complete pipeline for fine-tuning YOLOv8 pose models with custom datasets. Supports automatic and semi-automatic annotation for efficie…☆15Feb 9, 2025Updated last year
- Training code for Sparse Autoencoders on Embedding models☆39Feb 27, 2025Updated last year
- ☆11Dec 6, 2023Updated 2 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- ☆10Oct 27, 2023Updated 2 years ago
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- Redis distributed lock implementation for Python based on Pub/Sub messaging☆11Feb 14, 2026Updated 3 weeks ago
- LightGBM for handling label-imbalanced data with focal and weighted loss functions in binary and multiclass classification☆21Jan 29, 2026Updated last month
- ☆14Dec 12, 2022Updated 3 years ago
- ☆10Jan 9, 2024Updated 2 years ago
- Less-Resilient MapReduce for Go☆10Feb 15, 2023Updated 3 years ago
- 2021 Line Webtoon Year-in-Review Project :: Animation Production☆10Feb 21, 2023Updated 3 years ago
- Hunt Town is a web3 co-building community where builders come together to contribute to the expansion of web3 culture and products.☆14Jan 15, 2026Updated last month
- ☆11Jun 17, 2024Updated last year
- Yet Another Color Gamut Visualizer☆12Jun 1, 2019Updated 6 years ago
- Git plugin for Commit Driven Development☆10Dec 20, 2019Updated 6 years ago
- Token-aware HTML chunking that preserves structure and attributes, with optional cleaning and attribute length control.☆15Aug 12, 2025Updated 6 months ago
- Amazon Bedrock 의 Nova, Claude 3.7 모델을 활용하여 pdf 도면을 파싱 합니다.☆12May 19, 2025Updated 9 months ago
- 🛰️ Assets for Station☆13Aug 18, 2024Updated last year
- 코로나19 발생현황 변동 및 새 공지사항 푸시알림 서비스(질병관리본부 코로나19 홈페이지 데이터 이용)☆12Jan 5, 2023Updated 3 years ago