huggingface/data-measurements-tool

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huggingface/data-measurements-tool)

huggingface / data-measurements-tool

Developing tools to automatically analyze datasets

☆75

Alternatives and similar repositories for data-measurements-tool

Users that are interested in data-measurements-tool are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NVIDIA / DALI_deps
View on GitHub
3rd party dependencies for DALI project
☆11Jul 15, 2026Updated last week
facebookresearch / lss_eval
View on GitHub
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31Aug 25, 2023Updated 2 years ago
allenai / EmbeddingRecycling
View on GitHub
Embedding Recycling for Language models
☆38Jul 11, 2023Updated 3 years ago
robvanvolt / DALLE-tools
View on GitHub
DALLE-tools provided useful dataset utilities to improve you workflow with WebDatasets.
☆14Mar 9, 2022Updated 4 years ago
monologg / ko_lm_dataformat
View on GitHub
A utility for storing and reading files for Korean LM training 💾
☆35Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
korean-named-entity / konec
View on GitHub
Korean Named Entity Corpus
☆25May 12, 2023Updated 3 years ago
Curt-Park / echo-grpc-triton
View on GitHub
Inference API server with echo and gRPC to triton server (golang)
☆13Nov 16, 2022Updated 3 years ago
ssu-humane / K-HATERS
View on GitHub
Hate speech detection corpus in Korean, shared with EMNLP 2023 paper
☆17Apr 19, 2024Updated 2 years ago
yuhongqian / ANCE-PRF
View on GitHub
☆12May 17, 2022Updated 4 years ago
facebookresearch / ketod
View on GitHub
KETOD Knowledge-Enriched Task-Oriented Dialogue
☆33Jan 4, 2023Updated 3 years ago
ko-nlp / moducorpus-sanitizer
View on GitHub
모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.
☆11Mar 2, 2022Updated 4 years ago
tlkh / t2t-tuner
View on GitHub
Convenient Text-to-Text Training for Transformers
☆18Dec 10, 2021Updated 4 years ago
LoicGrobol / decofre
View on GitHub
Neural coreference resolution
☆12Sep 3, 2024Updated last year
vinid / prodb
View on GitHub
☆18Sep 16, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kakao / kanana-2
View on GitHub
☆23Jun 30, 2026Updated 3 weeks ago
microsoft / PLOG
View on GitHub
☆23Jun 7, 2023Updated 3 years ago
google-research / precondition
View on GitHub
☆34Jul 9, 2026Updated 2 weeks ago
eubinean / k4ji_ai
View on GitHub
4명의 김씨, 한명의 진씨, 한명의 임씨가 모여서 인공지능을 공부하고 있습니다.
☆13Jun 30, 2021Updated 5 years ago
haven-jeon / KoGPT2-subtasks
View on GitHub
NSMC, KorSTS ... fine-tunings
☆18Feb 23, 2022Updated 4 years ago
MikeWangWZHL / Zemi
View on GitHub
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
☆15May 3, 2023Updated 3 years ago
yukyunglee / transformers-resources
View on GitHub
huggingface transformers tutorial, code, resources
☆26Apr 7, 2024Updated 2 years ago
AntoineSimoulin / pytree
View on GitHub
Implementation of tree-structured neural networks in PyTorch.
☆14Nov 15, 2021Updated 4 years ago
Tomiinek / Aargh
View on GitHub
☆12Jan 2, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
N-Almarwani / DCT_Sentence_Embedding
View on GitHub
Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform
☆17Jul 2, 2020Updated 6 years ago
sgugger / torchdynamo-tests
View on GitHub
☆20Nov 23, 2022Updated 3 years ago
Arborator / arborator-frontend
View on GitHub
VueJs based user interface for Arborator-flask
☆14Updated this week
damith92 / T5_encoder_decoder_prompt_tuning_for_text_generation
View on GitHub
The model implementations for T5 encoder decoder soft prompt tuning for text generation.
☆26Dec 5, 2022Updated 3 years ago
eubinean / idiomify
View on GitHub
Exploring the Efficacy of Idiomify: How Effective is GPT-3 for Teaching Idioms to EFL Writers?
☆16Aug 9, 2022Updated 3 years ago
LeeSureman / MoT
View on GitHub
code for Preprint paper at Arxiv: MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts
☆24Nov 29, 2023Updated 2 years ago
anton-l / wav2vec-toolkit
View on GitHub
A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆30Apr 21, 2021Updated 5 years ago
wisenut-research / KoT5
View on GitHub
한국어 T5 모델
☆56Dec 7, 2021Updated 4 years ago
hopsparser / hopsparser
View on GitHub
A neural dependency parser that does its best
☆17Mar 6, 2026Updated 4 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
nbroad1881 / strideformer
View on GitHub
Using short models to classify long texts
☆21Mar 8, 2023Updated 3 years ago
jjonescz / awe
View on GitHub
AI-based web extractor
☆12Feb 25, 2023Updated 3 years ago
tkhang1999 / semantic-food-search
View on GitHub
A semantic food search web application built with Django, Solr, SBERT, and Docker
☆10Apr 14, 2025Updated last year
paust-team / pko-t5
View on GitHub
bpe based korean t5 model for text-to-text unified framework
☆63Apr 17, 2024Updated 2 years ago
robhinds / opennlp-ingredient-finder
View on GitHub
A simple project that trains an OpenNLP Named Entity Recognition model to identify ingredients in a recipe.
☆14Oct 30, 2016Updated 9 years ago
hamanlp / hama-py
View on GitHub
🦛 파이썬 한글 처리 라이브러리. Python Korean Morphological Analyzer
☆19Feb 4, 2025Updated last year
mitramir55 / PassivePy
View on GitHub
PassivePy: A Tool to Automatically Identify Passive Voice in Big Text Data
☆23Mar 6, 2024Updated 2 years ago