Developing tools to automatically analyze datasets
☆75Oct 29, 2024Updated last year
Alternatives and similar repositories for data-measurements-tool
Users that are interested in data-measurements-tool are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- 3rd party dependencies for DALI project☆11Mar 10, 2026Updated 2 weeks ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆16Jan 16, 2024Updated 2 years ago
- DALLE-tools provided useful dataset utilities to improve you workflow with WebDatasets.☆14Mar 9, 2022Updated 4 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 5 months ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 2 years ago
- **ARCHIVED** Filesystem interface to 🤗 Hub☆59Apr 6, 2023Updated 2 years ago
- Hugging Face's Zapier Integration 🤗⚡️☆50Apr 12, 2023Updated 2 years ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆32Jan 4, 2023Updated 3 years ago
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper☆17Apr 19, 2024Updated last year
- ☆12May 17, 2022Updated 3 years ago
- 모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.☆11Mar 2, 2022Updated 4 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- ☆23Jun 7, 2023Updated 2 years ago
- The collection of bulding blocks building fine-tunable metric learning models☆35Jan 5, 2026Updated 2 months ago
- ☆12Jan 2, 2024Updated 2 years ago
- huggingface transformers tutorial, code, resources☆26Apr 7, 2024Updated last year
- NSMC, KorSTS ... fine-tunings☆18Feb 23, 2022Updated 4 years ago
- Yet another python binding for mecab-ko☆88May 16, 2023Updated 2 years ago
- Exploring the Efficacy of Idiomify: How Effective is GPT-3 for Teaching Idioms to EFL Writers?☆16Aug 9, 2022Updated 3 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 2 years ago
- ☆20Nov 23, 2022Updated 3 years ago
- code associated with paper "Sparse Bayesian Optimization"☆26Oct 31, 2023Updated 2 years ago
- The model implementations for T5 encoder decoder soft prompt tuning for text generation.☆25Dec 5, 2022Updated 3 years ago
- 4명의 김씨, 한명의 진씨, 한명의 임씨가 모여서 인공지능을 공부하고 있습니다.☆13Jun 30, 2021Updated 4 years ago
- ☆10Oct 6, 2021Updated 4 years ago
- 한국어 T5 모델☆55Dec 7, 2021Updated 4 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Jul 2, 2020Updated 5 years ago
- Using short models to classify long texts☆21Mar 8, 2023Updated 3 years ago
- Viewer for the 🤗 datasets library.☆86Jul 30, 2021Updated 4 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆30Apr 21, 2021Updated 4 years ago
- Evaluate Transformers from the Hub 🔥☆14Nov 27, 2023Updated 2 years ago
- A simple project that trains an OpenNLP Named Entity Recognition model to identify ingredients in a recipe.☆14Oct 30, 2016Updated 9 years ago
- ☆16Apr 24, 2024Updated last year
- 🦛 파이썬 한글 처리 라이브러리. Python Korean Morphological Analyzer☆19Feb 4, 2025Updated last year
- bpe based korean t5 model for text-to-text unified framework☆63Apr 17, 2024Updated last year
- ☆19Jan 29, 2023Updated 3 years ago
- Scaling Data-Constrained Language Models☆342Jun 28, 2025Updated 8 months ago