ntropy-network / ML-tools
ML tools that we use internally and which you may find useful too.
☆23 · Updated 2 years ago
Related projects:
- Using short models to classify long texts ☆20 · Updated last year
- A set of methods for finding an appropriate number of topics in a text collection ☆14 · Updated last month
- A library for squeakily cleaning and filtering language datasets. ☆45 · Updated last year
- ☆22 · Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 · Updated 2 years ago
- Ranking of fine-tuned HF models as base models. ☆35 · Updated last year
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆77 · Updated 6 months ago
- PyTorch implementation for MRL ☆17 · Updated 6 months ago
- NLP Examples using the 🤗 libraries ☆42 · Updated 3 years ago
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022) ☆13 · Updated last year
- Repository for my master thesis on automated string handling ☆16 · Updated 3 years ago
- ☆17 · Updated last month
- Generating Training Data Made Easy ☆43 · Updated 4 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆56 · Updated last year
- Source code and data for Like a Good Nearest Neighbor ☆28 · Updated 7 months ago
- ☆75 · Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data; it should also work with any Hugging Face text dataset. ☆91 · Updated last year
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models. ☆25 · Updated 3 years ago
- Experiments on GPT-3's ability to fit numerical models in-context. ☆14 · Updated 2 years ago
- ☆20 · Updated 3 years ago
- ☆27 · Updated last year
- Implementation of N-Grammer in Flax ☆16 · Updated last year
- Public repository holding examples for dataheroes library ☆19 · Updated 2 weeks ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆66 · Updated last year
- ☆41 · Updated last year
- ☆30 · Updated 2 years ago
- ☆31 · Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆21 · Updated last year
- My explorations into editing the knowledge and memories of an attention network ☆34 · Updated last year