stared / which-ml-are-you
Which ML are you?
☆12 Updated last year
Related projects
Alternatives and complementary repositories for which-ml-are-you
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?" ☆20 Updated 4 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers. ☆11 Updated 4 years ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model" ☆28 Updated 2 years ago
- numeric fused-head identification and resolution ☆33 Updated 5 years ago
- Easily deploy a state-of-the-art language model from HuggingFace's Transformers ☆12 Updated 4 years ago
- Simplifying parsing of large jsonline files in NLP Workflows ☆12 Updated 2 years ago
- ☆18 Updated 9 years ago
- A utility for labeling clusters of text data. ☆28 Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 Updated last year
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms ☆14 Updated 2 years ago
- A web interface to understand language-specific BERT-models ☆17 Updated 7 months ago
- Featurize words into orthographic and phonological vectors. ☆40 Updated last year
- An extension package of 🤗 Datasets that provides support for executing arbitrary SQL queries on HF datasets ☆31 Updated 9 months ago
- ☆22 Updated 2 years ago
- Deep Learning for Language Workshop prepared for the AI For Social Good Summer Lab, 2018 ☆23 Updated 2 years ago
- Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paper ☆11 Updated last year
- The stand-alone training engine module for the ALOHA.eu project. ☆15 Updated 5 years ago
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 2 years ago
- Code for my blog post on Generating Words from Embeddings ☆23 Updated 3 months ago
- An easy way to start a python programming environment using GitHub Codespaces. ☆15 Updated 4 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow ☆14 Updated last year
- High-performance tokenized language data-loader for Python C++ extension ☆12 Updated 3 months ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ☆26 Updated last year
- Experiments with Hugging Face 🔬 🤗 ☆45 Updated 3 months ago
- Neural Machine Translation for South African Languages ☆37 Updated last year
- Tooling to play around with multilingual machine translation for Indian Languages. ☆21 Updated 2 years ago
- Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study th… ☆16 Updated 8 months ago
- Code for "Re-evaluating Word Mover’s Distance" (ICML 2022) ☆38 Updated 2 years ago