CoFiF / Corpus
The first French corpus comprising financial reports
☆13 · Updated 4 years ago
Alternatives and similar repositories for Corpus:
Users who are interested in Corpus are comparing it to the repositories listed below
- Dutch abusive language data ☆11 · Updated last year
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier. ☆26 · Updated 3 years ago
- Comprehensive Python library for speech and voice. ☆33 · Updated 2 years ago
- Applying Snorkel to SuperGLUE ☆23 · Updated 5 years ago
- ☆17 · Updated 6 months ago
- Bots for reviewing the credibility of web content: articles, tweets, sentences and websites ☆9 · Updated 2 years ago
- ☆22 · Updated 7 months ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition ☆31 · Updated 3 years ago
- TensorFlow port implementation of Single Headed Attention RNN ☆16 · Updated 5 years ago
- Instance Neighbouring by using Knowledge ☆16 · Updated 4 months ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/ ☆12 · Updated 7 months ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171. ☆15 · Updated 2 years ago
- Python package for calculating famous measures in computational linguistics ☆13 · Updated 3 months ago
- Minimal code to train ELMo models in recent versions of TensorFlow ☆14 · Updated last year
- A framework to identify relations between ideas in temporal text corpora. ☆28 · Updated 6 years ago
- BirdSpotter is a Python package which provides an influence and bot detection toolkit for Twitter. ☆19 · Updated 3 years ago
- Weighted multiple-instance learning algorithm based on stochastic gradient descent ☆11 · Updated 5 years ago
- A list of notes about NLP papers ☆36 · Updated 6 years ago
- Jupyter notebook widget to quickly label text data ☆47 · Updated 6 years ago
- Scripts for ECML PKDD 2018 article: Similarity encoding for learning with dirty categorical variables ☆11 · Updated 6 years ago
- Quantlets of textmining projects ☆12 · Updated 3 weeks ago
- Automatically modelling and distilling knowledge within AI. In other words, summarising the AI research firehose. ☆21 · Updated 5 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 · Updated 2 years ago
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset. ☆11 · Updated 3 years ago
- Model for learning document embeddings along with their uncertainties ☆35 · Updated last year
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 · Updated 2 years ago
- Combining encoder-based language models ☆11 · Updated 3 years ago
- Introduction Notebook to Extreme Multi-Label Classification problem (XML) ☆22 · Updated 6 years ago
- Breaks a word into syllables using an LSTM-based neural network. ☆19 · Updated last year
- Tool for the Automatic Assessment of Lexical Diversity ☆11 · Updated 4 years ago