etalab-ia / piaf-code
This repo contains the code used to generate the French Wikipedia sample for the PIAF QA annotation project.
☆11 · Updated 3 years ago
Related projects:
- French Machine Reading for Question Answering ☆18 · Updated last year
- OpenNeuroSpell contains parts of NeuroSpell (http://neurospell.com/en.php) released as open-source. More code will be published as soon a… ☆20 · Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and … ☆49 · Updated 3 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆85 · Updated 2 months ago
- ✨ Web interface for NeuralCoref coreference resolution ☆34 · Updated last year
- Easy-to-use text representations extraction library based on the Transformers library. ☆32 · Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings ☆87 · Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 · Updated 2 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow ☆14 · Updated last year
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline. ☆85 · Updated 3 years ago
- Question Answering annotation platform ☆87 · Updated 3 years ago
- ☆64 · Updated last year
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end. ☆69 · Updated last year
- No Teacher BART distillation experiment for NLI tasks ☆25 · Updated 3 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spaCy provide state of … ☆61 · Updated 4 years ago
- A web interface to understand language-specific BERT-models ☆17 · Updated 5 months ago
- BERT models for many languages created from Wikipedia texts ☆34 · Updated 4 years ago
- A collection of over 1.5 million tweets translated to French, with their sentiment. ☆35 · Updated 7 years ago
- Sequence tagging with spaCy and crfsuite ☆18 · Updated last year
- Python SDK for the TextRazor Text Analytics API ☆20 · Updated last year
- The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model". ☆22 · Updated 3 years ago
- Numeric fused-head identification and resolution ☆33 · Updated 4 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec ☆19 · Updated last year
- Text Similarity Search Application using Modern NLP and Elasticsearch ☆29 · Updated 4 years ago
- Example using Polyaxon to experiment with pre-training spaCy ☆65 · Updated 2 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages ☆11 · Updated 7 months ago
- A simple neural truecaser written in PyTorch and AllenNLP. ☆31 · Updated 3 months ago
- Generate BERT vocabularies and pretraining examples from Wikipedias ☆18 · Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆72 · Updated 2 months ago
- Communication about the pseudonymization engine of the Cour de Cassation ☆17 · Updated last year