Unsupervised Language Model Pre-training for French
☆247Apr 11, 2023Updated 2 years ago
Alternatives and similar repositories for Flaubert
Users that are interested in Flaubert are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- How good is BERT ? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset☆156Feb 16, 2023Updated 3 years ago
- A repository of instructions in French to fine-tune LLMs☆17Jun 23, 2023Updated 2 years ago
- Multilingual speech translation☆41Apr 15, 2021Updated 4 years ago
- ☆16Dec 8, 2022Updated 3 years ago
- SEM, a free NLP tool relying on machine learning technologies, especially CRFs.☆23Dec 1, 2021Updated 4 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A collection of over 1.5 Million tweets data translated to French, with their sentiment.☆35May 18, 2017Updated 8 years ago
- Disambiguate is a tool for training and using state of the art neural WSD models☆60Jul 12, 2025Updated 8 months ago
- French word embeddings from series sub-titles☆22Sep 2, 2018Updated 7 years ago
- 📧 Melusine: Use python to automatize your email processing workflow☆363Feb 26, 2026Updated last month
- McKernel: A Library for Approximate Kernel Expansions in Log-linear Time.☆13Sep 3, 2022Updated 3 years ago
- 🇧🇪 BelGPT-2: the 1st GPT model pretrained in French.☆34Feb 24, 2021Updated 5 years ago
- R package for Byte Pair Encoding based on YouTokenToMe☆16Sep 5, 2025Updated 6 months ago
- ✒️ Cedille is a large French language model (6B), released under an open-source license☆204Feb 9, 2022Updated 4 years ago
- Weighted multiple-instance learning algorithm based on stochastic gradient descent☆12Feb 22, 2019Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆43Jan 3, 2022Updated 4 years ago
- The stand-alone training engine module for the ALOHA.eu project.☆15Oct 27, 2019Updated 6 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Oct 19, 2019Updated 6 years ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Jan 4, 2021Updated 5 years ago
- Semeval-2021 Multilingual and Cross-lingual Word-in-Context Task☆18May 27, 2021Updated 4 years ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Apr 1, 2024Updated last year
- A python tool for building large scale Wikipedia-based Information Retrieval datasets☆46Apr 28, 2021Updated 4 years ago
- ☆25Dec 15, 2025Updated 3 months ago
- A simple frontend for https://github.com/etalab/csvapi☆38Feb 13, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆20May 29, 2016Updated 9 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- Programming for Historians☆17Sep 12, 2022Updated 3 years ago
- Small examples showing how to use Odin for various IE tasks☆16Jun 1, 2017Updated 8 years ago
- pytorch attentional NMT(with NLL, MRT, REINFORCE, MIXER training objectives)☆13May 12, 2017Updated 8 years ago
- WiNER-fr is a free named entity corpus using French Wikinews texts.☆17Feb 12, 2021Updated 5 years ago
- Efficient learning of word representations☆22Feb 15, 2021Updated 5 years ago
- Command line tool using crossref.org's API to search DOIs and obtain formatted citations such as bibtex, apa, and a lot more☆15Oct 23, 2014Updated 11 years ago
- Matrix tools for building and inspecting latent spaces☆27Aug 19, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Initier la mise à disposition, pour tout citoyen, de techniques d’Intelligence Artificielle destinées à appréhender le nombre important d…☆11Aug 20, 2024Updated last year
- Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…☆13Aug 12, 2024Updated last year
- WordNet-LMF formats☆27Feb 4, 2026Updated last month
- A word2vec negative sampling implementation with correct CBOW update.☆261Nov 8, 2021Updated 4 years ago
- T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf☆19May 28, 2025Updated 10 months ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Feb 2, 2023Updated 3 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆89Dec 9, 2020Updated 5 years ago