Simple, standalone python classes for training statistical language models using several popular smoothing methods.
☆25Nov 3, 2012Updated 13 years ago
Alternatives and similar repositories for SimpleLM
Users that are interested in SimpleLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An implementation of a HMM Ngram language model.☆11Mar 12, 2015Updated 11 years ago
- Geometry-aware Multilingual Embeddings☆26Dec 8, 2022Updated 3 years ago
- pytorch CTC implementation for ASR. Use eesen's fst decoder framework☆10Feb 27, 2020Updated 6 years ago
- Python code to automatically produce a summary of a piece of text.☆12Sep 8, 2016Updated 9 years ago
- Reusable code for Python so I don't have to write the same thing twice!☆12Feb 1, 2019Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A post-processing method to create fair rankings wrt ranked group fairness☆15Jun 5, 2019Updated 6 years ago
- The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)☆33Sep 22, 2025Updated 6 months ago
- The AcousticBrainz Genre Dataset☆32Jan 30, 2023Updated 3 years ago
- Command-line corpus tools☆12May 15, 2017Updated 8 years ago
- Context Aware Language Models☆28Jul 3, 2018Updated 7 years ago
- Contextual Lemmatization and Morphological Tagging in 100 different languages. A Participant System for SigMorphon2019 Task 2☆24Jul 25, 2024Updated last year
- Kneser-Ney implementation in Python☆85Dec 31, 2015Updated 10 years ago
- Fast, accurate, lightweight, multi-core ML in Python, leveraging Vowpal Wabbit☆21May 26, 2018Updated 7 years ago
- Scripts for exporting Kaldi labeled data into TensorFlow☆12Jul 31, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for the benchmark containing dataset, models and metrics for productive concept learning -- a kind of compositional reasoning task t…☆17Jul 22, 2021Updated 4 years ago
- FluRS: A Python library for streaming recommendation algorithms☆107Feb 8, 2022Updated 4 years ago
- Generalized Language Modeling toolkit☆52Jun 21, 2022Updated 3 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 8 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago
- Personal Infrastructure for Deep Learning based on Pytorch and Tensorflow☆10Jan 10, 2019Updated 7 years ago
- btcturk.com api client☆17Mar 26, 2019Updated 7 years ago
- ☆13Feb 20, 2020Updated 6 years ago
- Neural morphological disambiguation for Turkish. Implemented in DyNet☆11Sep 12, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Python binding for SRI Language Modeling Toolkit implemented in Cython☆30Jan 24, 2022Updated 4 years ago
- Example of how to use an ViewSwitcher to switch between two ImageView objects☆13Dec 16, 2012Updated 13 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago
- finding set bits in large bitmaps☆15Nov 30, 2015Updated 10 years ago
- Wrapping of Bootstrap CSS as Polymer dom-module to be used as shared styles☆10Aug 9, 2017Updated 8 years ago
- Lemmatizer for indonesian language☆12Mar 19, 2013Updated 13 years ago
- A really fast document ranking engine using BM25 and TF-IDF. Based on Python using NLP packages NLTK and spacY.☆16May 8, 2018Updated 7 years ago
- Online Bootcamp Student Project Presentation☆14Oct 22, 2017Updated 8 years ago
- A dataset of semantically related sentence pairs in the German legal domain☆10Feb 26, 2021Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- SIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model☆36Aug 2, 2017Updated 8 years ago
- Poetry Annotated with Rhyme Schemes☆25Nov 22, 2011Updated 14 years ago
- Hadoop MapReduce training of modified Kneser-Ney smoothed language models☆28Jun 12, 2018Updated 7 years ago
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆13Apr 8, 2022Updated 4 years ago
- MXNet/Gluon implement of L-GM-Loss☆11Oct 17, 2018Updated 7 years ago
- Y-Bot is the primary bot for use with Program-Y☆12May 26, 2021Updated 4 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago