MiniXC / opensubtitles-dataloader
Loads the OpenSubtitles v2018 dataset without loading everything into memory at once. Works well with PyTorch.
☆13 · Updated 5 years ago
Alternatives and similar repositories for opensubtitles-dataloader
Users that are interested in opensubtitles-dataloader are comparing it to the libraries listed below
- A python module for word inflections designed for use with spaCy. · ☆93 · Updated 5 years ago
- Radically lightweight command-line interfaces · ☆109 · Updated last month
- Conversational text Analysis using various NLP techniques · ☆182 · Updated 2 years ago
- Abydos NLP/IR library for Python · ☆191 · Updated 2 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… · ☆68 · Updated 3 years ago
- Confection: the sweetest config system for Python · ☆191 · Updated 6 months ago
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert… · ☆49 · Updated 4 years ago
- Measure edit distance based on keyboard layout · ☆61 · Updated 2 weeks ago
- NoPdb: Non-interactive Python Debugger · ☆84 · Updated 3 years ago
- A utility for labeling clusters of text data. · ☆28 · Updated 4 years ago
- ☆70 · Updated 2 years ago
- Loadable spellfix1 extension for sqlite as python package · ☆26 · Updated last year
- Test prompts for GPT-J-6B and the resulting AI-generated texts · ☆53 · Updated 4 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. · ☆53 · Updated 4 years ago
- negate_sentence(A Python module that doesn't negate sentences.) · ☆31 · Updated last year
- A python package to simulate typographical errors. · ☆37 · Updated last year
- A corpus of Python programs annotated with contracts · ☆24 · Updated last week
- Efficiently computing & storing token n-grams from large corpora · ☆26 · Updated last year
- Make Thinc faster on macOS by calling into Apple's native Accelerate library · ☆101 · Updated 3 months ago
- ☆18 · Updated 3 years ago
- Lazy, a tool for running things in idle time · ☆48 · Updated 4 years ago
- Small deep learning library written from scratch in Python, using NumPy/CuPy. · ☆124 · Updated 3 years ago
- A slightly opinionated iPython profile for interactive development · ☆23 · Updated 3 years ago
- Vectory provides a collection of tools to track and compare embedding versions. · ☆71 · Updated 2 years ago
- spaCy match and replace, maintaining conjugation · ☆35 · Updated 2 years ago
- Python wrapper for Ferret · ☆43 · Updated 3 years ago
- Question Generation - Question Answering for Automatic Flashcards · ☆66 · Updated 3 years ago
- Custom Natural Language Processing with big and small models · ☆66 · Updated 4 years ago
- Super lightweight function registries for your library · ☆180 · Updated last year
- A Python 3 phonetics library. · ☆134 · Updated 5 years ago