MiniXC / opensubtitles-dataloader
Loads the OpenSubtitles v2018 dataset without loading everything into memory at once. Works well with PyTorch.
☆13 Updated 5 years ago
Alternatives and similar repositories for opensubtitles-dataloader
Users who are interested in opensubtitles-dataloader are comparing it to the libraries listed below.
- NoPdb: Non-interactive Python Debugger ☆84 Updated 3 years ago
- Question Generation - Question Answering for Automatic Flashcards ☆66 Updated 3 years ago
- Test prompts for GPT-J-6B and the resulting AI-generated texts ☆53 Updated 4 years ago
- Conversational text Analysis using various NLP techniques ☆182 Updated 2 years ago
- Efficiently computing & storing token n-grams from large corpora ☆26 Updated last year
- A slightly opinionated iPython profile for interactive development ☆23 Updated 3 years ago
- 🕊️ Radically lightweight command-line interfaces ☆108 Updated 3 months ago
- An opinionated, organized way to start and manage data science experiments. ☆15 Updated 5 years ago
- Run compute jobs on AWS as if you were running them locally. ☆124 Updated 4 years ago
- A utility for labeling clusters of text data. ☆28 Updated 4 years ago
- A corpus of Python programs annotated with contracts ☆25 Updated 2 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ☆68 Updated 3 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆56 Updated 4 years ago
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert… ☆50 Updated 4 years ago
- Vectory provides a collection of tools to track and compare embedding versions. ☆71 Updated 3 years ago
- Visual Automata is a Python 3 library built as a wrapper for the Automata library to add more visualization features. ☆57 Updated 2 years ago
- Weird A.I. Yankovic neural-net based lyrics parody generator ☆84 Updated 3 years ago
- Flenser is a simple, minimal, automated exploratory data analysis tool. ☆78 Updated 7 months ago
- Experiments with generating GPT-2 fanfiction on specified topics. ☆11 Updated 6 years ago
- ⦠ Angle: new speakable syntax for python 💡 ☆132 Updated last year
- Cortex-compatible model server for Python and TensorFlow ☆17 Updated 3 years ago
- A python module for word inflections designed for use with spaCy. ☆93 Updated 5 years ago
- Confection: the sweetest config system for Python ☆192 Updated last week
- Extremely easy to use sequence to sequence library with attention, for text to text conversion tasks. ☆39 Updated 5 years ago
- ☆70 Updated 3 years ago
- Loadable spellfix1 extension for sqlite as python package ☆26 Updated last year
- Abydos NLP/IR library for Python ☆193 Updated 3 years ago
- spaCy match and replace, maintaining conjugation ☆36 Updated 3 years ago
- Language detection using Spacy and Fasttext ☆57 Updated 2 years ago
- A [personal]<-[notebook]->[network]. Complete with custom numerics for constrained Gaussian gravitation physics. ☆22 Updated 3 years ago