This will download and process the Google Ngram data.
☆25Nov 29, 2022Updated 3 years ago
Alternatives and similar repositories for raw-data-google-ngram
Users that are interested in raw-data-google-ngram are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆109Aug 14, 2023Updated 2 years ago
- Tools in python for dealing with Google Books Ngram files and other similar data sets.☆19May 7, 2014Updated 12 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- ☆18May 11, 2021Updated 5 years ago
- Data on verb transitivity in English and script to extract transitivity information from Google's syntactic ngrams corpus☆12Oct 1, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A LibreOffice extension that converts JabRef references to plain text code and vice versa so that you can use your references with MS Off…☆13Aug 15, 2024Updated last year
- ☆98Aug 1, 2021Updated 4 years ago
- Python implementation of "How quantifying the shape of stories predicts their success" by Toubia et al.☆11Jan 6, 2023Updated 3 years ago
- Dialog2Flow: convert your dialogs to flows. This repository accompanies the paper "Dialog2Flow: Pre-training Soft-Contrastive Sentence Em…☆20Jul 1, 2025Updated 11 months ago
- All the words from Google Books, sorted by frequency☆126Jul 4, 2023Updated 2 years ago
- Scrapes Google Books Ngram data to create a long word list☆14Feb 24, 2024Updated 2 years ago
- Software for multi-level annotation of linguistic corpora☆17Jan 15, 2020Updated 6 years ago
- Shell scripts to assist downloading & processing the Google n-grams corpora☆13Apr 26, 2017Updated 9 years ago
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"☆11Jul 13, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Neural morphological disambiguation for Turkish. Implemented in DyNet☆11Sep 12, 2019Updated 6 years ago
- ☆12Apr 2, 2026Updated 2 months ago
- Automated Semantic Analysis of Discourse Markers☆11May 30, 2022Updated 4 years ago
- NetLogo Code for Modeling Social Behavior by Paul Smaldino☆45Jul 31, 2024Updated last year
- Sentence generation system for evaluating composition, described in Ettinger et al. (2018) "Assessing Composition in Sentence Vector Repr…☆16Apr 25, 2020Updated 6 years ago
- This code accompanies the paper "Information-Theoretic Probing for Linguistic Structure" published in ACL 2020.☆21Apr 27, 2020Updated 6 years ago
- A new Turkish Dependency Treebank in UD style☆16Aug 17, 2020Updated 5 years ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Jun 17, 2024Updated 2 years ago
- A Python script to convert vobsub subtitles into srt format using tesseract for ocr☆10Sep 28, 2014Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- atmaCup #11 の Public 4th / Private 5th Solution のリポジトリです。☆12Aug 3, 2021Updated 4 years ago
- Entitypedia is an Extended Named Entity Dictionary from Wikipedia.☆13Dec 7, 2022Updated 3 years ago
- ☆14Jan 4, 2026Updated 5 months ago
- Everyone can be Pictogram-san☆17Aug 6, 2021Updated 4 years ago
- ☆15Sep 15, 2019Updated 6 years ago
- ☆12Jul 26, 2016Updated 9 years ago
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…☆12Aug 14, 2024Updated last year
- Pre-training character n-gram embeddings☆23Nov 1, 2023Updated 2 years ago
- Boğaziçi University Annotation Tool for Dependency Parsing☆14Oct 31, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- NGRAMS is a search engine for the Google Books Ngram Dataset. This repository contains documentation, discussions, announcements, and iss…☆23May 5, 2026Updated last month
- Edit and create Kubernetes job from cronjob template using your EDITOR☆18Apr 8, 2025Updated last year
- Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni…☆18Oct 19, 2020Updated 5 years ago
- A neural text style transfer model☆12Jun 23, 2019Updated 6 years ago
- React app that highlights relevant segments in a PDF document based on user questions using natural language processing and AI context se…☆11May 18, 2023Updated 3 years ago
- Data from the Sequoia treebank.☆11May 6, 2026Updated last month
- A simple module for updating zotflies directories.☆20Sep 11, 2020Updated 5 years ago