orgtre/google-books-ngram-frequency

Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code

☆111

Alternatives and similar repositories for google-books-ngram-frequency

Users who are interested in google-books-ngram-frequency are comparing it to the repositories listed below.

ngrams-dev / general
NGRAMS is a search engine for the Google Books Ngram Dataset. This repository contains documentation, discussions, announcements, and iss…
☆23 · May 5, 2026 · Updated 2 months ago
piantado / ngrampy
Tools in Python for dealing with Google Books Ngram files and other similar data sets.
☆19 · May 7, 2014 · Updated 12 years ago
0xVavaldi / RuleOptimizer-CUDA-hashcat
An application that optimizes Hashcat rules using set coverage optimization theory based on rule performance.
☆21 · May 18, 2026 · Updated 2 months ago
dimazest / google-ngram-downloader
☆98 · Aug 1, 2021 · Updated 4 years ago
sts10 / common_word_list_maker
Scrapes Google Books Ngram data to create a long word list
☆14 · Feb 24, 2024 · Updated 2 years ago
hashpwn / rules
Top hashpwn rules
☆21 · Dec 12, 2025 · Updated 7 months ago
0xVavaldi / ruleprocessorY
Rule Processor Y is a next-gen Rule processor with complex multibyte character support built to support Hashcat
☆39 · Nov 25, 2025 · Updated 8 months ago
0xVavaldi / gramify
Create n-grams of wordlists based on words, characters, or charsets to use in offline password attacks and data analysis
☆34 · Jun 27, 2024 · Updated 2 years ago
ottiram / MMAX2
Official repo of the MMAX2 annotation tool
☆14 · May 24, 2023 · Updated 3 years ago
ephialtes-t / shenbao-metadata
☆12 · Aug 24, 2022 · Updated 3 years ago
blcuicall / YACLC
Yet Another Chinese Learner Corpus
☆77 · Jan 10, 2022 · Updated 4 years ago
dadevel / shells
Collection of Reverse, Bind & Web Shells
☆16 · Mar 25, 2026 · Updated 4 months ago
NathanDuran / CAMS-Dialogue-Annotation
Label dialogue with Dialogue Acts and Adjacency Pairs
☆12 · Jun 20, 2023 · Updated 3 years ago
ret2src / OSCP-Exam-Report-Template-Markdown
Markdown Templates for Offensive Security OSCP, OSWE, OSCE, OSEE, OSWP exam report
☆10 · Nov 28, 2024 · Updated last year
Scorpion-Security-Labs / OpenHashAPI
OpenHashAPI provides a secure method of communicating hashes and enables lightweight workflows for security practitioners and enthusiasts…
☆13 · Oct 27, 2024 · Updated last year
DanaEpp / APIDiscovery
An extension for Burp's Web Vulnerability Scanner that can detect API discovery metadata and extract data useful during recon.
☆19 · Sep 13, 2025 · Updated 10 months ago
0xVavaldi / Targinator
A combinator tool
☆15 · Jul 25, 2025 · Updated 11 months ago
pry0cc / lazy
This is a lazy enumeration script made to make bug bounty enum & pentest flyovers easy as cake!
☆13 · Jun 13, 2020 · Updated 6 years ago
Cynosureprime / rulechef
A Markov-based rule generator for hashcat/mdxfind/jtr
☆25 · Dec 8, 2025 · Updated 7 months ago
cisnlp / parcoure
ParCourE - Parallel Corpus Explorer
☆12 · Dec 27, 2021 · Updated 4 years ago
molangning / fire-av
Fire-AV is a collection of lists that you can use to block AV providers and bad IPs
☆24 · Updated this week
yobabyte / decryptocollection
A personal collection of scripts for decrypting various things.
☆21 · Feb 20, 2023 · Updated 3 years ago
ivre / masscan
IVRE's fork of the famous TCP port scanner. See below for details.
☆39 · Jan 28, 2025 · Updated last year
biu9 / cc98-summary
☆13 · Apr 2, 2026 · Updated 3 months ago
blculyn / The-spoken-L1-corpus
The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…
☆23 · Aug 2, 2021 · Updated 4 years ago
Wh04m1001 / GamingServiceEoP5
☆30 · May 16, 2024 · Updated 2 years ago
byungdoh / llm_surprisal
Surprisal calculation using HuggingFace LMs ("Frequency Explains the Inverse Correlation of Large Language Models’ Size, Training Data Am…
☆23 · Mar 7, 2024 · Updated 2 years ago
liamg / clinch
Go CLI toolkit
☆20 · Dec 18, 2023 · Updated 2 years ago
badsectorlabs / ludus_adcs
An Ansible Role that installs ADCS on Windows Server and optionally configures Certified Preowned templates.
☆23 · Mar 20, 2026 · Updated 4 months ago
ziyin-dl / ngram-word2vec
☆18 · May 11, 2021 · Updated 5 years ago
kristopherkyle / TAASSC
Tool for the Automatic Analysis of Syntactic Sophistication and Complexity
☆31 · Nov 4, 2023 · Updated 2 years ago
Reconfirefly / drugwars
Recreation of the classic drug trading game "dope wars"
☆11 · May 9, 2019 · Updated 7 years ago
Anterotesis / historical-texts
Collections of English historical texts and data relating to them
☆19 · Mar 24, 2021 · Updated 5 years ago
aqilc / cozyweb
Single header C networking libraries for games and casual use.
☆23 · Jan 24, 2026 · Updated 6 months ago
rossellhayes / nombre
Number names in R 2️⃣💬
☆14 · Mar 15, 2024 · Updated 2 years ago
expo / expo-webpack-integrations
Packages used to integrate Expo in Webpack-based projects.
☆13 · Jan 17, 2024 · Updated 2 years ago
venkatapgummadi / ascend
DevSecOps for the AI-era CI/CD pipeline. Catches the bugs your AI coding assistant introduces — before they reach production.
☆20 · Jul 14, 2026 · Updated last week
jorgeorchilles / presentations
Slides and materials for conference presentations
☆11 · Jun 4, 2023 · Updated 3 years ago
RomelSan / hackers-dont-give-a-shit
Hackers Don't Give A Shit
☆16 · Feb 2, 2020 · Updated 6 years ago