dizzylogicc / WikiParser
Fast C++ based parser for English Wikipedia
☆15Updated 3 years ago
Alternatives and similar repositories for WikiParser:
Users that are interested in WikiParser are comparing it to the libraries listed below
- 🎲 An efficient implementation of a probabilistic Context Free Grammar parser in Javascript☆55Updated 2 years ago
- Examples of intrusive container templates in C++.☆9Updated 6 years ago
- SALM: Suffix Array and its Applications in Empirical Language Processing by Joy☆11Updated 7 years ago
- A library for generating compile time parsers parsing embedded DSL code as part of the C++ compilation process☆43Updated 2 months ago
- C++ library for bit twiddling☆37Updated 6 years ago
- Software Language Processing Suite☆45Updated 3 years ago
- String Matching Algorithms Research Tool☆99Updated 10 months ago
- Inverted file indexing and retrieval optimized for short texts. Supports auto-suggest and query segment classification.☆33Updated last year
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆67Updated 2 weeks ago
- Compile-time TRIE based string matcher (C++11)☆52Updated 5 years ago
- FoLiA library for C++☆16Updated 3 weeks ago
- ☆21Updated 8 years ago
- C++ metaprogramming shell☆23Updated last year
- A library for reusable parsers☆16Updated 4 years ago
- SymSpell C++ Ports☆31Updated 6 years ago
- LALR(1) parser for C++☆78Updated 7 months ago
- *Unofficial* mirror of https://bitbucket.org/MDukhan/yeppp☆38Updated 8 years ago
- Fast fuzzy regex matcher: specify max edit distance to find approximate matches. FuzzyMatcher is now included in RE/flex.☆36Updated 2 weeks ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆126Updated 2 months ago
- PPM compressor with high compression ratio.☆32Updated 6 years ago
- devector and batch_deque containers for C++. See more at: http://erenon.hu/double_ended☆15Updated 7 years ago
- zero-copy, zero-serialize, zero-hassle protocol buffers☆56Updated 7 years ago
- Boost.org random module☆35Updated this week
- C++17 implementation of memory-efficient dynamic tries☆58Updated 3 years ago
- JAXN: A standard for extended JSON☆19Updated 3 years ago
- ☆14Updated 9 years ago
- Learning Based Java (LBJava)☆13Updated 2 years ago
- Lightweight C++ translator for OpenNMT Torch models (deprecated)☆79Updated 4 years ago
- Rolling Hash C++ Library☆188Updated 11 months ago