indic-transliteration / sanscript.phpLinks
Transliteration package for Indian scripts
☆16Updated 8 years ago
Alternatives and similar repositories for sanscript.php
Users that are interested in sanscript.php are comparing it to the libraries listed below
Sorting:
- Test data for snowball stemming algorithms☆34Updated last month
- ISO Language Codes (639-1 and 639-2)☆101Updated 8 months ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆69Updated 3 weeks ago
- A client side search engine for use on static pages.☆139Updated 7 years ago
- A generator for a tree browser for categories of validated Wikisource works, in multiple languages.☆8Updated 3 years ago
- Transliteration module for Indian Languages☆78Updated last year
- Development of http://www.sanskrit-lexicon.uni-koeln.de/☆18Updated 3 months ago
- Code and data used in named entity transliteration experiments☆57Updated 7 years ago
- This a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by th…☆44Updated 3 years ago
- It finds best synonyms from Google Books when you press a hotkey☆30Updated 10 years ago
- stoplists for African languages generated from the ASP corpus☆14Updated 9 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- Helsinki Finite-State Technology (library and application suite)☆133Updated last month
- Read Web ARChive (WARC) files in PHP.☆21Updated 8 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format☆22Updated 7 years ago
- Transliteration data and models☆56Updated 8 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 8 years ago
- Lexical data at Unicode☆68Updated 10 months ago
- Wiktionary Parser☆28Updated 8 years ago
- Python tool for normilizing text and text canonicalization (DISCONTINUED)☆41Updated 11 years ago
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)☆49Updated 2 weeks ago
- Offline Nepali Handwritten Character Recognition Using Artificial Neural Networks☆23Updated 8 years ago
- Automatically exported from code.google.com/p/guess-language☆53Updated last year
- Download SoundCloud artists in parallel☆10Updated last month
- Manifests of the public domain images uploaded to Flickr Commons, with descriptive information about the books they were taken from.☆75Updated 11 years ago
- A generic, machine learning-based revision scoring system for MediaWiki☆90Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆52Updated 4 years ago
- Easy language identification of 380 languages☆17Updated 5 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Updated last year