sushant354 / egazette
A project to download and process gazettes (Govt Notifications) from India
☆22Updated this week
Alternatives and similar repositories for egazette:
Users that are interested in egazette are comparing it to the libraries listed below
- OASIS TC Open Repository: Schema files, examples, exemplificative implementations and libraries, and documentation related to the LegalDo…☆62Updated 2 years ago
- Soundex Phonetic Code Algorithm Demo for Indian Languages. Supports all indian languages and English. Provides intra-indic string compari…☆56Updated 6 years ago
- Laws of India in Akoma Ntoso XML format☆35Updated 6 years ago
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- Transliteration module for Indian Languages☆77Updated last year
- Using ML to extract campaign finance data from messy forms for journalism☆76Updated 2 years ago
- The Beneficial Ownership Data Standard (BODS) is an open standard providing a specification for modelling and publishing information on t…☆64Updated 3 weeks ago
- Ergonomic line-by-line transcription of scanned text.☆51Updated 4 years ago
- Polygon Extents of Pincodes☆35Updated 7 years ago
- Resources to go with the Indic NLP Library☆73Updated 2 years ago
- Meta-repository for the open-source version of the SUMMA Platform☆16Updated 10 months ago
- Anuvaad - Open Sourced Document Translation Platform for Indic Languages☆63Updated 4 months ago
- Named entity recognition for the legal domain☆41Updated 3 years ago
- PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz☆38Updated 11 months ago
- State of the art open-source translation for Indic languages.☆5Updated 3 years ago
- Source real estate prices from the Common Crawl.☆27Updated 6 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆95Updated 2 years ago
- framework for scraping legislative/government data☆85Updated 5 months ago
- PIN Code mapping database☆55Updated 13 years ago
- Create a Geonames gazetteer index in Elasticsearch☆74Updated last year
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- A directory of Indic (Indian) language computing resources.☆56Updated last year
- Link Wikidata items to large catalogs☆95Updated 2 weeks ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated 11 months ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Updated 7 years ago
- HOCR Specification Python Parser☆13Updated 9 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆41Updated 2 years ago
- Reading legal authority for the last time☆34Updated this week
- Events and Situations Ontology☆13Updated 6 years ago
- Uses NLP methods to parse and classify contracts from The City of New Orleans☆10Updated 9 years ago