A simple bs4-based web crawl for building a corpus for statistical machine translation
☆13 · Aug 17, 2021 · Updated 4 years ago
Alternatives and similar repositories for crawl-for-parallel-corpora
Users that are interested in crawl-for-parallel-corpora are comparing it to the libraries listed below.
- Amharic English Machine Translation Corpus prepared through website crawling and custom preprocessing. ☆45 · Aug 2, 2018 · Updated 7 years ago
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities ☆17 · Jun 4, 2025 · Updated 9 months ago
- The set of files used for the development of the Amharic Corpus. ☆11 · Jun 4, 2017 · Updated 8 years ago
- Amharic/Tigrinya/Oromo Dictionaries ☆38 · Jan 31, 2026 · Updated last month
- Morphological processing for languages of the Horn of Africa ☆57 · Dec 27, 2025 · Updated 3 months ago
- This repository contains publicly available speech and text data in Luganda. ☆12 · Sep 4, 2020 · Updated 5 years ago
- doddle-model code examples ☆19 · Sep 23, 2019 · Updated 6 years ago
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover, Lexical analyzer, Corpus i… ☆37 · May 27, 2023 · Updated 2 years ago
- Command-line corpus tools ☆12 · May 15, 2017 · Updated 8 years ago
- An OCR engine that works by finding pre-known letters in a word's image ☆12 · Jul 29, 2019 · Updated 6 years ago
- Music chords made easy ☆15 · Oct 21, 2015 · Updated 10 years ago
- A collection of files and patterns to improve PostgreSQL text search ☆11 · Aug 26, 2016 · Updated 9 years ago
- Language checker and hyphenator extension for LibreOffice ☆12 · Jan 27, 2020 · Updated 6 years ago
- Lexical Data of Ge'ez Languages ☆56 · Sep 14, 2022 · Updated 3 years ago
- A module for creating stopword lists for any language, based on a set of documents. ☆15 · Mar 20, 2026 · Updated last week
- Hacked Video Camera GoXtreme Wifi Control PTP/RTSP/FTP ☆17 · Apr 5, 2015 · Updated 10 years ago
- Scorer for grammatical error correction systems. ☆14 · Feb 24, 2016 · Updated 10 years ago
- Python application that generates parallel corpora for any language pair; can be used for training NMT (Neural Machine Translation) systems ☆12 · Dec 8, 2022 · Updated 3 years ago
- Easier analysis of large speech corpora ☆23 · Jun 22, 2021 · Updated 4 years ago
- This project is aimed at providing an extendable soundmodem backend to various java applications. It is now in its initial phase providin… ☆19 · Oct 9, 2011 · Updated 14 years ago
- OpenTelemetry - metrics, traces, logs from Grafana visualized in Grafana ☆24 · Nov 20, 2025 · Updated 4 months ago
- A platform for agriculture smart contracts based on the NEO blockchain. ☆33 · Nov 26, 2019 · Updated 6 years ago
- Multi-label aviation safety narratives classification ☆15 · Jan 29, 2023 · Updated 3 years ago
- Notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification datas… ☆11 · May 10, 2024 · Updated last year
- How To Be a Programmer, edited ☆12 · May 21, 2012 · Updated 13 years ago
- 🏪 GatsbyJS + Shopify + Netlify CMS Starter + Vegefoods theme by COLORLIB ☆10 · Jan 11, 2023 · Updated 3 years ago
- A NSSpellServer that forwards requests to LanguageTool for grammar checking ☆21 · Jan 12, 2014 · Updated 12 years ago
- Statistical spell- and (occasional) grammar-checker. ☆18 · Nov 20, 2024 · Updated last year
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES ☆16 · Jul 27, 2020 · Updated 5 years ago
- ☆16 · Dec 11, 2019 · Updated 6 years ago
- Oromo, Amharic, English KJV bible API. ☆15 · Jul 31, 2023 · Updated 2 years ago
- Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons. ☆19 · Oct 3, 2024 · Updated last year
- Indonesian Chatbot built by Multi Layer Perceptron (Neural Network) ☆42 · May 22, 2022 · Updated 3 years ago
- Pharmacy Management System ☆13 · Aug 5, 2017 · Updated 8 years ago
- Winners solutions for [WNS Analytics Wizard 2018](https://datahack.analyticsvidhya.com/contest/wns-analytics-hackathon-2018/) ☆25 · Dec 13, 2018 · Updated 7 years ago
- This Challenge aims to infer important COVID-19 public health risk factors from outdated data in South Africa ☆20 · Dec 8, 2022 · Updated 3 years ago
- A simple chatty bot using react-native and Dialogflow ☆10 · May 25, 2018 · Updated 7 years ago
- The purpose of this project is to address the design and implementation of an intelligent traffic light system based on fuzzy logic techn… ☆23 · Jan 13, 2020 · Updated 6 years ago
- ☆32 · Dec 5, 2025 · Updated 3 months ago