Generate large textual corpora for almost any language by crawling the web
☆13Feb 17, 2024Updated 2 years ago
Alternatives and similar repositories for webcorpus
Users who are interested in webcorpus are comparing it to the libraries listed below
- Agile reading group that works☆13Feb 2, 2022Updated 4 years ago
- ☆17Apr 28, 2021Updated 4 years ago
- ☆23May 5, 2022Updated 3 years ago
- ☆45Dec 15, 2022Updated 3 years ago
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists☆31Aug 14, 2025Updated 6 months ago
- ☆38Feb 8, 2026Updated 3 weeks ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆38Jul 24, 2025Updated 7 months ago
- Python library for converting numbers to words for all Indian Languages.☆36May 23, 2025Updated 9 months ago
- A catalog listing instruction sets and models available for Indic languages☆10Mar 14, 2024Updated last year
- Web app for generating Freshworks Peer Appreciation cards.☆11Jan 24, 2026Updated last month
- Android application that allows the user to locate their position using Wi-Fi. Once the localization is done, the user can track their move …☆11Oct 21, 2016Updated 9 years ago
- ☆12Jan 11, 2023Updated 3 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆110Aug 28, 2025Updated 6 months ago
- React hook that lets you put functions in a queue while in offline mode☆11Jan 4, 2023Updated 3 years ago
- Lift-style CSS selector transforms based on Scalate's Scuery☆10Aug 23, 2012Updated 13 years ago
- ☆12Sep 20, 2014Updated 11 years ago
- Generate an appropriate question from a passage.☆10May 18, 2021Updated 4 years ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- ☆11Mar 19, 2023Updated 2 years ago
- Synthetically generate random text document images with ground-truth☆12Jul 20, 2021Updated 4 years ago
- Parse Searchable Electoral Rolls☆11Apr 20, 2025Updated 10 months ago
- An Android library that helps projects easily integrate a camera with face detection.☆10Apr 25, 2016Updated 9 years ago
- ☆13Nov 4, 2025Updated 4 months ago
- Simple project to change an application's language (Locale) at runtime☆12Oct 1, 2020Updated 5 years ago
- ☆15Apr 26, 2025Updated 10 months ago
- Detection of malicious data exfiltration over DNS using Machine Learning techniques☆13Jul 8, 2020Updated 5 years ago
- ClawBands is a security middleware for OpenClaw AI agents.☆164Feb 9, 2026Updated 3 weeks ago
- ☆13Jan 22, 2025Updated last year
- Persistent memory system for agentic AI via MCP: remember, recall, and forget, with semantic search and a knowledge graph☆28Feb 15, 2026Updated 2 weeks ago
- Real-time group chat application built for Android using Firebase.☆11Dec 6, 2016Updated 9 years ago
- Code for Interspeech2022 paper DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion☆13May 6, 2023Updated 2 years ago
- utilities for generating website graphics like gradients and textures☆34Jul 15, 2014Updated 11 years ago
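- A low-memory frame-sequence animation library that occupies only the memory of a single frame image; can be used to display large gifts in live streams☆10Oct 16, 2018Updated 7 years ago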
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Nov 22, 2021Updated 4 years ago
- a tutorial on writing web applications with the Pinax framework☆21Dec 23, 2009Updated 16 years ago
- iPhone control for using PhoneGap as a part of your application☆22Apr 22, 2010Updated 15 years ago
- ☆15Mar 2, 2025Updated last year
- An experiment to see if ChatGPT can improve the output of the Stanford Alpaca dataset☆12Mar 29, 2023Updated 2 years ago
- Bytebeat player with a collection of many formulas from around the internet.☆20Dec 2, 2025Updated 3 months ago