Useful tools to extract malayalam text from the Common Crawl Datasets
☆28Apr 21, 2026Updated 2 weeks ago
Alternatives and similar repositories for common-crawl-malayalam
Users that are interested in common-crawl-malayalam are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Language Modeling and Text Classification in Malayalam Language using ULMFiT☆73Dec 8, 2022Updated 3 years ago
- Index Common Crawl archives in tabular format☆128Updated this week
- Tensorflow implementation of Generative Adversarial Text to Image Synthesis for MNIST handwritten digit dataset☆10Aug 3, 2017Updated 8 years ago
- LiT (Zero-Shot Transfer with Locked-image text Tuning) image and text encoder models, working in the browser☆11May 16, 2022Updated 3 years ago
- TensorFlow implementation of "Generating Sentences from a Continuous Space"☆11Sep 16, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A linter for Scrapy projects.☆22Feb 25, 2026Updated 2 months ago
- Nupuram/നൂപുരം Font - https://smc.org.in/fonts/nupuram☆15Sep 29, 2023Updated 2 years ago
- A collection of models for TensorFlow Go☆12May 29, 2022Updated 3 years ago
- Face recognition using point cloud from LiDAR sensor☆12Mar 9, 2022Updated 4 years ago
- A Machine Learning tool to create the training dataset very quickly & easily by using a smart chrome extension☆14Feb 11, 2023Updated 3 years ago
- Adversarial Machine Translation with pytorch☆23Jan 14, 2018Updated 8 years ago
- ☆11Dec 10, 2020Updated 5 years ago
- ☆14May 10, 2024Updated 2 years ago
- Made for our SPAM. Features include Re vili Saying Thank you☆10Nov 27, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Sep 23, 2020Updated 5 years ago
- ☆40Jun 2, 2021Updated 4 years ago
- Common web archive utility code.☆63May 2, 2026Updated last week
- paper2code is a collection of AI/ML research papers rebuilt in Python — stripped of the abstractions that hide what's actually happening.☆23Apr 14, 2026Updated 3 weeks ago
- EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and E…☆42Jun 21, 2022Updated 3 years ago
- Deployment of pywb as a CommonCrawl Index Server☆21Oct 6, 2017Updated 8 years ago
- ☆18Dec 30, 2024Updated last year
- ☆19Sep 4, 2021Updated 4 years ago
- Advance Image Downloader/Extractor (Job) is a Python-Flask web-based app, which will help the user download the any kind of Images at any…☆14Sep 10, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repo contains self made projects and learnables from various resources on using local LLMs and RAG☆14May 26, 2025Updated 11 months ago
- Add website scraping abilities to Datasette☆66Mar 4, 2023Updated 3 years ago
- Manjari Malayalam Font.☆11Sep 29, 2023Updated 2 years ago
- An attempt to create a sensor fusion model for camera & laser scanner inputs for Autonomous Vehicles☆14Jul 25, 2021Updated 4 years ago
- ☆15Aug 18, 2021Updated 4 years ago
- Clean Water AI☆13Aug 15, 2018Updated 7 years ago
- ☆25Mar 20, 2024Updated 2 years ago
- Converter from Swagger JSON to Markdown☆12May 11, 2019Updated 6 years ago
- Baby deep learning library🐣☆14Jan 22, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- Converts CLIP models to ONNX☆11Jan 17, 2023Updated 3 years ago
- By integrating the data of LiDAR and camera, create teacher data sets for monocular camera.☆18Nov 14, 2018Updated 7 years ago
- Adaptive Split-Fusion Transformer (ICME 2023 Oral)☆19Feb 19, 2024Updated 2 years ago
- covid question answering datasets and fine tuned models☆18Apr 27, 2021Updated 5 years ago
- Russian words synonyms and antonyms☆11Dec 7, 2021Updated 4 years ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Aug 22, 2022Updated 3 years ago