Preparing DMOZ dataset for my n-Gram LM-based URL classification research
☆31 · Aug 30, 2014 · Updated 11 years ago
Alternatives and similar repositories for dmoz-urlclassifier
Users that are interested in dmoz-urlclassifier are comparing it to the libraries listed below
- Algorithms for URL Classification ☆19 · Apr 13, 2015 · Updated 10 years ago
- A database importer for the open directory project (aka dmoz) data ☆20 · Sep 14, 2014 · Updated 11 years ago
- Implementation of a pairwise document similarity algorithm using MapReduce. ☆15 · Nov 16, 2011 · Updated 14 years ago
- Show summary of a large number of URLs in a Jupyter Notebook ☆17 · Feb 10, 2026 · Updated 3 weeks ago
- Detect and classify pagination links ☆15 · Sep 9, 2020 · Updated 5 years ago
- Classifies webpages into categories defined in DMOZ dataset ☆40 · Dec 14, 2015 · Updated 10 years ago
- RWA recurrent neural networks ☆17 · Apr 14, 2017 · Updated 8 years ago
- Dmoz RDF parser ☆28 · Jun 22, 2016 · Updated 9 years ago
- Repository for the CLiPS HAte speech DEtection System [HADES]. ☆24 · Apr 5, 2018 · Updated 7 years ago
- The Clever Algorithms project is an effort to describe a large number of algorithmic techniques from the field of Artificial Intelligence… ☆29 · Oct 28, 2018 · Updated 7 years ago
- The missing datasets manager. Like Homebrew but for datasets. A CLI tool to search and discover datasets! ☆41 · May 29, 2017 · Updated 8 years ago
- This tool emulates the modbus registers of a SDM630 (Single/Three Phase Power Meter) from Eastron (pymodbus, Raspberry). Useful for Growa… ☆11 · May 17, 2025 · Updated 9 months ago
- Smart Meter Data Collector ☆12 · Jul 17, 2024 · Updated last year
- Juery is a tiny Java library for managing search and filter queries from user to database. ☆12 · Jan 27, 2026 · Updated last month
- A SPICE-program funded project where the goal is to detect hate speech in social media. ☆32 · Oct 29, 2017 · Updated 8 years ago
- An FFmpeg Wrapper with focus on Complex Filter ☆11 · Jul 7, 2023 · Updated 2 years ago
- A simple UI client for Aerospike DB ☆11 · Aug 3, 2023 · Updated 2 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ☆48 · Dec 17, 2021 · Updated 4 years ago
- An efficient approximation for tree edit-distance. ☆45 · Sep 6, 2011 · Updated 14 years ago
- [mirror] Static website generator using `make` and `discount`. ☆10 · Jun 30, 2020 · Updated 5 years ago
- VSCodium for LoongArch with system-wide Electron. ☆12 · Dec 14, 2023 · Updated 2 years ago
- An Ansible Role that manages Hetzner Robot Firewall ☆12 · Feb 13, 2020 · Updated 6 years ago
- Radio that switches stations during the day ☆10 · Jun 27, 2023 · Updated 2 years ago
- Open Insights is a framework for constructing browser-based RUM clients. ☆13 · Jan 6, 2023 · Updated 3 years ago
- A graphical EDA tool ☆14 · Jan 9, 2023 · Updated 3 years ago
- A genetic algorithm for TSP that I wrote to use as an example in my Stack Abuse article. ☆10 · Jul 17, 2019 · Updated 6 years ago
- a Device Management Daemon ☆13 · Jan 13, 2024 · Updated 2 years ago
- gobalan is a TCP load balancer that supports high network throughput and also supports a special load balancing algorithm based on the mac… ☆10 · Feb 12, 2020 · Updated 6 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop. ☆18 · Dec 28, 2017 · Updated 8 years ago
- Official implementation of the paper "Light Transport-aware Diffusion Posterior Sampling for Single View Reconstruction of Volumes" ☆17 · Aug 1, 2025 · Updated 7 months ago
- This project contains simple methods to measure sample relatedness and identify potential swaps and contamination ☆10 · Jan 8, 2016 · Updated 10 years ago
- A RISC-V simulator based on SystemC ☆14 · Jan 7, 2022 · Updated 4 years ago
- BlockCAT token sale smart contracts. ☆11 · Oct 19, 2017 · Updated 8 years ago