Preparing DMOZ dataset for my n-Gram LM-based URL classification research
☆31Aug 30, 2014Updated 11 years ago
Alternatives and similar repositories for dmoz-urlclassifier
Users that are interested in dmoz-urlclassifier are comparing it to the libraries listed below.
- Algorithms for URL Classification☆19Apr 13, 2015Updated 10 years ago
- A database importer for the open directory project (aka dmoz) data☆20Sep 14, 2014Updated 11 years ago
- Implementation of a pairwise document similarity algorithm using MapReduce.☆15Nov 16, 2011Updated 14 years ago
- Classifies webpages into categories defined in DMOZ dataset☆40Dec 14, 2015Updated 10 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆19Feb 10, 2026Updated last month
- A small library that wraps Keras models to pickle them.☆14Jul 17, 2018Updated 7 years ago
- A TensorFlow implementation of Deep Clustering Network (DCN), ICML 2017☆13Oct 21, 2022Updated 3 years ago
- Information Retrieval Library (in Python)☆82Dec 18, 2021Updated 4 years ago
- ☆11Feb 2, 2018Updated 8 years ago
- Making survival analysis work in TensorFlow☆19Jun 4, 2017Updated 8 years ago
- Slides for my intro to deep reinforcement learning at Imperial College☆17Apr 8, 2018Updated 7 years ago
- A Clojure library for use case driven development☆11Dec 25, 2017Updated 8 years ago
- ☆16Sep 13, 2016Updated 9 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.☆18Dec 28, 2017Updated 8 years ago
- Actively track user engagement and know when they move away from your page.☆11Jan 22, 2026Updated 2 months ago
- A Brainfuck interpreter written in Rust and compiled to WebAssembly☆10Dec 4, 2017Updated 8 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- Marshal/Unmarshal interface for structs that can encode/decode themselves to URL query strings☆11Jun 6, 2018Updated 7 years ago
- Repository for the CLiPS HAte speech DEtection System [HADES].☆24Apr 5, 2018Updated 7 years ago
- Correlation-aware Change-point Detection via Graph Neural Networks☆16Sep 28, 2020Updated 5 years ago
- ☆16Jul 7, 2025Updated 8 months ago
- code for DOMI☆11Mar 24, 2023Updated 3 years ago
- extract difference between two html pages☆33Feb 10, 2026Updated last month
- The missing datasets manager. Like Homebrew but for datasets. CLI tool to search and discover datasets!☆41May 29, 2017Updated 8 years ago
- Simple implementation of text-based Gridworld game. Intended for use with reinforcement learning algorithms.☆15Apr 29, 2018Updated 7 years ago
- FFI bindings to libudev☆10Feb 28, 2024Updated 2 years ago
- A collection of data sets for data entrepreneurs from the Centers for Medicare and Medicaid Services synthetic public use files☆16May 30, 2013Updated 12 years ago
- This project contains simple methods to measure sample relatedness and identify potential swaps and contamination☆10Jan 8, 2016Updated 10 years ago
- Classifying the content of domains☆58Sep 13, 2025Updated 6 months ago
- Neural Network for Automatic Negation Detection☆20Aug 1, 2016Updated 9 years ago
- ☆13Nov 29, 2021Updated 4 years ago
- Token and Sentence Level Classification with Google's BERT (TensorFlow)☆10Jul 11, 2019Updated 6 years ago
- A system utility package for Torch.☆13Dec 22, 2017Updated 8 years ago
- Kaggle histopathologic cancer detection (playground) competition solution☆11Apr 25, 2019Updated 6 years ago
- I had a lot of questions as I went through the Deep Learning Blitz tutorial from pytorch.org, so I made my own tutorial trying to answer …☆13Jun 16, 2018Updated 7 years ago
- AWS Lambda function handler for converting LaTeX documents into PDFs☆11Nov 16, 2018Updated 7 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Dec 17, 2021Updated 4 years ago
- An experiment with modern C++, suffix trees, and Ukkonen's algorithm for suffix tree construction.☆12Mar 15, 2019Updated 7 years ago
- An assorted collection of free and open notes, courses and books in Statistical learning☆13Oct 4, 2025Updated 5 months ago