Preparing DMOZ dataset for my n-Gram LM-based URL classification research
☆31Aug 30, 2014Updated 11 years ago
Alternatives and similar repositories for dmoz-urlclassifier
Users that are interested in dmoz-urlclassifier are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of a pairwise document similarity algorithm using MapReduce.☆15Nov 16, 2011Updated 14 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆19Apr 8, 2026Updated 2 months ago
- RWA recurrent neural networks☆18Apr 14, 2017Updated 9 years ago
- Dmoz RDF parser☆28Jun 22, 2016Updated 10 years ago
- A TensorFlow implementation on Deep Clustering Network(DCN), ICML 2017☆13Oct 21, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Information Retrieval Library (in Python)☆82Dec 18, 2021Updated 4 years ago
- Detect and classify pagination links☆15Sep 9, 2020Updated 5 years ago
- Making survival analysis work in TensorFlow☆19Jun 4, 2017Updated 9 years ago
- Graph clustering and Node embeddings with word2vec☆14Mar 2, 2019Updated 7 years ago
- HTML Elements for IPFS.☆28Sep 28, 2017Updated 8 years ago
- ☆16Sep 13, 2016Updated 9 years ago
- Unsupervised domain adaptation method for relation extraction☆18Jul 16, 2018Updated 7 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.☆18Dec 28, 2017Updated 8 years ago
- A wrapper around Python's ctypes for Nim-specific function signatures.☆12Dec 12, 2017Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Package to facilitate URL clustering☆71Feb 24, 2016Updated 10 years ago
- Marshal/Unmarshal interface for structs that can encode/decode themselves to URL query strings☆11Jun 6, 2018Updated 8 years ago
- Repository for the CLiPS HAte speech DEtection System [HADES].☆25Apr 5, 2018Updated 8 years ago
- Correlation-aware Change-point Detection via Graph Neural Networks☆16Sep 28, 2020Updated 5 years ago
- ☆18Jul 7, 2025Updated 11 months ago
- code for DOMI☆12Mar 24, 2023Updated 3 years ago
- extract difference between two html pages☆33Apr 8, 2026Updated 2 months ago
- The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!☆41May 29, 2017Updated 9 years ago
- Simple implementation of text-based Gridworld game. Intended for use with reinforcement learning algorithms.☆15Apr 29, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- content.rdf.u8.gz☆11Dec 15, 2020Updated 5 years ago
- TURN Rest API Server☆13Feb 6, 2015Updated 11 years ago
- This project contains simple methods to measure sample relatedness and identify potential swaps and contamination☆10Jan 8, 2016Updated 10 years ago
- Classifying the content of domains☆58May 13, 2026Updated last month
- This repository includes code for replicating the results in the paper "Word Ordering Without Syntax" (2016).☆21Dec 8, 2016Updated 9 years ago
- WebDAV client for Rust☆10Jun 6, 2018Updated 8 years ago
- A Theano-based Python implementation of Factorization Machines (Rendle 2010).☆26Dec 13, 2022Updated 3 years ago
- A custom component used for switch page/sheet within tabs based on unity ugui☆14Oct 10, 2017Updated 8 years ago
- Explainable machine learning☆17Mar 17, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Token and Sentence Level Classification with Google's BERT (TensorFlow)☆10Jul 11, 2019Updated 6 years ago
- I had a lot of questions as I went through the Deep Learning Blitz tutorial from pytorch.org, so I made my own tutorial trying to answer …☆12Jun 16, 2018Updated 8 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Dec 17, 2021Updated 4 years ago
- ISO 20275☆10Jun 12, 2026Updated 3 weeks ago
- A assorted collection of free and open notes, courses and books in Statistical learning☆13Oct 4, 2025Updated 9 months ago
- Code for the paper Joint Learning of Hyperbolic Label Embeddings for Hierarchical Multi-label Classification (EACL '21)☆22Nov 3, 2021Updated 4 years ago
- The Clever Algorithms project is an effort to describe a large number of algorithmic techniques from the field of Artificial Intelligence…☆29Oct 28, 2018Updated 7 years ago