mediacloud/metadata-lib

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mediacloud/metadata-lib)

mediacloud / metadata-lib

How Media Cloud approaches extracting metadata from online news stories

☆17

Alternatives and similar repositories for metadata-lib

Users that are interested in metadata-lib are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

internetarchive / iacopilot
View on GitHub
Summarize and ask questions about items in the Internet Archive
☆17Apr 1, 2023Updated 3 years ago
JohnMarkOckerbloom / onlinebooks
View on GitHub
Selected code and data for The Online Books Page and related applications
☆12Jul 1, 2026Updated 3 weeks ago
rodekruis / caladrius
View on GitHub
Automated Damage Assessment using Deep Learning
☆14Jun 25, 2025Updated last year
commonsearch / gumbocy
View on GitHub
Python binding for gumbo-parser using Cython
☆14Aug 16, 2016Updated 9 years ago
natliblux / warc-safe
View on GitHub
A tool for detecting viruses and NSFW material in WARC files
☆18Jul 15, 2026Updated last week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
david-macmahon / hashpipe
View on GitHub
High Availability Shared Pipeline Engine
☆17Sep 15, 2023Updated 2 years ago
bethelmelesse / UnifiedCrawl
View on GitHub
☆17Nov 26, 2024Updated last year
j9recurses / archextract
View on GitHub
Illuminating the scope and content of a digital text collections
☆13Jul 28, 2015Updated 10 years ago
dogancanbakir / soft-404
View on GitHub
A classifier for detecting soft 404 pages
☆17Sep 10, 2022Updated 3 years ago
internetarchive / gowarc
View on GitHub
Read and write WARC files in Go
☆53Updated this week
MarcoMarcoaldi / ASN_IPTables_Blocker.sh
View on GitHub
A pure Linux Bash Script for block IP Range using Autonomous System Number
☆17Mar 11, 2026Updated 4 months ago
ska-sa / spead2
View on GitHub
Library for the Streaming Protocol for Exchange of Astronomical Data (SPEAD)
☆27Updated this week
SimonRogers / datajournalism
View on GitHub
Introduction to data journalism
☆14Dec 19, 2018Updated 7 years ago
Vinothsuku / insightsR
View on GitHub
automated insights for tabular data
☆10Feb 10, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
public-people / scrape-news
View on GitHub
Scrape South African news
☆13May 22, 2023Updated 3 years ago
SamProell / samproell.io-code
View on GitHub
Collection of all code samples referenced on https://www.samproell.io
☆15Apr 8, 2024Updated 2 years ago
ramneekhanda / crypto
View on GitHub
https://www.coursera.org/learn/cryptocurrency
☆12Oct 28, 2017Updated 8 years ago
leonjovanovic / keywords-extraction
View on GitHub
Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT.
☆12May 22, 2023Updated 3 years ago
kowndinya-renduchintala / POSIX
View on GitHub
POSIX: A Prompt Sensitivity Index for Language Models
☆13Nov 13, 2024Updated last year
serpapi / automatic-images-classifier-generator
View on GitHub
Generate machine learning models fully automatically to clasiffiy any images using SERP data
☆12Aug 25, 2022Updated 3 years ago
mitmedialab / MediaCloud-Dashboard
View on GitHub
Front-end for the MediaCloud database
☆16Apr 3, 2018Updated 8 years ago
sombriks / node-libgpiod
View on GitHub
libgpiod node bindings
☆43May 7, 2026Updated 2 months ago
opengeos / solara-maxar
View on GitHub
A Solara web app for visualizing Maxar Open Data
☆36Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
gabrielmbmb / candle-holder
View on GitHub
A Rust crate offering similar functionality to the Python transformers package using Candle.
☆15Nov 19, 2024Updated last year
DeepsMoseli / Siamese-LSTM-on-sentence-similarity
View on GitHub
Using Siamese LSTM to classify repeated quora questions. Attempted pretrained bert embeddings, Word2Vec and training own embeddings toget…
☆10Aug 28, 2020Updated 5 years ago
alsonicr / quarto-apa7
View on GitHub
An apa7 template for quarto/posit
☆12Jan 25, 2023Updated 3 years ago
tingofurro / headline_grouping
View on GitHub
Codebase, data and models for the Headline Grouping paper at NAACL2021
☆12Oct 2, 2022Updated 3 years ago
smitkiri / news-qa
View on GitHub
Reading comprehension based question-answering model for news articles.
☆11Jun 22, 2022Updated 4 years ago
AmericanRedCross / street-view-green-view
View on GitHub
☆23Jan 9, 2025Updated last year
Algram / PodcastAutomator
View on GitHub
🎧 Simple bash-script to automatically download the most recent podcasts from a list of rss-feeds and upload them to your Dropbox.
☆10Nov 30, 2015Updated 10 years ago
AbdulRehman555 / 3D-Mesh-Generation
View on GitHub
3D Mesh Generation from 2D Images in Python
☆13Feb 12, 2024Updated 2 years ago
PD-Mera / ctranslate2-triton-backend
View on GitHub
Triton backend for https://github.com/OpenNMT/CTranslate2
☆11Aug 20, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
G-Research / dgraph-dbpedia
View on GitHub
Pre-processing DBpedia datasets to load into Dgraph
☆13Mar 6, 2022Updated 4 years ago
pacotvj99 / testsampleR
View on GitHub
☆14Jan 25, 2026Updated 5 months ago
akimfromparis / RAG-Japanese
View on GitHub
Open source RAG with Llama Index for Japanese LLM in low resource settting
☆10May 12, 2025Updated last year
THU-KEG / Entity-Linking-Trends-and-History
View on GitHub
Papers about the trend of Entity Linking in recent years.
☆11Sep 5, 2022Updated 3 years ago
newsdev / apfake
View on GitHub
A command-line tool for generating AP API JSON files for testing elections applications.
☆15Jul 5, 2022Updated 4 years ago
yang0369 / Information_Extraction
View on GitHub
end-to-end information extraction pipeline built by LayoutLMV2, pretrained model from HuggingFace
☆11Aug 15, 2023Updated 2 years ago
ibraAbuKaff / react-fancy-visa-card
View on GitHub
React js implementation for visa credit card - Payment Form
☆21Feb 27, 2023Updated 3 years ago