Source code for the Medium article "Extracting the author of news stories with DOM-based segmentation and BERT"
☆29Jan 16, 2020Updated 6 years ago
Alternatives and similar repositories for AuthorExtractor
Users that are interested in AuthorExtractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for the paper "Web2Text: Deep Structured Boilerplate Removal", full paper @ ECIR'18☆169Oct 28, 2021Updated 4 years ago
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Mar 29, 2021Updated 5 years ago
- ☆19Jun 4, 2020Updated 5 years ago
- Web content extraction using machine learning☆34Mar 3, 2021Updated 5 years ago
- End-to-end integration of HuggingFace's models for sequence labeling.☆11Oct 4, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A package for generating synthetic data and fine-tuning a gliner model.☆14Jun 5, 2024Updated last year
- Code to train Sentence BERT Japanese model for Hugging Face Model Hub☆11Aug 8, 2021Updated 4 years ago
- ☆17Sep 20, 2021Updated 4 years ago
- ☆11May 26, 2022Updated 3 years ago
- 记录自己用的BILSTM-CRF、ELMo、BERT等来做NER任务的代码。☆27Feb 6, 2020Updated 6 years ago
- Serving Uncertainty with Bayesian inference, using PyMC3 with Bodywork☆14Jun 21, 2022Updated 3 years ago
- A simple multicohort LTV calculator for subscriptions☆11Mar 7, 2023Updated 3 years ago
- ☆12Apr 14, 2023Updated 3 years ago
- JavaFlow reimagines the core ideas of FoundationDB's Flow actor framework in idiomatic Java, leveraging JDK continuations instead of any …☆24Feb 16, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks☆11Mar 15, 2022Updated 4 years ago
- universal-datalakehouse-postgres-ingestion-deltastreamer☆10Apr 7, 2024Updated 2 years ago
- A weak supervision framework for (partial) labeling functions☆16Jul 15, 2024Updated last year
- This repository shows how to efficiently process variable-length sequences in TensorFlow.☆14Apr 26, 2022Updated 4 years ago
- 公司、企业名称模糊匹配,基于词频的公司名主体提取 ,基于编辑距离的匹配度☆41Dec 21, 2020Updated 5 years ago
- Experimental collections library☆14Mar 27, 2019Updated 7 years ago
- Neat table/list views with filtering and pagination support; powered by React.☆13Jan 25, 2023Updated 3 years ago
- ☆10May 1, 2025Updated last year
- Numpyro examples in Python notebooks☆11Sep 7, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Python wrapper for phantomjs☆15May 28, 2021Updated 4 years ago
- Celery workers as independent microservices deployed using Docker Swarm.☆11Dec 4, 2020Updated 5 years ago
- A python library to enable GenAI and LLMOps within Google Cloud Platform☆17Mar 12, 2026Updated 2 months ago
- ☆17Oct 1, 2023Updated 2 years ago
- Code for "Probabilistic forecasting of cross-sectional returns: A Bayesian dynamic factor model with heteroskedasticity"☆13Jul 6, 2024Updated last year
- An observatory of anglicism usage in the Spanish press☆11May 11, 2026Updated last week
- Load Testing ML Microservices for Robustness and Scalability☆14Feb 8, 2022Updated 4 years ago
- Generates random utf-8 strings for fuzz t�sting character encoding probl�ms☆11Aug 21, 2015Updated 10 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Oct 9, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Nov 26, 2025Updated 5 months ago
- Systematic Review Query Visualisation and Understanding Interface☆17Dec 5, 2025Updated 5 months ago
- ☆13Jul 8, 2024Updated last year
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- 🗺 Pathfinding visualizer web application☆12Feb 15, 2021Updated 5 years ago
- Machine learning RCT classifier☆23Mar 25, 2023Updated 3 years ago
- State-space model inference with JAX☆64Updated this week