dell-research-harvard/NEWS-COPY

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dell-research-harvard/NEWS-COPY)

dell-research-harvard / NEWS-COPY

Noise-robust de-duplication at scale

☆19

Alternatives and similar repositories for NEWS-COPY

Users that are interested in NEWS-COPY are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

luigi-asprino / Graffoo4DrawIO
View on GitHub
Graffoo shapes for draw.io
☆12Jul 13, 2026Updated 2 weeks ago
NewsEye / NLP-Notebooks-Newspaper-Collections
View on GitHub
A collection of notebooks for Natural Language Processing
☆25Jan 13, 2025Updated last year
o-laurent / multivariate-ks-test
View on GitHub
Python implementation of an extension of the Kolmogorov-Smirnov test for multivariate samples
☆13Aug 6, 2023Updated 2 years ago
UCSC-REAL / FLAT
View on GitHub
[ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data
☆14Feb 26, 2025Updated last year
QuantLaw / legal-data-clustering
View on GitHub
Detect communities in legal networks
☆12Dec 15, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jonghkim / awesome-machine-learning-management-research
View on GitHub
☆15Jul 27, 2021Updated 5 years ago
unitedstates / BillMap
View on GitHub
Utilities and applications for the FlatGov project by Demand Progress
☆18Feb 8, 2023Updated 3 years ago
phaiptt125 / newspaper_project
View on GitHub
A supplementary material to "The Evolution of Work in the United States"
☆12Jun 23, 2021Updated 5 years ago
jonathanherzig / span-based-sp
View on GitHub
Author implementation of the paper "Span-based Semantic Parsing for Compositional Generalization"
☆17Aug 29, 2021Updated 4 years ago
Piazzi / Onto4ALL
View on GitHub
Onto4ALL Is a free graphical editor capable of creating, editing and exporting ontologies being guided by an warnings console, an ontolog…
☆18Aug 28, 2025Updated 11 months ago
dasmiq / cs7180-sp2024
View on GitHub
Special Topics in AI: Artificial Intelligence as an Archival Science
☆21May 13, 2024Updated 2 years ago
jbaiter / archiscribe
View on GitHub
Web application for transcribing OCR ground truth from Archive.org
☆18Feb 22, 2018Updated 8 years ago
vishakhpk / verify_citations
View on GitHub
Code to verify citations in a bibtex file
☆15Mar 14, 2026Updated 4 months ago
MemeMedianMode / Staggered-Difference-in-Differences-Example
View on GitHub
Sample code/data to implement heterogeneity-robust difference-in-difference estimators.
☆19Jun 16, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mshukor / eP-ALM
View on GitHub
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27Oct 27, 2023Updated 2 years ago
hnesk / browse-ocrd
View on GitHub
An extensible viewer for OCR-D mets.xml files
☆23May 30, 2024Updated 2 years ago
mariru / dynamic_bernoulli_embeddings
View on GitHub
☆15May 30, 2017Updated 9 years ago
Pleias / OCRoscope
View on GitHub
Small python package to measure OCR quality and other related metrics.
☆26Feb 19, 2024Updated 2 years ago
iiasa / emissions_downscaling
View on GitHub
☆14Aug 18, 2021Updated 4 years ago
qinlibo-hit / CI-ToD
View on GitHub
PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialog…
☆28Oct 4, 2021Updated 4 years ago
bltlab / seqscore
View on GitHub
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
☆23Jul 16, 2026Updated last week
percevalw / nlstruct
View on GitHub
Natural language structuring library
☆22Jun 5, 2024Updated 2 years ago
UKPLab / on-emergence
View on GitHub
Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning
☆33Jan 9, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
anirbanl / sparsegen
View on GitHub
Code for the NeurIPS 2018 paper "On Controllable Sparse Alternatives to Softmax"
☆24Oct 10, 2019Updated 6 years ago
elliottash / emotionmeter
View on GitHub
Python code for producing emotionality scores from Gennaro and Ash (2021).
☆20Dec 12, 2021Updated 4 years ago
isapollnik / stattotex
View on GitHub
A Stata command to automatically place a calculation into LaTeX -- no more hard coding!
☆18Oct 31, 2025Updated 8 months ago
tsunghao-huang / Python-Ports-Distance-Calculator
View on GitHub
A distance calculator that is able to return distance between two ports based on the derived sea route.
☆14Sep 11, 2020Updated 5 years ago
ntunlp / LLMSanitize
View on GitHub
An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).
☆62Aug 13, 2024Updated last year
davanstrien / ocr-bench
View on GitHub
Per-collection OCR leaderboards using VLM-as-judge
☆68Jul 16, 2026Updated last week
Yuanhy1997 / HyPe
View on GitHub
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14Jul 11, 2023Updated 3 years ago
accessai / dynamic_word_embeddings
View on GitHub
Study of semantic evolution of words over time
☆21Mar 24, 2023Updated 3 years ago
mitdbg / ml-class-iap2017
View on GitHub
☆22Feb 13, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
shlomihod / deep-text-eval
View on GitHub
Differnable Readability Measure Regularizer for Neural Network Automatic Text Simplification
☆24Mar 24, 2023Updated 3 years ago
deshen24 / syntheticNN
View on GitHub
☆25Oct 12, 2021Updated 4 years ago
philschmid / multilingual-serverless-qa-aws-lambda
View on GitHub
☆10Dec 17, 2020Updated 5 years ago
cisnlp / mPLM-Sim
View on GitHub
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
☆11Jan 19, 2024Updated 2 years ago
ArthurSpirling / InferenceToTheBestExplanation
View on GitHub
Repo for Spirling and Stewart's "What Good is a Regression?" Project
☆27Jul 2, 2024Updated 2 years ago
TIGER-AI-Lab / PixelWorld
View on GitHub
The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆15Sep 12, 2025Updated 10 months ago
lancopku / DynamicKD
View on GitHub
Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"
☆41Aug 9, 2022Updated 3 years ago