Noise-robust de-duplication at scale
☆19 · Apr 9, 2023 · Updated 2 years ago
Alternatives and similar repositories for NEWS-COPY
Users that are interested in NEWS-COPY are comparing it to the libraries listed below.
- A model(ing framework) for sample efficient OCR ☆64 · Apr 7, 2023 · Updated 2 years ago
- ☆10 · Oct 2, 2024 · Updated last year
- A collection of notebooks for Natural Language Processing ☆25 · Jan 13, 2025 · Updated last year
- Graffoo shapes for draw.io ☆11 · Apr 14, 2020 · Updated 5 years ago
- Python implementation of an extension of the Kolmogorov-Smirnov test for multivariate samples ☆13 · Aug 6, 2023 · Updated 2 years ago
- Code for 2021 TACL paper on community-specific language ☆13 · Dec 8, 2022 · Updated 3 years ago
- Korean politics data for research and development. ☆12 · Jun 21, 2016 · Updated 9 years ago
- WebVOWL integration on a flask application - Converting and Visualizing ontologies on the Web. ☆11 · Jun 15, 2021 · Updated 4 years ago
- Repository for the deep-learning framework DIVA-DAF which is built with historical document image analysis in mind. ☆18 · Nov 7, 2024 · Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆29 · Jul 24, 2025 · Updated 8 months ago
- The official Github for the American Stories dataset as in {link} ☆130 · Mar 7, 2024 · Updated 2 years ago
- Web application for transcribing OCR ground truth from Archive.org ☆17 · Feb 22, 2018 · Updated 8 years ago
- Author implementation of the paper "Span-based Semantic Parsing for Compositional Generalization" ☆17 · Aug 29, 2021 · Updated 4 years ago
- ☆14 · Jul 26, 2021 · Updated 4 years ago
- Detect communities in legal networks ☆12 · Dec 15, 2024 · Updated last year
- NO LONGER MAINTAINED. Links to alternative packages are in the readme. Code to incorporate staggered treatment adoption (based on appendi… ☆21 · Apr 30, 2023 · Updated 2 years ago
- Analysis pipeline for Revisiting UID (EMNLP 2021) ☆12 · Oct 24, 2022 · Updated 3 years ago
- Provides a half day introduction to grammar of graphics ☆11 · Feb 22, 2023 · Updated 3 years ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models. ☆27 · Oct 27, 2023 · Updated 2 years ago
- ☆14 · Feb 20, 2024 · Updated 2 years ago
- An extensible viewer for OCR-D mets.xml files ☆23 · May 30, 2024 · Updated last year
- Official Repository for Dataset Inference for LLMs ☆41 · Jul 25, 2024 · Updated last year
- predict ethnicity from names ☆13 · Mar 12, 2025 · Updated last year
- Natural language structuring library ☆22 · Jun 5, 2024 · Updated last year
- Website and blog for the research group of Mark J. van der Laan ☆11 · Jul 1, 2021 · Updated 4 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks ☆23 · Feb 27, 2026 · Updated last month
- PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialog… ☆28 · Oct 4, 2021 · Updated 4 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning ☆33 · Jan 9, 2025 · Updated last year
- decontamination ☆29 · Mar 4, 2026 · Updated 3 weeks ago
- League of Legends data solution focused on SLO. ☆12 · Jan 9, 2020 · Updated 6 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023] ☆14 · Jul 11, 2023 · Updated 2 years ago
- A Stata command to automatically place a calculation into LaTeX -- no more hard coding! ☆18 · Oct 31, 2025 · Updated 4 months ago
- Study of semantic evolution of words over time ☆22 · Mar 24, 2023 · Updated 3 years ago
- ☆23 · Feb 3, 2026 · Updated last month
- ☆25 · Oct 12, 2021 · Updated 4 years ago
- ☆10 · Dec 17, 2020 · Updated 5 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models ☆11 · Jan 19, 2024 · Updated 2 years ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25] ☆16 · Sep 12, 2025 · Updated 6 months ago
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models" ☆41 · Aug 9, 2022 · Updated 3 years ago