☆36Feb 13, 2023Updated 3 years ago
Alternatives and similar repositories for alanno
Users that are interested in alanno are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Podium: a framework agnostic Python NLP library for data loading and preprocessing☆60Dec 12, 2022Updated 3 years ago
- A rule based sentence segmentation library.☆14Jul 17, 2023Updated 2 years ago
- spaCy + UDPipe☆168Apr 19, 2022Updated 4 years ago
- Ensemble topic modeling with matrix factorization☆24May 10, 2018Updated 8 years ago
- The CODWOE shared task invites you to compare two types of semantic descriptions: dictionary glosses and word embedding representations. …☆12Jul 13, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022☆13Oct 20, 2022Updated 3 years ago
- ☆22Aug 30, 2025Updated 8 months ago
- 4th Year MSci Dissertation☆11Oct 3, 2022Updated 3 years ago
- ☆13Feb 7, 2023Updated 3 years ago
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022☆17May 2, 2025Updated last year
- "Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)☆91Oct 6, 2023Updated 2 years ago
- [ICML 2024] Fine-Grained Classes and How to Find Them☆13Jun 21, 2024Updated last year
- ☆88Jul 30, 2024Updated last year
- Repository for SF2SE3: Clustering Scene Flow into SE(3)-Motions via Proposal and Selection☆12Jul 26, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆16Oct 3, 2023Updated 2 years ago
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.☆14Jan 27, 2017Updated 9 years ago
- Tokenize and clean strings in Python☆11Jan 11, 2018Updated 8 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- Repository for "Training Language Models To Explain Their Own Computations"☆22Dec 22, 2025Updated 4 months ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Feb 16, 2026Updated 2 months ago
- A Python Wrapper To Retrieve Data From The CrowdTangle API☆11Mar 26, 2026Updated last month
- Information Retrieval Relevance Judging System☆29Jan 17, 2022Updated 4 years ago
- The official code of ALLECS: A Lightweight Language Error Correction System☆11Mar 12, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆15Feb 4, 2021Updated 5 years ago
- Python implementation of CETR: Content Extraction via Tag Ratios☆13Jan 18, 2012Updated 14 years ago
- SemEval-2018 Task 12: The Argument Reasoning Comprehension Task☆31Feb 22, 2018Updated 8 years ago
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- Source code to reproduce results from Panoptic Swiftnet paper.☆16Oct 18, 2022Updated 3 years ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Apr 27, 2022Updated 4 years ago
- Novella is a build system for processing data in a temporary directory isolated from the project, designed for documentation source code …☆13Jan 11, 2024Updated 2 years ago
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)☆20Sep 21, 2023Updated 2 years ago
- PyTorch C++ Extension Example☆15Mar 4, 2018Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 3 years ago
- A Swift reimplementation of a Claude Code-style coding agent, built stage by stage to explore what makes coding agents work☆154Mar 25, 2026Updated last month
- Slides from various talks I gave☆18Oct 25, 2018Updated 7 years ago
- 🌸 Train floret vectors☆18May 4, 2023Updated 3 years ago
- Shallow Bayesian Meta Learning for Real World Few-shot Recognition☆23Nov 18, 2021Updated 4 years ago
- This data release is meant to accompany and document the paper: https://arxiv.org/abs/2004.11997 Collecting Entailment Data for Pretrain…☆14Sep 29, 2020Updated 5 years ago
- An efficient implementation of Partitioned Label Trees & its variations for extreme multi-label classification☆92Feb 20, 2024Updated 2 years ago