String-to-String Algorithms for Natural Language Processing
☆565 · Jan 25, 2026 · Updated 3 months ago
Alternatives and similar repositories for string2string
Users that are interested in string2string are comparing it to the libraries listed below.
- ☆141 · Mar 5, 2024 · Updated 2 years ago
- Variable-order CRFs with structure learning ☆17 · Aug 1, 2024 · Updated last year
- Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023) ☆20 · Mar 21, 2024 · Updated 2 years ago
- data cleaning and curation for unstructured text ☆329 · Aug 6, 2024 · Updated last year
- A Python library for calculating a large variety of metrics from text ☆363 · Mar 20, 2026 · Updated last month
- Efficient few-shot learning with Sentence Transformers ☆2,724 · Apr 17, 2026 · Updated 2 weeks ago
- Salesforce open-source LLMs with 8k sequence length. ☆727 · Jan 31, 2025 · Updated last year
- ☆25 · Jan 22, 2024 · Updated 2 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti… ☆21 · Jul 11, 2022 · Updated 3 years ago
- Toolkit for domain-specific information retrieval experimentation ☆19 · Updated this week
- Interpretability for sequence generation models 🐛 🔍 ☆465 · Apr 25, 2026 · Updated last week
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi… ☆31 · Dec 5, 2022 · Updated 3 years ago
- ☆22 · Oct 26, 2020 · Updated 5 years ago
- A BERT-based application for reusable text classification at scale ☆37 · Jul 23, 2023 · Updated 2 years ago
- Convenient Text-to-Text Training for Transformers ☆18 · Dec 10, 2021 · Updated 4 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models ☆11 · Jan 19, 2024 · Updated 2 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities. ☆374 · Dec 8, 2022 · Updated 3 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,954 · Apr 27, 2026 · Updated last week
- ☆67 · Mar 4, 2024 · Updated 2 years ago
- Code for Learning idiolectal style variation in online register ☆10 · May 18, 2023 · Updated 2 years ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics. ☆7,578 · Feb 20, 2026 · Updated 2 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆3,033 · Apr 20, 2026 · Updated 2 weeks ago
- Python Finite-State Toolkit ☆65 · Apr 1, 2026 · Updated last month
- Active Learning for Text Classification in Python ☆640 · Apr 17, 2026 · Updated 2 weeks ago
- DSPy: The framework for programming—not prompting—language models ☆34,180 · Updated this week
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input" ☆1,064 · Mar 7, 2024 · Updated 2 years ago
- [EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆394 · Nov 7, 2023 · Updated 2 years ago
- Minimal keyword extraction with BERT ☆4,163 · Feb 3, 2026 · Updated 3 months ago
- Top2Vec learns jointly embedded topic, document and word vectors. ☆3,106 · Nov 14, 2024 · Updated last year
- ☆13 · Feb 7, 2023 · Updated 3 years ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models ☆1,018 · Aug 21, 2024 · Updated last year
- State-of-the-Art Text Embeddings ☆18,615 · Updated this week
- Fast, general, and tested differentiable structured prediction in PyTorch ☆1,128 · Apr 20, 2022 · Updated 4 years ago
- ☆12 · Jan 29, 2021 · Updated 5 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. ☆2,168 · Oct 16, 2025 · Updated 6 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆210 · Aug 31, 2024 · Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆104 · Feb 26, 2024 · Updated 2 years ago
- MARNNs Can Learn Generalized Dyck Languages ☆12 · Nov 11, 2019 · Updated 6 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆20 · Feb 7, 2023 · Updated 3 years ago