String-to-String Algorithms for Natural Language Processing
☆563Jan 25, 2026Updated 4 months ago
Alternatives and similar repositories for string2string
Users that are interested in string2string are comparing it to the libraries listed below.
- ☆141Mar 5, 2024Updated 2 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023)☆20Mar 21, 2024Updated 2 years ago
- data cleaning and curation for unstructured text☆329Aug 6, 2024Updated last year
- A Python library for calculating a large variety of metrics from text☆364May 5, 2026Updated 3 weeks ago
- Efficient few-shot learning with Sentence Transformers☆2,741Apr 17, 2026Updated last month
- Salesforce open-source LLMs with 8k sequence length.☆727Jan 31, 2025Updated last year
- ☆25Jan 22, 2024Updated 2 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- Toolkit for domain-specific information retrieval experimentation☆19May 18, 2026Updated last week
- Interpretability for sequence generation models 🐛 🔍☆466Apr 25, 2026Updated last month
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Dec 5, 2022Updated 3 years ago
- ☆22Oct 26, 2020Updated 5 years ago
- A BERT-based application for reusable text classification at scale☆37Jul 23, 2023Updated 2 years ago
- Convenient Text-to-Text Training for Transformers☆18Dec 10, 2021Updated 4 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆374Dec 8, 2022Updated 3 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,985Updated this week
- ☆67Mar 4, 2024Updated 2 years ago
- Code for Learning idiolectal style variation in online register☆10May 18, 2023Updated 3 years ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆7,641May 13, 2026Updated 2 weeks ago
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆339Dec 18, 2024Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆3,066May 6, 2026Updated 3 weeks ago
- Python Finite-State Toolkit☆68Apr 1, 2026Updated last month
- Active Learning for Text Classification in Python☆642Updated this week
- DSPy: The framework for programming—not prompting—language models☆34,631Updated this week
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"☆1,063Mar 7, 2024Updated 2 years ago
- [EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆395Nov 7, 2023Updated 2 years ago
- Minimal keyword extraction with BERT☆4,176May 13, 2026Updated 2 weeks ago
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,106Nov 14, 2024Updated last year
- ☆13Feb 7, 2023Updated 3 years ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,019Aug 21, 2024Updated last year
- State-of-the-Art Embeddings, Retrieval, and Reranking☆18,711May 21, 2026Updated last week
- ☆12Jan 29, 2021Updated 5 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,130Apr 20, 2022Updated 4 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,198Oct 16, 2025Updated 7 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆210Aug 31, 2024Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆104Feb 26, 2024Updated 2 years ago
- MARNNs Can Learn Generalized Dyck Languages☆12Nov 11, 2019Updated 6 years ago