stanfordnlp / string2string
String-to-String Algorithms for Natural Language Processing
☆541Updated 7 months ago
Alternatives and similar repositories for string2string:
Users that are interested in string2string are comparing it to the libraries listed below
- 🤖 A PyTorch library of curated Transformer models and their composable components☆884Updated 11 months ago
- ☆357Updated last year
- All-in-one text de-duplication☆664Updated 10 months ago
- A Python Search Engine for Humans 🥸☆211Updated 10 months ago
- SGPT: GPT Sentence Embeddings for Semantic Search☆864Updated last year
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.☆550Updated 9 months ago
- Easily embed, cluster and semantically label text datasets☆516Updated 11 months ago
- Explore and interpret large embeddings in your browser with interactive visualization! 📍☆447Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆189Updated 5 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆122Updated 3 months ago
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.☆1,327Updated last year
- Build, evaluate, understand, and fix LLM-based apps☆487Updated last year
- Interpretability for sequence generation models 🐛 🔍☆408Updated 4 months ago
- Neural Search☆352Updated last week
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆358Updated 11 months ago
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network☆285Updated 5 months ago
- potato: portable text annotation tool☆323Updated this week
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.☆552Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆329Updated last year
- Pretraining Efficiently on S2ORC!☆158Updated 4 months ago
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03…☆529Updated last year
- ☆1,193Updated 7 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆999Updated 7 months ago
- 🦙 Integrating LLMs into structured NLP pipelines☆1,213Updated 2 months ago
- An open collection of implementation tips, tricks and resources for training large language models☆471Updated 2 years ago
- Blazing fast framework for fine-tuning similarity learning models☆656Updated 2 months ago
- Active Learning for Text Classification in Python☆608Updated this week
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆839Updated 7 months ago
- utilities for decoding deep representations (like sentence embeddings) back to text☆777Updated last month
- Guideline following Large Language Model for Information Extraction☆354Updated 4 months ago