Knowledgator / utca
Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex programs
β30Updated 3 weeks ago
Alternatives and similar repositories for utca:
Users that are interested in utca are comparing it to the libraries listed below
- Trully flash implementation of DeBERTa disentangled attention mechanism.β46Updated 3 weeks ago
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Updated last year
- PyTorch implementation for MRLβ18Updated last year
- Pre-train Static Word Embeddingsβ58Updated 3 weeks ago
- β47Updated last year
- Embedding Recycling for Language modelsβ38Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extractionβ71Updated 9 months ago
- Generalist and Lightweight Model for Text Classificationβ123Updated this week
- Source code and data for Like a Good Nearest Neighborβ28Updated 3 months ago
- Using short models to classify long textsβ21Updated 2 years ago
- GLiNER model in a FastAPI microservice.β42Updated 4 months ago
- A Python library aimed at dissecting and augmenting NER training data.β58Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β30Updated 8 months ago
- Efficient few-shot learning with cross-encoders.β51Updated last year
- A RAG that can scale π§π»βπ»β11Updated 11 months ago
- StAtutory Reasoning Assessmentβ13Updated 2 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β76Updated 6 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documentsβ13Updated 8 months ago
- β41Updated 2 years ago
- Plug-and-play document processing pipelines. No training. Batteries included.β57Updated last week
- EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extractionβ24Updated 11 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engineβ31Updated 3 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddingsβ19Updated 3 months ago
- β45Updated 3 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 laβ¦β48Updated last year
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domainβ¦β52Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchiβ¦β33Updated 11 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β80Updated last year
- Fact checking baseline combining dense retrieval and textual entailmentβ28Updated 3 months ago
- Python library to use Pleias-RAG modelsβ36Updated this week