rasbt / nn_plus_gzip
Gzip and nearest neighbors for text classification
β55Updated last year
Alternatives and similar repositories for nn_plus_gzip:
Users that are interested in nn_plus_gzip are comparing it to the libraries listed below
- β76Updated 9 months ago
- NLP with Rust for Python π¦πβ61Updated 9 months ago
- Highly commented implementations of Transformers in PyTorchβ132Updated last year
- Named Entity Recognition with an decoder-only (autoregressive) LLM using HuggingFaceβ40Updated 4 months ago
- Command Line Interface for Hugging Face Inference Endpointsβ66Updated 11 months ago
- β24Updated last year
- Use sync mode Playwright interactively, inside a Jupyter notebookβ15Updated 3 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated 3 months ago
- β47Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and teβ¦β42Updated last year
- An introduction to LLM Samplingβ77Updated 3 months ago
- Like picoGPT but for BERT.β50Updated 2 years ago
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ100Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pβ¦β34Updated last year
- ML/DL Math and Method notesβ58Updated last year
- QLoRA for Masked Language Modelingβ21Updated last year
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBreadβ18Updated 11 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training dataβ29Updated 6 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.β38Updated 11 months ago
- Drift detection module for machine learning pipelines.β21Updated last year
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.β46Updated last year
- Tools to make language models a bit easier to useβ39Updated 2 weeks ago
- π€ Disaggregators: Curated data labelers for in-depth analysis.β65Updated 2 years ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.β25Updated last year
- deep learning with pytorch lightningUpdated 4 months ago
- β19Updated 7 months ago
- β34Updated last year
- Generalist and Lightweight Model for Text Classificationβ92Updated this week