rasbt / nn_plus_gzip
Gzip and nearest neighbors for text classification
☆57, updated last year
Alternatives and similar repositories for nn_plus_gzip:
Users who are interested in nn_plus_gzip are comparing it to the libraries listed below:
- 🤗 Trade any tensors over the network (☆30, updated last year)
- Repository containing awesome resources regarding Hugging Face tooling. (☆47, updated last year)
- Use sync mode Playwright interactively, inside a Jupyter notebook (☆14, updated last month)
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. (☆34, updated 4 months ago)
- Command Line Interface for Hugging Face Inference Endpoints (☆66, updated last year)
- QLoRA for Masked Language Modeling (☆22, updated last year)
- A miniature AI training framework for PyTorch (☆42, updated 3 months ago)
- ☆24, updated last year
- ☆35, updated 2 weeks ago
- NLP with Rust for Python 🦀🐍 (☆62, updated 11 months ago)
- Genalog is an open source, cross-platform Python package allowing generation of synthetic document images with custom degradations and te… (☆42, updated last year)
- ML/DL Math and Method notes (☆60, updated last year)
- Highly commented implementations of Transformers in PyTorch (☆136, updated last year)
- Pre-train Static Word Embeddings (☆58, updated 3 weeks ago)
- Using short models to classify long texts (☆21, updated 2 years ago)
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… (☆34, updated last year)
- Training and Inference Notebooks for the RedPajama (OpenLlama) models (☆18, updated last year)
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed (☆35, updated last year)
- ☆47, updated last year
- Study the temporal performance degradation of machine learning models. (☆16, updated last year)
- ☆77, updated 11 months ago
- Named Entity Recognition with a decoder-only (autoregressive) LLM using HuggingFace (☆41, updated 5 months ago)
- Table detection with Florence. (☆13, updated 9 months ago)
- Drift detection module for machine learning pipelines. (☆25, updated last year)
- A pipeline for using API calls to agnostically convert unstructured data into structured training data (☆30, updated 7 months ago)
- An introduction to LLM Sampling (☆77, updated 4 months ago)
- Check for data drift between two OpenAI multi-turn chat jsonl files. (☆37, updated last year)
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. (☆25, updated 2 years ago)
- I learn about and explain quantization (☆26, updated last year)
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… (☆49, updated 9 months ago)