Gzip and nearest neighbors for text classification
☆57Aug 1, 2023Updated 2 years ago
Alternatives and similar repositories for nn_plus_gzip
Users that are interested in nn_plus_gzip are comparing it to the libraries listed below
Sorting:
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆35Aug 15, 2023Updated 2 years ago
- ☆30Nov 23, 2025Updated 3 months ago
- Prototype for a Category Theory-based GNN Library☆15Apr 20, 2022Updated 3 years ago
- ☆14Dec 30, 2022Updated 3 years ago
- Open source package corrections, policy rules and other configuration files for the OSS Review Toolkit.☆21Updated this week
- Identify irrelevant negative keywords from your Google Ads account☆32Feb 11, 2026Updated 3 weeks ago
- ☆18Aug 28, 2023Updated 2 years ago
- Code for the Data Without Labels☆29May 14, 2025Updated 9 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Feb 9, 2023Updated 3 years ago
- An extension of Py-Boost to probabilistic modelling☆24Jan 19, 2023Updated 3 years ago
- Quantification of Uncertainty with Adversarial Models☆29Jul 11, 2023Updated 2 years ago
- Pydata talk - Football Analytics Using Hierarchical Bayesian Models in PyMC☆25Oct 30, 2021Updated 4 years ago
- Data Structures with Python(AIX20001) 강의 자료실☆18Jun 14, 2024Updated last year
- 👁️🗨️ Scientists often do the same bad stuff. Automate giving deterministic feedback during peer review with determinstic (LLM-free)☆30May 9, 2025Updated 9 months ago
- OWASP Foundation web repository☆21Jan 7, 2026Updated last month
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- One stop-shop for matplotlib based visualizations☆10Jun 9, 2025Updated 8 months ago
- We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset at here https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt an…☆30Apr 4, 2023Updated 2 years ago
- ☆14Oct 23, 2025Updated 4 months ago
- Fast Bayesian A/B and Multivariate testing.☆36Dec 27, 2022Updated 3 years ago
- ☆39Jul 13, 2022Updated 3 years ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- Advanced Analytics data collection for M365 usage☆19Updated this week
- Biological Relationships - Biorels data preparation infrastructure for biology and drug discovery☆15May 19, 2025Updated 9 months ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- ☆13Nov 21, 2025Updated 3 months ago
- Power Apps Service Desk template Fixed☆13Dec 22, 2024Updated last year
- Models for automatically transforming toxic text to neutral☆36Oct 5, 2023Updated 2 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Oct 25, 2022Updated 3 years ago
- Agriculture Land and Commission System☆10Updated this week
- ☆83Apr 16, 2024Updated last year
- Claudette's sister, a helper for OpenAI GPT☆44Jan 29, 2026Updated last month
- Repository for Protein-Vec, a protein embedding mixture of experts model☆38Jan 24, 2024Updated 2 years ago
- WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)☆160Oct 10, 2022Updated 3 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Chaos - a first of its kind framework for researching Reciprocal Recommender Systems (RRS).☆12Nov 7, 2021Updated 4 years ago
- An LLM-based tool for literature synthesis☆14Jun 20, 2024Updated last year
- Dataset Catalogue Homepage for Indonesian Languages☆10Feb 19, 2024Updated 2 years ago
- A toolkit for using non-traditional meteorological observations☆20Updated this week