nateraw / spaces-docker-templates
🤗 A collection of templates for Hugging Face Spaces
★35 · Oct 9, 2023 · Updated 2 years ago
Alternatives and similar repositories for spaces-docker-templates
Users that are interested in spaces-docker-templates are comparing it to the libraries listed below
- Scripts to convert datasets from various sources to Hugging Face Datasets. ★57 · Oct 26, 2022 · Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… ★28 · Apr 17, 2024 · Updated last year
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ★28 · Oct 3, 2021 · Updated 4 years ago
- GitHub action that'll sync files from a GitHub Repo with the Hugging Face Hub 🤗 ★79 · Oct 30, 2024 · Updated last year
- User-friendly viewer for Parquet files ★10 · Jan 10, 2026 · Updated last month
- High-performance, asynchronous Python HTTP client library designed for faster file transfers using concurrency, semaphores, and fault-tol… ★59 · May 12, 2025 · Updated 9 months ago
- Code for SaGe subword tokenizer (EACL 2023) ★27 · Nov 30, 2024 · Updated last year
- Python Module implementing SRP ★12 · Jul 29, 2022 · Updated 3 years ago
- FlexiTokens ★19 · Dec 27, 2025 · Updated last month
- Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment ★11 · Apr 6, 2025 · Updated 10 months ago
- ★16 · Aug 10, 2022 · Updated 3 years ago
- ★13 · Dec 6, 2024 · Updated last year
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?" ★18 · May 15, 2025 · Updated 9 months ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets. ★13 · Nov 21, 2023 · Updated 2 years ago
- ANE accelerated embedding models! ★20 · Dec 11, 2024 · Updated last year
- PathPiece tokenizer ★13 · Nov 10, 2024 · Updated last year
- ★16 · Dec 14, 2022 · Updated 3 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations" ★13 · Dec 14, 2021 · Updated 4 years ago
- Command Line Interface for running 🤗 Transformers Image Classification locally ★19 · May 8, 2025 · Updated 9 months ago
- The training codes of Jasper-Token-Compression-600M ★19 · Nov 19, 2025 · Updated 2 months ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper ★14 · Aug 9, 2021 · Updated 4 years ago
- The official repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI… ★16 · May 4, 2022 · Updated 3 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow ★14 · Apr 30, 2023 · Updated 2 years ago
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition ★15 · Jun 28, 2023 · Updated 2 years ago
- DImensionality REduction in JAX ★24 · Nov 21, 2025 · Updated 2 months ago
- Drop in replacement for OpenAI, but with Open models. ★156 · May 11, 2023 · Updated 2 years ago
- Exploring semantic similarities between contextualized embeddings ★14 · May 18, 2021 · Updated 4 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021) ★18 · May 10, 2023 · Updated 2 years ago
- A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining ★18 · Nov 26, 2023 · Updated 2 years ago
- Official code for the paper: "Metadata Archaeology" ★19 · May 10, 2023 · Updated 2 years ago
- Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER ★21 · Jul 19, 2023 · Updated 2 years ago
- Learning to Model Editing Processes ★26 · Aug 3, 2025 · Updated 6 months ago
- Hugging Face and Pyserini interoperability ★19 · May 18, 2023 · Updated 2 years ago
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API. ★50 · May 8, 2023 · Updated 2 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation" ★21 · Feb 14, 2024 · Updated 2 years ago
- Contextualized per-token embeddings ★34 · May 11, 2025 · Updated 9 months ago
- Code for AAAI 2023 Paper: “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models” ★18 · Dec 6, 2022 · Updated 3 years ago
- Temporarily removes unused tokens during training to save RAM and improve speed. ★23 · Jun 15, 2025 · Updated 8 months ago
- BPE modification that removes intermediate tokens during tokenizer training. ★26 · Nov 25, 2024 · Updated last year