loretoparisi / hf-experiments
Experiments with Hugging Face π¬ π€
β45Updated 6 months ago
Alternatives and similar repositories for hf-experiments:
Users that are interested in hf-experiments are comparing it to the libraries listed below
- Documentation effort for the BookCorpus datasetβ33Updated 3 years ago
- Repository for fine-tuning Transformers π€ based seq2seq speech models in JAX/Flax.β34Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.β76Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER modelsβ32Updated 2 years ago
- Execute arbitrary SQL queries on π€ Datasetsβ32Updated last year
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated 11 months ago
- Detecting gibberish as a type of sentiment analysis with GPT2β24Updated 4 years ago
- Speech in Flax/JAXβ15Updated 2 years ago
- Tutorial to pretrain & fine-tune a π€ Flax T5 model on a TPUv3-8 with GCPβ58Updated 2 years ago
- β17Updated 6 months ago
- Source code for the Apple reproductionβ31Updated 3 years ago
- Using short models to classify long textsβ21Updated last year
- URL downloader supporting checkpointing and continuous checksumming.β19Updated last year
- spaCy match and replace, maintaining conjugationβ35Updated 2 years ago
- docker for HF wav2vec2-sprintβ13Updated 3 years ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)β30Updated last month
- A π₯ cookiecutter template for building Hugging Face Spacesβ11Updated 3 years ago
- Finds linguistic patterns effortlesslyβ35Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ23Updated 3 years ago
- A Streamlit app to add structured tags to a dataset cardβ22Updated 2 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.β11Updated 5 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languagesβ13Updated 2 years ago
- GPT-jax based on the official huggingface libraryβ13Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/β¦β25Updated 10 months ago
- Zero-shot Audio Classification using Whisperβ78Updated 2 years ago
- β30Updated 2 years ago
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) distaβ¦β23Updated 5 months ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.β58Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks availableβ14Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.β22Updated 2 years ago