Scripts to convert datasets from various sources to Hugging Face Datasets.
★57 · Oct 26, 2022 · Updated 3 years ago
Alternatives and similar repositories for huggingface-datasets-converter
Users that are interested in huggingface-datasets-converter are comparing it to the libraries listed below.
- 🤗 A collection of templates for Hugging Face Spaces ★34 · Oct 9, 2023 · Updated 2 years ago
- Examples using 🤗 Hub to share and reload machine learning models ★32 · Nov 3, 2022 · Updated 3 years ago
- A composite GitHub Action to login to the HuggingFace Hub ★15 · Feb 4, 2023 · Updated 3 years ago
- ★10 · Mar 29, 2021 · Updated 5 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ★17 · Mar 20, 2024 · Updated 2 years ago
- Official code for the paper: "Metadata Archaeology" ★19 · May 10, 2023 · Updated 3 years ago
- 🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web. ★318 · May 7, 2024 · Updated 2 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… ★29 · Apr 17, 2024 · Updated 2 years ago
- Execute arbitrary SQL queries on 🤗 Datasets ★32 · Jan 24, 2024 · Updated 2 years ago
- High-performance, asynchronous Python HTTP client library designed for faster file transfers using concurrency, semaphores, and fault-tol… ★60 · May 12, 2025 · Updated last year
- Educational materials for universities ★389 · Sep 29, 2023 · Updated 2 years ago
- Notebooks to demonstrate TimmWrapper ★16 · Jan 16, 2025 · Updated last year
- This repo consists of code for plotting top loss images ★13 · May 18, 2020 · Updated 6 years ago
- A collection of generative and training notebooks getting mirrored to google colab. ★12 · May 29, 2022 · Updated 4 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ★161 · Apr 3, 2024 · Updated 2 years ago
- Manage scalable open LLM inference endpoints in Slurm clusters ★288 · Jul 11, 2024 · Updated last year
- Contains Colab Notebooks showing cool use-cases of different GCP ML APIs. ★10 · Nov 5, 2020 · Updated 5 years ago
- Chrome Extension for exploring Hugging Face datasets ★48 · Sep 18, 2024 · Updated last year
- ★13 · Dec 12, 2021 · Updated 4 years ago
- Build fast gradio demos of fastai learners ★35 · Sep 23, 2021 · Updated 4 years ago
- [ICLR'23] Code to reproduce the results in the paper "PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs" ★58 · Jun 8, 2023 · Updated 3 years ago
- An open collection of implementation tips, tricks and resources for training large language models ★502 · Mar 8, 2023 · Updated 3 years ago
- Visual Taste Approximator (VTA) is a very simple tool that helps anyone create an automatic replica of themselves that can approximate th… ★40 · Sep 11, 2022 · Updated 3 years ago
- Enterprise Scale NLP with Hugging Face & SageMaker Workshop series ★241 · Jan 20, 2023 · Updated 3 years ago
- Notebooks for docarray, Jina, Finetuner, and other products from Jina AI ★12 · Mar 31, 2022 · Updated 4 years ago
- GitHub action that'll sync files from a GitHub Repo with the Hugging Face Hub 🤗 ★82 · Oct 30, 2024 · Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ★69 · Feb 8, 2023 · Updated 3 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ★32 · Sep 19, 2025 · Updated 8 months ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper. ★12 · Mar 6, 2023 · Updated 3 years ago
- OpenAI CLIP based image generator with complex config file controlled transformation and training pipelines ★19 · Jan 4, 2022 · Updated 4 years ago
- Evaluate Transformers from the Hub 🔥 ★14 · May 26, 2026 · Updated 3 weeks ago
- DALLE-tools provided useful dataset utilities to improve your workflow with WebDatasets. ★14 · Mar 9, 2022 · Updated 4 years ago
- ★15 · Aug 3, 2021 · Updated 4 years ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual… ★47 · Dec 2, 2024 · Updated last year
- ★27 · Jun 5, 2026 · Updated last week
- Matching in GAN latent space for better bias benchmarking and semantic image editing. ★20 · Mar 24, 2023 · Updated 3 years ago
- Source code of "Hold me tight! Influence of discriminative features on deep network boundaries" ★21 · Dec 10, 2021 · Updated 4 years ago
- The spiritual successor to knockknock for PyTorch Lightning, get notified when your training ends ★77 · Updated this week
- Fancylit is a python module that contains pre-packaged Streamlit code to render fancy visualizations, run modeling tasks, and data explor… ★11 · Oct 19, 2021 · Updated 4 years ago