Reduce the size of pretrained Hugging Face models via vocabulary trimming.
☆48Dec 28, 2022Updated 3 years ago
Alternatives and similar repositories for hf-trim
Users that are interested in hf-trim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 多语言降噪预训练模型MBart的中文生成任务☆11May 27, 2021Updated 4 years ago
- REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.☆51Sep 5, 2021Updated 4 years ago
- Unofficial implementation of QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition.☆64Oct 15, 2022Updated 3 years ago
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 2 years ago
- ☆10May 31, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Winning Solution for the M5 Competition for Uncertainty Forecasting☆10May 25, 2023Updated 2 years ago
- AirLLM 70B inference with single 4GB GPU☆20Jun 27, 2025Updated 9 months ago
- Pipeline for training Stanford Seq2Seq Neural Machine Translation using PyTorch.☆12Jan 17, 2021Updated 5 years ago
- Unofficial implementation of paper "InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER" (https://arxiv.…☆38Feb 14, 2024Updated 2 years ago
- Generating artificial disfluencies from fluent text easily and promptly☆15Sep 28, 2022Updated 3 years ago
- Code for extracting parallel corpora from pmindia☆17Jan 28, 2020Updated 6 years ago
- A chess engine designed to fit into 4kb☆12Updated this week
- ☆15Apr 12, 2021Updated 4 years ago
- Official Repository of "Multimodal Fusion Based Attentive Networks for Sequential Music Recommendation" accepted in BIGMM 2021☆14May 18, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference☆44Nov 28, 2022Updated 3 years ago
- BSNLP 2021☆33Nov 3, 2024Updated last year
- ☆11Mar 15, 2024Updated 2 years ago
- ever wondered how to run jupyter notebook on servers like ada?☆28Nov 8, 2021Updated 4 years ago
- Finetune Malaysian LLM for Malaysian context embedding task.☆23Apr 27, 2024Updated last year
- An opinionated NLP research template☆10Aug 29, 2024Updated last year
- Large dataset storage format for Pytorch☆45Aug 19, 2021Updated 4 years ago
- LTG-Bert☆34Jan 8, 2024Updated 2 years ago
- This repo contains my works on the area of NLP, such as Neural Machine Translation, Named Entity Recognition etc,.☆13Sep 19, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Sep 17, 2021Updated 4 years ago
- Neural discourse structure for text categorization☆12Aug 27, 2017Updated 8 years ago
- ML Reproducibility Challenge 2020: Electra reimplementation using PyTorch and Transformers☆12Apr 16, 2021Updated 4 years ago
- Lowering PyTorch's Memory Consumption for Selective Differentiation☆12Aug 29, 2024Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆64Jul 29, 2024Updated last year
- Tools for Measuring Classification Performance for R, Python and Spark☆13Jun 5, 2018Updated 7 years ago
- A Smalltalk Web Browser for Squeak/Smalltalk☆17Apr 18, 2022Updated 3 years ago
- ☆16Jun 14, 2024Updated last year
- ☆13Jul 10, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆590Apr 24, 2023Updated 2 years ago
- Using data from IBM Watson, descriptive and predictive analytics using Python and tableau☆12Dec 23, 2017Updated 8 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- [EMNLP2022] Source code for Neural Machine Translation with Contrastive Translation Memories☆12Feb 15, 2023Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated last year
- Bicleaner fork that uses neural networks☆40Feb 23, 2026Updated last month
- Calculates bounds on the sofa moving problem☆14Sep 12, 2019Updated 6 years ago