☆67Aug 24, 2022Updated 3 years ago
Alternatives and similar repositories for minimal-opt
Users that are interested in minimal-opt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆131Jun 9, 2022Updated 3 years ago
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆20Nov 16, 2022Updated 3 years ago
- ☆10Mar 28, 2022Updated 4 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- ☆20Oct 3, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)☆33Sep 22, 2025Updated 6 months ago
- ☆12Mar 7, 2022Updated 4 years ago
- ☆78Dec 7, 2023Updated 2 years ago
- ☆16Oct 6, 2022Updated 3 years ago
- A tiny FP8 multiplication unit written in Verilog. TinyTapeout 2 submission.☆14Nov 23, 2022Updated 3 years ago
- ☆103Apr 11, 2025Updated 11 months ago
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding☆10Jul 15, 2023Updated 2 years ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- ☆16Dec 9, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ✒️ ChatGPT as a writing partner.☆14Mar 6, 2023Updated 3 years ago
- A basic pure pytorch implementation of flash attention☆16Oct 28, 2024Updated last year
- Implementation of Mixout with PyTorch☆75Dec 21, 2022Updated 3 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Apr 13, 2023Updated 2 years ago
- Source code of paper “A Novel Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation”☆16Nov 25, 2021Updated 4 years ago
- Statistics and Accepted paper list of ACL 2020 with arXiv link☆23May 30, 2020Updated 5 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- ☆13Jun 20, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 5 months ago
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …☆13Mar 25, 2024Updated 2 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Aug 16, 2021Updated 4 years ago
- ☆13Feb 7, 2023Updated 3 years ago
- Using FlexAttention to compute attention with different masking patterns☆47Sep 22, 2024Updated last year
- ☆20Nov 23, 2022Updated 3 years ago
- ☆26Jan 9, 2023Updated 3 years ago
- Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models☆143Sep 4, 2022Updated 3 years ago
- CLaF: Open-Source Clova Language Framework☆216Mar 26, 2021Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell☆14May 28, 2023Updated 2 years ago
- NABERT model for solving the DROP dataset☆26Jul 1, 2019Updated 6 years ago
- JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do…☆13Nov 5, 2024Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆476Mar 7, 2024Updated 2 years ago
- Run-time validation of tensors for machine-learning systems.☆11Apr 8, 2021Updated 4 years ago
- Computing gradients and Hessians of feed-forward networks with GPU acceleration☆20Feb 14, 2024Updated 2 years ago
- Basic exercises of chinese information processing☆34Sep 1, 2021Updated 4 years ago