zphang/minimal-opt

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zphang/minimal-opt)

zphang / minimal-opt

☆67

Alternatives and similar repositories for minimal-opt

Users that are interested in minimal-opt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zphang / minimal-gpt-neox-20b
View on GitHub
☆131Jun 9, 2022Updated 4 years ago
suzgunmirac / crowd-sampling
View on GitHub
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding
☆20Nov 16, 2022Updated 3 years ago
zja-nlp / NAT_with_DAD
View on GitHub
☆10Mar 28, 2022Updated 4 years ago
zhliu0106 / learning-to-refuse
View on GitHub
Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
☆10Dec 13, 2024Updated last year
Isaac-Flath / QuartoTemplates
View on GitHub
☆20Oct 3, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
neulab / neural-lpcfg
View on GitHub
The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)
☆33Sep 22, 2025Updated 10 months ago
deep-spin / sparse-communication
View on GitHub
☆12Mar 7, 2022Updated 4 years ago
leogao2 / lm_dataformat
View on GitHub
☆79Dec 7, 2023Updated 2 years ago
amy-hyunji / Contextualized-Generative-Retrieval
View on GitHub
☆16Oct 6, 2022Updated 3 years ago
lbox-kr / lbox-open
View on GitHub
☆108Apr 11, 2025Updated last year
lezhang7 / TreeMix
View on GitHub
[NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
☆10Jul 15, 2023Updated 3 years ago
emalach / LinearLM
View on GitHub
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21Jul 29, 2024Updated 2 years ago
craffel / comp664-deep-learning-spring-2023
View on GitHub
Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC
☆14Apr 17, 2023Updated 3 years ago
ZurichNLP / coverage-contrastive-conditioning
View on GitHub
Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…
☆22Apr 13, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Spico197 / writing-comrade
View on GitHub
✒️ ChatGPT as a writing partner.
☆14Mar 6, 2023Updated 3 years ago
CLAIRE-Labo / flash_attention
View on GitHub
A basic pure pytorch implementation of flash attention
☆17Oct 28, 2024Updated last year
bloodwass / mixout
View on GitHub
Implementation of Mixout with PyTorch
☆75Dec 21, 2022Updated 3 years ago
hunkim / ACL-2020-Papers
View on GitHub
Statistics and Accepted paper list of ACL 2020 with arXiv link
☆23May 30, 2020Updated 6 years ago
harvardnlp / cascaded-generation
View on GitHub
Cascaded Text Generation with Markov Transformers
☆130Mar 20, 2023Updated 3 years ago
UCL-COMP0233-2022-2023 / RSE-Classwork
View on GitHub
☆11Oct 13, 2023Updated 2 years ago
neukg / KAT-TSLF
View on GitHub
Source code of paper “A Novel Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation”
☆16Nov 25, 2021Updated 4 years ago
srush / tangent
View on GitHub
Source-to-Source Debuggable Derivatives in Pure Python
☆15Jan 23, 2024Updated 2 years ago
r-three / RAD
View on GitHub
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆45Oct 1, 2025Updated 9 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
tm4roon / data-augmentation-for-nlp
View on GitHub
An implementation of data augmentation methods for natural language processing tasks.
☆13Jul 25, 2024Updated 2 years ago
Nyandwi / MultiModal-Learning-Research
View on GitHub
A curated resources on what's happening in multimodal learning. Features recent papers, books, related lectures, and other relevant resou…
☆16Apr 28, 2023Updated 3 years ago
seonghyeonye / TAPP
View on GitHub
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
☆79Sep 13, 2024Updated last year
sustcsonglin / mamba-triton
View on GitHub
☆52Jan 28, 2024Updated 2 years ago
dame-cell / Triformer
View on GitHub
Transformers components but in Triton
☆34May 9, 2025Updated last year
uma-pi1 / kgt5-context
View on GitHub
☆12Jun 20, 2024Updated 2 years ago
daanzu / wav2vec2_stt_python
View on GitHub
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆23Aug 16, 2021Updated 4 years ago
elizabethnewman / hessQuik
View on GitHub
Computing gradients and Hessians of feed-forward networks with GPU acceleration
☆20Feb 14, 2024Updated 2 years ago
emorynlp / seq2seq-corenlp
View on GitHub
☆13Feb 7, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cchan / nanoGPT-fp8
View on GitHub
☆13May 8, 2023Updated 3 years ago
samhita-alla / geolocator
View on GitHub
Location Predictor 📍
☆16Jul 13, 2026Updated 2 weeks ago
wmt-conference / wmt21-news-systems
View on GitHub
☆26Jan 9, 2023Updated 3 years ago
benzakenelad / BitFit
View on GitHub
Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
☆143Sep 4, 2022Updated 3 years ago
asomoza / mellon-modular-diffusers
View on GitHub
☆11May 14, 2025Updated last year
contrebande-labs / charred
View on GitHub
CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell
☆14May 28, 2023Updated 3 years ago
raylin1000 / drop-bert
View on GitHub
NABERT model for solving the DROP dataset
☆26Jul 1, 2019Updated 7 years ago