ianmkim/gpt_llm

Multi-GPU setup for inference with GPT NeoX 20B and OPT-30B models in Hugging Face

☆26

Alternatives and similar repositories for gpt_llm

Users who are interested in gpt_llm are comparing it to the libraries listed below.

latynt / ans
Arabic News Stance Corpus
☆11 · Feb 5, 2021 · Updated 5 years ago
hieudx149 / X-RetroMAE
Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder
☆10 · Mar 16, 2023 · Updated 3 years ago
google / t5patches
T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.
☆12 · May 31, 2024 · Updated 2 years ago
sho-luv / gpt_tools
These are tools I created with the help of ChatGPT to help me with Penetration Testing and Red Teaming
☆14 · Feb 24, 2024 · Updated 2 years ago
tengxiaoliu / LM_skip
[NeurIPS 2024] Can Language Models Learn to Skip Steps?
☆21 · Jan 25, 2025 · Updated last year
amazon-science / wqa-multi-sentence-inference
This repository contains code used for our Multi Sentence Inference NAACL'22 paper.
☆12 · Mar 6, 2023 · Updated 3 years ago
amazon-science / unique-batches
☆11 · Aug 13, 2024 · Updated last year
Tomiinek / Aargh
☆12 · Jan 2, 2024 · Updated 2 years ago
CreaLabs / Enhanced-BGE-M3-with-CLP-and-MoE
This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…
☆11 · Dec 27, 2024 · Updated last year
thunlp / CSS-LM
CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models
☆11 · Jul 1, 2023 · Updated 3 years ago
ShaojieJiang / CT-Loss
The contrastive token loss function for reducing generative repetition of autoregressive neural language models.
☆13 · May 11, 2022 · Updated 4 years ago
ychen-stat-ml / kernel-adapters
Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…
☆11 · Feb 6, 2023 · Updated 3 years ago
UKPLab / incorporating-relevance
Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…
☆14 · Mar 30, 2026 · Updated 3 months ago
UKPLab / AdaSent
This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…
☆16 · Jun 3, 2024 · Updated 2 years ago
declare-lab / WikiDes
A Wikipedia-based summarization dataset
☆14 · Mar 27, 2023 · Updated 3 years ago
amazon-science / irgr
☆12 · Jul 6, 2023 · Updated 3 years ago
Mihir3009 / In-BoXBART
In-BoXBART: Get Instructions into Biomedical Multi-task Learning
☆15 · Aug 23, 2022 · Updated 3 years ago
MeetElise / surprise-similarity
A context-aware embedding similarity score
☆11 · Aug 23, 2023 · Updated 2 years ago
izhx / uni-rep
Code for embedding and retrieval research.
☆16 · Oct 24, 2023 · Updated 2 years ago
dianaow / d3-network-time
d3 plugin to create a temporal network visualization
☆18 · Jan 6, 2023 · Updated 3 years ago
JohnTailor / BertSenClu
Topic Model based on Pretrained Sentence Embeddings (with BERT)
☆13 · Feb 8, 2023 · Updated 3 years ago
MikeWangWZHL / Zemi
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
☆15 · May 3, 2023 · Updated 3 years ago
yilei-wu / SupReMix
This repository contains the implementation code for paper: Mixup Your Own Pairs
☆11 · Oct 1, 2023 · Updated 2 years ago
wilson1yan / VideoGPT-Paper
☆18 · Apr 15, 2021 · Updated 5 years ago
xjtuYW / PNP
Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery
☆15 · Apr 28, 2024 · Updated 2 years ago
Sreyan88 / Disfluency-Detection-with-Span-Classification
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆14 · Jun 6, 2023 · Updated 3 years ago
namiyousef / argument-mining
Repository for NLP project. Name to be changed when we decide on a project
☆16 · Apr 19, 2022 · Updated 4 years ago
Crisp-Unimib / MERLIN
MERLIN is a global, model-agnostic, contrastive explainer for any tabular or text classifier. It provides contrastive explanations of how…
☆19 · Sep 15, 2023 · Updated 2 years ago
MichaelZhouwang / Sequence_Span_Rewriting
Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
☆17 · Nov 30, 2021 · Updated 4 years ago
jys5609 / GPT-Critic
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems
☆10 · Jul 7, 2022 · Updated 4 years ago
maxschelski / pytorch-cluster-metrics
Pytorch implementation of standard metrics for clustering
☆10 · Mar 21, 2023 · Updated 3 years ago
tlkh / t2t-tuner
Convenient Text-to-Text Training for Transformers
☆18 · Dec 10, 2021 · Updated 4 years ago
amirmohan / SPROUT
SPROUT is a machine learning tool to predict the DNA repair outcome in CRISPR experiments.
☆16 · Jul 17, 2021 · Updated 5 years ago
kookeej / CORAL
Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"
☆14 · Sep 9, 2025 · Updated 10 months ago
juexinlin / dense_retrieval_relevance_filter
☆15 · Aug 2, 2024 · Updated last year
abhinav-bohra / Emotional-Analysis-Multitasking-Framework
My implementation of the research paper - All-in-One: Emotion, Sentiment and Intensity Prediction using a Multi-task Ensemble Framework
☆17 · Apr 16, 2021 · Updated 5 years ago
terrierteam / pyterrier_t5
☆17 · Apr 30, 2026 · Updated 2 months ago
Heidelberg-NLP / CCKG
Repository to create CCKGs from the paper "Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-…
☆11 · May 23, 2025 · Updated last year
ielab / CharacterBERT-DR
The official repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…
☆16 · May 4, 2022 · Updated 4 years ago