yeahjack / chatgpt_zulip_bot
A Zulip bot that responds to users via ChatGPT.
☆16 · Updated last year
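A bot like this is essentially a bridge between the Zulip messaging API and the OpenAI chat API. The sketch below shows the general shape, assuming the `zulip` and `openai` Python packages, a `~/.zuliprc` file with the bot's credentials, and an `OPENAI_API_KEY` environment variable; it is illustrative, not the repository's actual implementation.

```python
# Minimal Zulip + ChatGPT bot sketch (assumptions: `zulip` and `openai`
# packages installed, ~/.zuliprc present, OPENAI_API_KEY set in the env).
# Not the actual code of chatgpt_zulip_bot.
import zulip
from openai import OpenAI

zulip_client = zulip.Client(config_file="~/.zuliprc")
openai_client = OpenAI()  # reads OPENAI_API_KEY from the environment


def handle_message(message: dict) -> None:
    # Ignore the bot's own messages to avoid reply loops.
    if message["sender_email"] == zulip_client.email:
        return

    # Ask ChatGPT for a reply to the incoming message text.
    completion = openai_client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": message["content"]}],
    )
    reply = completion.choices[0].message.content

    # Answer in place: same stream and topic for stream messages,
    # directly back to the sender for private messages.
    if message["type"] == "stream":
        zulip_client.send_message({
            "type": "stream",
            "to": message["display_recipient"],
            "topic": message["subject"],
            "content": reply,
        })
    else:
        zulip_client.send_message({
            "type": "private",
            "to": [message["sender_email"]],
            "content": reply,
        })


# Blocks and invokes handle_message for every message the bot can see.
zulip_client.call_on_each_message(handle_message)
```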
Alternatives and similar repositories for chatgpt_zulip_bot:
Users who are interested in chatgpt_zulip_bot are comparing it to the libraries listed below.
- The Quartz Quantum Compiler ☆79 · Updated this week
- ☆27 · Updated last year
- Estimate MFU for DeepSeekV3 ☆14 · Updated 2 weeks ago
- PyTorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference ☆29 · Updated 7 months ago
- ☆36 · Updated 4 months ago
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees** ☆27 · Updated last year
- ☆12 · Updated 5 months ago
- A happy way for research! ☆24 · Updated last year
- Code release for AdapMoE, accepted by ICCAD 2024 ☆10 · Updated 2 months ago
- ☆59 · Updated 2 months ago
- ☆13 · Updated 8 months ago
- ☆16 · Updated 3 weeks ago
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models ☆33 · Updated last week
- A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters ☆37 · Updated 5 months ago
- SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference ☆36 · Updated 2 months ago
- ☆48 · Updated last year
- Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding" ☆114 · Updated 10 months ago
- ☆23 · Updated 2 years ago
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024) ☆51 · Updated last month
- ☆13 · Updated last month
- Course notes for Cyber Security (THUCST 2023 Spring) ☆26 · Updated last year
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models ☆18 · Updated 7 months ago
- [EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization ☆27 · Updated 3 months ago
- ☆14 · Updated last year
- SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs ☆21 · Updated last month
- This repo contains the source code for "Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs" ☆32 · Updated 5 months ago
- EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs) ☆52 · Updated 7 months ago
- An implementation of Flash Attention using CuTe ☆65 · Updated last month