kay-cottage / Mini_Reverse_Proxy
A mini Python tool for NAT traversal (intranet penetration) and reverse/forward proxying, implemented in under 100 lines of code
☆11 Updated last year
Alternatives and similar repositories for Mini_Reverse_Proxy:
Users who are interested in Mini_Reverse_Proxy are comparing it to the repositories listed below:
- Course project code for the graduate course "Network Big Data Management: Theory and Applications" ☆13 Updated 2 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions ☆21 Updated 2 years ago
- Inference framework for MoE layers based on TensorRT with Python binding ☆41 Updated 3 years ago
- ☆52 Updated this week
- Application System Architecture ☆23 Updated last year
- Summary of system papers/frameworks/codes/tools on training or serving large model ☆56 Updated last year
- torch.compile artifacts for common deep learning models, can be used as a learning resource for torch.compile ☆16 Updated last year
- ☆11 Updated 3 years ago
- Efficient inference of large language models. ☆146 Updated 3 months ago
- Tiny C++11 GPT-2 inference implementation from scratch ☆57 Updated 3 months ago
- Yet another toy processor implementation ☆15 Updated 3 years ago
- An easily extensible framework for understanding and optimizing CUDA operators, intended for learning purposes only ☆13 Updated 9 months ago
- An interpreted functional programming language ☆13 Updated 4 months ago
- GPTQ inference TVM kernel ☆38 Updated 11 months ago
- CS 346 RedBase Project (Stanford) ☆37 Updated 9 years ago
- ☆25 Updated last year
- A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters ☆44 Updated 8 months ago
- Slides on how to give a technical talk ☆21 Updated 4 years ago
- Elixir: Train a Large Language Model on a Small GPU Cluster ☆14 Updated last year
- Linux io_uring-based C++20 coroutine library ☆29 Updated 2 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of … ☆24 Updated 3 months ago
- Benchmark tests supporting the TiledCUDA library. ☆15 Updated 4 months ago
- Database project ☆8 Updated 6 years ago
- Official repository for "IPDPS '24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices". ☆19 Updated last year
- A TVM-like CUDA/C code generator. ☆9 Updated 3 years ago
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper. ☆15 Updated 2 months ago
- A JavaScript interpreter from scratch, supporting ES5 syntax. ☆28 Updated 7 months ago
- My tests and experiments with some popular dl frameworks. ☆12 Updated last month
- Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing ☆35 Updated 2 months ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS ☆21 Updated last month