LexiYin-mh / YouTube-Abstrator-Python-NLPLinks
A Python-based tool, trained on the state-of-the-art Google Pegasus model, specializing in generating abstracts from given YouTube video ID.
☆10Updated 2 years ago
Alternatives and similar repositories for YouTube-Abstrator-Python-NLP
Users that are interested in YouTube-Abstrator-Python-NLP are comparing it to the libraries listed below
Sorting:
- Large Language Model (LLM) Systems Paper List☆1,802Updated last week
- Efficient Device Scheduling with Multi-Job Federated Learning☆21Updated 2 years ago
- ☆17Updated this week
- TinyML and Efficient Deep Learning Computing☆19Updated last year
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆16Updated 11 months ago
- 📰 Must-read papers on KV Cache Compression (constantly updating 🤗).☆658Updated 4 months ago
- Code for the 9/6 Hackathon☆52Updated 5 months ago
- ☆28Updated 6 months ago
- Advanced Scalable Systems for X☆77Updated this week
- Building blocks for foundation models.☆599Updated 2 years ago
- ☆31Updated 10 months ago
- ☆628Updated 3 weeks ago
- Awesome LLM compression research papers and tools.☆1,771Updated 3 months ago
- FlashInfer: Kernel Library for LLM Serving☆4,935Updated this week
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆1,117Updated 2 weeks ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆2,705Updated this week
- Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline mod…☆617Updated last year
- Disaggregated serving system for Large Language Models (LLMs).☆771Updated 10 months ago
- QuantEase, a layer-wise quantization framework, frames the problem as discrete-structured non-convex optimization. Our work leverages Coo…☆19Updated last year
- Curated collection of papers in machine learning systems☆503Updated last month
- ☆13Updated 3 months ago
- Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.☆411Updated 11 months ago
- Distributed Compiler based on Triton for Parallel Systems☆1,332Updated last week
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)☆792Updated 3 weeks ago
- Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan et al. 2023.☆99Updated last year
- ☆64Updated last month
- Perplexity GPU Kernels☆560Updated 3 months ago
- ☆15Updated 11 months ago
- LLM KV cache compression made easy☆876Updated 2 weeks ago
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems☆238Updated last week