APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding
☆14Jul 22, 2024Updated last year
Alternatives and similar repositories for APAR
Users who are interested in APAR are comparing it to the libraries listed below.
- A Wordle game written in Rust, refined. Play in browser with the power of WebAssembly! Course project of Programming Training, Tsinghua U…☆17Jul 10, 2024Updated last year
- ☆10Mar 3, 2026Updated 3 weeks ago
- >>> Exceptions and interrupts + virtual memory page tables + branch prediction + TLB + Cache + Flash + VGA + uCore☆20Nov 17, 2023Updated 2 years ago
- A simple gitlab/github web hooks daemon☆16Feb 6, 2026Updated last month
- ☆12Dec 16, 2025Updated 3 months ago
- Some material for THUCS courses.☆51Jul 4, 2022Updated 3 years ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated 2 months ago
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆32Mar 10, 2026Updated 2 weeks ago
- PyTorch unofficial implementation of "PoE-GAN: Multimodal Conditional Image Synthesis with Product-of-Experts GANs"☆14Mar 29, 2023Updated 2 years ago
- Package for deploying deep learning models from TAO Toolkit☆24Updated this week
- ☆21Apr 9, 2024Updated last year
- Agent-WebVoyager autonomously navigates the web like a human, performing tasks without specific APIs. It uses visual cues and intelligent…☆14Feb 13, 2024Updated 2 years ago
- a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages☆11Feb 9, 2025Updated last year
- [NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning☆16Sep 20, 2025Updated 6 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Apr 2, 2024Updated last year
- This is the official PyTorch implementation of ASAG (ICCV 2023).☆18Sep 9, 2023Updated 2 years ago
- An experimental modular OS written in Rust.☆17Feb 11, 2025Updated last year
- ☆17Apr 11, 2025Updated 11 months ago
- PyTorch code for JSTSP2021 paper "Accurate and Lightweight Image Super-Resolution with Model-Guided Deep Unfolding Network"☆12Nov 21, 2020Updated 5 years ago
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Feb 12, 2025Updated last year
- An RGB to Spectrum Conversion for Reflectances - Smits (1999)☆13Jan 26, 2020Updated 6 years ago
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- A curated list of recent efficient video generation methods.☆61Oct 7, 2025Updated 5 months ago
- Openreviewers: Multi Agent Academic Review Simulation System☆23Mar 2, 2024Updated 2 years ago
- Receiver operating characteristic curve (ROC) computation code in C++☆11Jul 17, 2017Updated 8 years ago
- Sentence perplexity computation based on a Chinese GPT2 pre-trained model☆15Apr 20, 2023Updated 2 years ago
- Libp2p bindings for Python☆12Jan 26, 2026Updated 2 months ago
- Chinese–English Stopword List (3,076 entries, including special symbols)☆22Jan 7, 2026Updated 2 months ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- ☆21Mar 18, 2026Updated last week
- Principal Feature Visualization for convolutional neural networks☆11Jan 28, 2021Updated 5 years ago
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆18May 21, 2025Updated 10 months ago
- ☆15Sep 15, 2022Updated 3 years ago
- Activation-aware Singular Value Decomposition for Compressing Large Language Models☆91Oct 22, 2024Updated last year
- ☆13Feb 11, 2019Updated 7 years ago
- An efficient CNN for spectral reconstruction from RGB images☆13May 10, 2018Updated 7 years ago
- UCT tree search + supervised learning for Atari games☆12Feb 14, 2017Updated 9 years ago
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"☆11May 9, 2023Updated 2 years ago
- An implementation of the Sequence to Sequence model using the Lasagne library (WIP)☆12Aug 11, 2016Updated 9 years ago