☆19May 11, 2024Updated last year
Alternatives and similar repositories for LargeScale
Users that are interested in LargeScale are comparing it to the libraries listed below.
- ☆23Oct 17, 2024Updated last year
- A small repository demonstrating the use of Webdataset and Imagenet☆17Dec 19, 2023Updated 2 years ago
- ☆14Mar 26, 2020Updated 6 years ago
- Streaming Vocos☆30Jun 10, 2025Updated 10 months ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated 11 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Jul 20, 2023Updated 2 years ago
- ☆11Dec 26, 2025Updated 3 months ago
- ☆13May 12, 2025Updated 10 months ago
- ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents, NeurIPS 2025☆35Nov 15, 2025Updated 4 months ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- ☆11Nov 21, 2024Updated last year
- PyTorch implementation of Funnel Activation (FReLU)☆16Aug 16, 2020Updated 5 years ago
- Toolchain built around the Megatron-LM for Distributed Training☆92Mar 23, 2026Updated 2 weeks ago
- ☆11Sep 18, 2020Updated 5 years ago
- Directed masked autoencoders☆14Mar 25, 2026Updated 2 weeks ago
- Automated bottleneck detection and solution orchestration☆20Feb 24, 2026Updated last month
- Translate Python and JavaScript into MLIR☆17Aug 27, 2022Updated 3 years ago
- Artificial Intelligence project☆10Mar 11, 2016Updated 10 years ago
- Towards Real-Time Multi-Object Tracking☆29May 18, 2021Updated 4 years ago
- Multi-span Style Extraction for Generative Reading Comprehension☆10Apr 2, 2021Updated 5 years ago
- Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention☆53Updated this week
- ☆36Jul 18, 2023Updated 2 years ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- ☆17May 31, 2023Updated 2 years ago
- ☆10Jun 22, 2020Updated 5 years ago
- Self-reproduction code for the paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention" (MIT CSAIL)☆17May 24, 2024Updated last year
- In this repository I will be running various experiments on fine-tuning different parts of XTTS☆15Jun 22, 2024Updated last year
- Repository for Transfer Learning using Deep CNNs trained with synthetic images☆16Jun 21, 2017Updated 8 years ago
- ☆38Aug 7, 2025Updated 8 months ago
- Paper reading: Jamba — Hybrid Transformer-Mamba LM (SSM → S4 → S6 → Jamba)☆15May 22, 2024Updated last year
- ☆24Apr 29, 2025Updated 11 months ago
- ☆14Feb 18, 2024Updated 2 years ago
- A CNN-based video action classification tutorial.☆11Aug 17, 2020Updated 5 years ago
- ☆14Mar 13, 2026Updated 3 weeks ago
- A collection of written-test and interview resources for NLP-related positions☆16Jun 17, 2021Updated 4 years ago
- Sample solution to build a deployment pipeline for Amazon SageMaker.☆13Jul 18, 2022Updated 3 years ago
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago
- Leveraging Ontological Schema Information in Embedding Models for Knowledge Graphs☆14Jun 16, 2015Updated 10 years ago
- My Detectron2 Packages☆11Nov 21, 2025Updated 4 months ago