Inference code for LLaMA models
☆21Apr 3, 2025Updated last year
Alternatives and similar repositories for llama
Users who are interested in llama are comparing it to the libraries listed below.
- Pytorch/XLA SPMD Test code in Google TPU☆23Apr 3, 2024Updated 2 years ago
- TPU support for the fastai library☆14Apr 15, 2021Updated 5 years ago
- A basic Docker-based installation of TVM☆11Jun 23, 2022Updated 3 years ago
- Slides from 2021-12-15 talk, "TVM Developer Bootcamp – Writing Hardware Backends"☆11Jan 20, 2022Updated 4 years ago
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture☆27Feb 3, 2026Updated 4 months ago
- Visualize TVM Relay program graph☆12Nov 19, 2019Updated 6 years ago
- ☆18Jan 9, 2018Updated 8 years ago
- Implementation of the Adaptive Resonance Theory (ART) architectures - Fuzzy ART and Fuzzy ARTMAP - for pattern recognition☆11Jan 6, 2019Updated 7 years ago
- A C compiler with SSA-based backend optimization☆15Mar 19, 2016Updated 10 years ago
- ☆10Jul 28, 2021Updated 4 years ago
- A hackable, simple, and research-friendly GRPO Training Framework with high-speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 9 months ago
- My own answers to the exercises of a deep learning course☆10Apr 23, 2018Updated 8 years ago
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- Source code for the AI2 Reasoning Challenge (ARC) submission.☆16Dec 8, 2022Updated 3 years ago
- ☆12Jan 9, 2019Updated 7 years ago
- converts Vertex AI API to OpenAI API format.☆12Oct 23, 2024Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated 2 years ago
- ☆14Apr 16, 2021Updated 5 years ago
- Hugo SEO Module☆10Jan 7, 2026Updated 5 months ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 3 years ago
- Training tiny models to prove hard theorems☆77Mar 5, 2026Updated 3 months ago
- ☆33Apr 19, 2025Updated last year
- Template Filling with Generative Transformers☆22Jun 8, 2021Updated 5 years ago
- renders a canvas spritesheet for use with pixi.js☆11Dec 4, 2022Updated 3 years ago
- Test using WebWorkers to run D3 geo projection☆10Jul 2, 2018Updated 7 years ago
- User modeling for sarcasm detection on Reddit corpus from Khodak et al. Published in EMNLP 2018.☆11Aug 25, 2018Updated 7 years ago
- ☆12Dec 30, 2020Updated 5 years ago
- A Pytorch implementation of Collaborative Metric Learning (CML)☆11Oct 13, 2020Updated 5 years ago
- Ion Range Slider for AngularJS https://github.com/IonDen/ion.rangeSlider☆12Aug 24, 2018Updated 7 years ago
- My defold template☆11Jul 6, 2025Updated 11 months ago
- A port of opensteer as a native extension for Defold.☆11May 9, 2024Updated 2 years ago
- RTS sample project for the Defold game engine☆15May 29, 2026Updated last week
- ☆10Sep 26, 2023Updated 2 years ago
- A quick and dirty script for creating complete Godot TileSets without hours of endless clicking.☆10Feb 14, 2023Updated 3 years ago
- An icheck directive like jQuery iCheck for AngularJS☆11Jul 25, 2018Updated 7 years ago
- A collection of reproducible inference engine benchmarks☆38Apr 22, 2025Updated last year
- Code for SegTree Transformer (ICLR-RLGM 2019).☆27Nov 12, 2019Updated 6 years ago
- A simple Hubot adapter for gitter.im☆24Jun 21, 2017Updated 8 years ago
- A repackaging of freetype to be easily embedded in Android NDK applications☆18Feb 21, 2011Updated 15 years ago