Inference code for LLaMA models
☆21Apr 3, 2025Updated last year
Alternatives and similar repositories for llama
Users that are interested in llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆35Nov 11, 2025Updated 7 months ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Aug 19, 2023Updated 2 years ago
- This repository contains example code to build models on TPUs☆30Feb 17, 2023Updated 3 years ago
- Slides from 2021-12-15 talk, "TVM Developer Bootcamp – Writing Hardware Backends"☆11Jan 20, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture☆27Feb 3, 2026Updated 4 months ago
- minimal diffusion transformer in pytorch.☆17Oct 6, 2024Updated last year
- Visualize TVM Relay program graph☆12Nov 19, 2019Updated 6 years ago
- torchprime is a reference model implementation for PyTorch on TPU.☆48Mar 3, 2026Updated 3 months ago
- PyTorch code for ROLL, a knowledge-based video story question answering model.☆21Sep 29, 2020Updated 5 years ago
- The elegant integration of huggingface/nlp and fastai2 and handy transforms using pure huggingface/nlp☆19Oct 6, 2020Updated 5 years ago
- A C compiler with SSA-based backend optimzation☆15Mar 19, 2016Updated 10 years ago
- Notes of ADRL course taught at IISC as part of MTech AI curriculum☆15Nov 30, 2024Updated last year
- Code for Scene Graph Generation with External Knowledge and Image Reconstruction☆25Dec 1, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆38Aug 27, 2025Updated 10 months ago
- Notes of PRNN course taught at IISC as part of MTech AI curriculum☆19Nov 30, 2024Updated last year
- PyTorch centric eager mode debugger☆48Dec 16, 2024Updated last year
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- Source code for the AI2 Reasoning Challenge (ARC) submission.☆16Dec 8, 2022Updated 3 years ago
- ☆12Jan 9, 2019Updated 7 years ago
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated 2 years ago
- for practicing FastAPI, with me, others are join this project☆12Mar 1, 2022Updated 4 years ago
- ☆14Apr 16, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Test Collection of Computer Science Papers for Faceted Query by Example☆23Nov 28, 2021Updated 4 years ago
- ☆11Jun 22, 2016Updated 10 years ago
- Hugo SEO Module☆10Jan 7, 2026Updated 5 months ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 3 years ago
- Training tiny models to prove hard theorems☆78Mar 5, 2026Updated 3 months ago
- ☆33Apr 19, 2025Updated last year
- A plugin for godot to use FontAwesome's icons in your game.☆10Jan 4, 2020Updated 6 years ago
- Template Filling with Generative Transformers☆22Jun 8, 2021Updated 5 years ago
- Declarative DOM building☆14Feb 19, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- renders a canvas spritesheet for use with pixi.js☆11Dec 4, 2022Updated 3 years ago
- Test using WebWorkers to run D3 geo projection☆10Jul 2, 2018Updated 7 years ago
- ☆12Dec 30, 2020Updated 5 years ago
- A Pytorch implementation of Collaborative Metric Learning (CML)☆11Oct 13, 2020Updated 5 years ago
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆64Jun 18, 2026Updated last week
- Ion Range Slider for AngularJS https://github.com/IonDen/ion.rangeSlider☆12Aug 24, 2018Updated 7 years ago
- A port of opensteer as a native extension for Defold.☆11May 9, 2024Updated 2 years ago