Inference code for LLaMA models
☆21Apr 3, 2025Updated last year
Alternatives and similar repositories for llama
Users that are interested in llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch/XLA SPMD Test code in Google TPU☆23Apr 3, 2024Updated 2 years ago
- ☆35Nov 11, 2025Updated 5 months ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- This repository contains example code to build models on TPUs☆30Feb 17, 2023Updated 3 years ago
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture☆26Feb 3, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Convert MathML to Latex for OneNote to Markdown☆13Mar 17, 2026Updated last month
- ☆18Jan 9, 2018Updated 8 years ago
- The elegant integration of huggingface/nlp and fastai2 and handy transforms using pure huggingface/nlp☆19Oct 6, 2020Updated 5 years ago
- 深度学习课程自己所做答案☆10Apr 23, 2018Updated 8 years ago
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- ☆12Jan 9, 2019Updated 7 years ago
- [Re-implementation] FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence☆15Jun 29, 2020Updated 5 years ago
- converts Vertex AI API to OpenAI API format.☆12Oct 23, 2024Updated last year
- Joint angle comparison of mediapipe prediction results bvh conversion with ground truth bvh☆11Apr 1, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Jun 22, 2016Updated 9 years ago
- ☆126Updated this week
- Hugo SEO Module☆10Jan 7, 2026Updated 3 months ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 3 years ago
- Training tiny models to prove hard theorems☆77Mar 5, 2026Updated last month
- ☆33Apr 19, 2025Updated last year
- Declarative DOM building☆14Feb 19, 2025Updated last year
- renders a canvas spritesheet for use with pixi.js☆11Dec 4, 2022Updated 3 years ago
- Test using WebWorkers to run D3 geo projection☆10Jul 2, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- User modeling for sarcasm detection on Reddit corpus from Khodak et al. Published in EMNLP 2018.☆11Aug 25, 2018Updated 7 years ago
- Ion Range Slider for AngularJS https://github.com/IonDen/ion.rangeSlider☆12Aug 24, 2018Updated 7 years ago
- My defold template☆11Jul 6, 2025Updated 9 months ago
- A port of opensteer as a native extension for Defold.☆11May 9, 2024Updated last year
- RTS sample project for the Defold game engine☆15Feb 3, 2024Updated 2 years ago
- A quick and dirty script for creating complete Godot TileSets without hours of endless clicking.☆10Feb 14, 2023Updated 3 years ago
- a icheck directive like jQuery iCheck for angularjs☆11Jul 25, 2018Updated 7 years ago
- A collection of reproducible inference engine benchmarks☆38Apr 22, 2025Updated last year
- Code for SegTree Transformer (ICLR-RLGM 2019).☆27Nov 12, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A repackaging of freetype to be easily embedded in Android NDK applications☆18Feb 21, 2011Updated 15 years ago
- An implementation of Support Vector Guided Softmax Loss for Face Recognition☆22Apr 9, 2019Updated 7 years ago
- This is a game AI demo implemented using neural networks in Godot 3.5☆11Jan 28, 2023Updated 3 years ago
- Code for the WWW'22 paper "MCL: Mixed-Centric Loss for Collaborative Filtering"☆12Feb 4, 2022Updated 4 years ago
- ☆15Aug 16, 2019Updated 6 years ago
- Interact with remote git checkouts using Fork, and more!☆13Oct 22, 2024Updated last year
- A python implementation of the neural network joint language model and an extension of it using global source context.☆11May 17, 2017Updated 8 years ago