vedaldi/micro_llama

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vedaldi/micro_llama)

vedaldi / micro_llama

A tiny, didactical implementation of LLAMA 3

☆42

Alternatives and similar repositories for micro_llama

Users that are interested in micro_llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ttumiel / minRLHF
View on GitHub
Minimal RLHF implementation built on top of minGPT.
☆32Jul 4, 2024Updated 2 years ago
DingfanChen / Private-Set
View on GitHub
Official implementation of "Private Set Generation with Discriminative Information" (NeurIPS 2022)
☆18Aug 14, 2023Updated 2 years ago
richjjj / cuvid-tensorrt-multi
View on GitHub
ffmpeg+cuvid+tensorrt+multicamera
☆12Dec 31, 2024Updated last year
chrirupp / cv_course
View on GitHub
☆17Jan 6, 2026Updated 6 months ago
triple-mu / Stable-Diffusion-TensorRT
View on GitHub
Stable Diffusion in TensorRT 8.5+
☆15Mar 19, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
facebookresearch / dmae_st
View on GitHub
Directed masked autoencoders
☆14Mar 25, 2026Updated 4 months ago
triple-mu / HunyuanDiT-TensorRT-libtorch
View on GitHub
HunyuanDiT with TensorRT and libtorch
☆18May 22, 2024Updated 2 years ago
hui-po-wang / ProgFed
View on GitHub
[ICML2022] ProgFed: Effective, Communication, and Computation Efficient Federated Learning by Progressive Training
☆23Oct 17, 2022Updated 3 years ago
AXERA-TECH / SAM-ONNX-AX650-CPP
View on GitHub
☆18Dec 7, 2023Updated 2 years ago
doem97 / metalora
View on GitHub
[CVPR 2025 Highlight] Meta LoRA / MetaPEFT: Meta-Learning Hyperparameters for Parameter-Efficient Fine-Tuning (LoRA, Adapter, Prompt Tuni…
☆18Mar 4, 2026Updated 4 months ago
Jyxarthur / appear-refine
View on GitHub
[ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew …
☆13Oct 23, 2024Updated last year
ibaiGorordo / onnx-perf-test
View on GitHub
A simple Python tool to measure the performance of ONNX models.
☆27Sep 15, 2024Updated last year
moboehle / CoDA-Nets
View on GitHub
Official implementation for the CVPR paper "Convolutional Dynamic Alignment Networks for Interpretable Classifications"
☆30Aug 17, 2023Updated 2 years ago
chaoql / CCF-AIOps-Code
View on GitHub
2024CCF国际AIOps挑战赛-赛道二（GLM4）：基于检索增强的运维知识问答挑战赛解决方案分享。
☆14Jul 5, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yadongJiang / trt_bisenetv1-2
View on GitHub
TensorRT实现BiSeNetV1与BiSeNetV2部署
☆20Apr 14, 2022Updated 4 years ago
Kingsley-Cheng / UCAS
View on GitHub
Courses in UCAS
☆14Jun 12, 2023Updated 3 years ago
yuxiaoranyu / stable_diffusion_trt_triton
View on GitHub
☆20Dec 29, 2023Updated 2 years ago
michimoeller / liftingLayers
View on GitHub
☆12Dec 8, 2022Updated 3 years ago
ironartisan / awesome-compression1
View on GitHub
模型压缩的小白入门教程
☆22Jul 7, 2024Updated 2 years ago
hui-po-wang / hijackgan
View on GitHub
[CVPR 2021] Pytorch implementation of Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs
☆48Jun 13, 2021Updated 5 years ago
seharanul17 / synthetic-tabular-LLM
View on GitHub
☆16Dec 3, 2024Updated last year
AlexBodner / How_Much_VRAM
View on GitHub
☆101Aug 30, 2024Updated last year
Ginjing-Yuan / QWen2-from_ground_up
View on GitHub
☆22Jul 15, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
google-deepmind / codesembench
View on GitHub
☆16Mar 22, 2024Updated 2 years ago
hopef / llama3_chat
View on GitHub
Llama3 Streaming Chat Sample
☆22Apr 24, 2024Updated 2 years ago
MingyuanXu / Tree-Invent
View on GitHub
Tree-Invent: A novel molecular generative model constrained with topological tree
☆14Jul 26, 2023Updated 3 years ago
yangxuntu / catt
View on GitHub
☆12Mar 8, 2021Updated 5 years ago
naver-ai / coco-annotation-tool
View on GitHub
☆19Jul 24, 2023Updated 3 years ago
daniel89710 / lightNet
View on GitHub
LightNet is an optimized deep learning framework based on the popular darknet platform. It is optimized to create efficient and high-spee…
☆37Sep 17, 2023Updated 2 years ago
triple-mu / TensorRT2ONNX
View on GitHub
A tool convert TensorRT engine/plan to a fake onnx
☆41Nov 22, 2022Updated 3 years ago
snu-mllab / Efficient-Dataset-Condensation
View on GitHub
Official PyTorch implementation of "Dataset Condensation via Efficient Synthetic-Data Parameterization" (ICML'22)
☆115Oct 18, 2023Updated 2 years ago
raymond1123 / hgemm
View on GitHub
☆30Nov 16, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
nicklashansen / adaptive-learning-rate-schedule
View on GitHub
PyTorch implementation of the "Learning an Adaptive Learning Rate Schedule" paper found here: https://arxiv.org/abs/1909.09712.
☆12Jan 15, 2020Updated 6 years ago
yuleiniu / introd
View on GitHub
[NeurIPS 2021] Introspective Distillation for Robust Question Answering
☆13Dec 7, 2021Updated 4 years ago
LAION-AI / laion50BU
View on GitHub
Un-*** 50 billions multimodality dataset
☆24Sep 14, 2022Updated 3 years ago
globaledgesoft / Unsupported-Operation-Development-in-SNPE
View on GitHub
This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…
☆10Oct 4, 2021Updated 4 years ago
thuml / ForkMerge
View on GitHub
Code release of paper "ForkMerge: Mitigating Negative Transfer in Auxiliary-Task Learning" (NeurIPS 2023)
☆17Dec 30, 2023Updated 2 years ago
mint-deeplearning / mosse_tracker
View on GitHub
mosse_tracker
☆13Feb 18, 2020Updated 6 years ago
DataXujing / YOLOv12-TensorRT
View on GitHub
YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现
☆14Mar 5, 2025Updated last year