Inference code for LLaMA models
☆41Mar 13, 2023Updated 2 years ago
Alternatives and similar repositories for llama
Users that are interested in llama are comparing it to the libraries listed below
Sorting:
- BFloat16 Fused Adam Operator for PyTorch☆16Nov 16, 2024Updated last year
- ☆11Dec 22, 2024Updated last year
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Dec 21, 2022Updated 3 years ago
- ☆21Mar 3, 2025Updated last year
- Sources for the Oak Ridge Leadership Computing Facility User Documentation☆65Updated this week
- Artifacts of EVT ASPLOS'24☆29Mar 6, 2024Updated 2 years ago
- ☆40Updated this week
- SmartThings DTHs for Sage devices☆12Nov 15, 2016Updated 9 years ago
- [WIP] Better (FP8) attention for Hopper☆32Feb 24, 2025Updated last year
- A command line utility to manage the configuration of a system's high performance network interfaces for RoCE deployments☆35Jul 25, 2023Updated 2 years ago
- Atari-DRQN (keras ver.)☆33Oct 1, 2018Updated 7 years ago
- ext_mpi_collectives☆11Apr 1, 2025Updated 11 months ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 3 months ago
- Reference CAD files for NEON integrations☆10Aug 9, 2024Updated last year
- 2D time-domain isotropic (visco)elastic FD modeling and full waveform inversion (FWI) code for SH-waves☆13Aug 9, 2020Updated 5 years ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- ☆11Feb 27, 2024Updated 2 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- ActivityWatch watcher for Hyprland☆16Jun 3, 2025Updated 9 months ago
- Performance Counter Reader☆11Sep 14, 2022Updated 3 years ago
- Blocks text, image and videos from Reddit, Youtube, WhatsApp and Facebook Android apps for given keywords. Useful to block content relate…☆10Oct 14, 2022Updated 3 years ago
- PyTorch library to accelerate super-resolution research☆11Jun 23, 2024Updated last year
- Aurora Bubble Generator TOX for TouchDesigner☆11Jan 26, 2023Updated 3 years ago
- EPOCH Input System Version 2☆10Jun 5, 2020Updated 5 years ago
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆67Feb 27, 2026Updated last week
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆24Feb 21, 2026Updated 2 weeks ago
- Orchestration middleware for Home Assistant + Ollama: enables 8-20B models to handle complex multi-intent commands through intelligent ta…☆23Feb 6, 2026Updated last month
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 6 months ago
- ☆13Sep 5, 2024Updated last year
- ☆10Feb 25, 2026Updated last week
- GPU based 2D elastic FWI☆12Mar 6, 2018Updated 8 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- ☆12Jul 10, 2023Updated 2 years ago
- How to build an ACP compliant agent that uses MCP as well!☆11May 6, 2025Updated 10 months ago
- ☆12Nov 12, 2018Updated 7 years ago
- Sentiment analysis for movie reviews☆10Jun 22, 2015Updated 10 years ago
- Face sticker effects☆11Jan 2, 2019Updated 7 years ago