Complete implementation of Llama2 with/without KV cache & inference π
β49May 24, 2024Updated last year
Alternatives and similar repositories for Meta-llama
Users that are interested in Meta-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Making of cuda kernelβ16May 27, 2025Updated 10 months ago
- Building GPT ...β18Dec 1, 2024Updated last year
- Table detection with Florence.β15Jul 11, 2024Updated last year
- Direct Preference Optimization Implementationβ17Feb 1, 2024Updated 2 years ago
- synthetic data for mlβ25Jan 30, 2025Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- GenAI Experimentationβ59Mar 12, 2026Updated 2 weeks ago
- β13Jan 7, 2022Updated 4 years ago
- β12Feb 12, 2024Updated 2 years ago
- β14Jul 28, 2024Updated last year
- Helm charts for a kubernetes deployment of the saleor ecommerce platformβ14Dec 31, 2020Updated 5 years ago
- β14May 9, 2024Updated last year
- A repository to showcase the upskilling of self in theoretical & applied aspects of data science during the ongoing sabbatical of 23 montβ¦β16Dec 17, 2023Updated 2 years ago
- Code, notebooks, and other material for FuturePath AI's training course on Generative AIβ12Apr 24, 2025Updated 11 months ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.β20Jan 24, 2025Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- All my experiments with the various transformers and various transformer frameworks availableβ14Apr 30, 2021Updated 4 years ago
- β15Jul 9, 2025Updated 8 months ago
- Pytorch implementation for ICLR24:"Online GNN Evaluation Under Test-Time Graph Distribution Shifts"β16Mar 23, 2024Updated 2 years ago
- LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencβ¦β724Mar 13, 2026Updated 2 weeks ago
- Very little code to make PyTorch Lightning modelsβ16Jan 17, 2024Updated 2 years ago
- β45Jan 24, 2024Updated 2 years ago
- LLM implementation one matrix multiplication at a timeβ13Aug 8, 2024Updated last year
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5β16Sep 19, 2024Updated last year
- Sound Separation, Omni modalβ28Sep 15, 2025Updated 6 months ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Winning solution to the IEEE PELS MagNet challenge 2023 (1st Place Model Performance Category)β15Nov 4, 2024Updated last year
- β10Jul 21, 2023Updated 2 years ago
- run deepseek v3 on a single node. Drops unused experts from memory.β16Jan 26, 2025Updated last year
- β10Nov 6, 2024Updated last year
- β17Mar 12, 2025Updated last year
- Token-free Language Modeling with ByGPT5 & Friends!β12Jul 18, 2025Updated 8 months ago
- MongoDB-ODM, NOSQL databases in Python, designed for simplicity, compatibility, and robustness.β21Dec 7, 2025Updated 3 months ago
- This is a clone repository on Chat GPT(open AI chatbot) Clone using React.js with Tailwind CSSβ14Apr 28, 2024Updated last year
- Zero Dependency LibTorch Safetensors Loading and Storing in C++β23Jul 12, 2024Updated last year
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"β11Sep 20, 2024Updated last year
- PyTorch extension enabling direct access to cuDNN-accelerated C++ convolution functions.β13Mar 14, 2021Updated 5 years ago
- Rust Hack and Learn in Berlin Challenge to implement a Site in as many Frameworks as possible. This implementation is : Leptos - Axum - Sβ¦β16Mar 7, 2026Updated 3 weeks ago
- Curated list of Moroccans publishing in the most prestigious AI conferencesβ11Oct 14, 2024Updated last year
- A tiny reinforcement learning codebase for continuous control, built on top of JAX.β15Mar 28, 2023Updated 3 years ago
- β19Jan 11, 2024Updated 2 years ago
- β30Mar 18, 2024Updated 2 years ago