Complete implementation of Llama2 with/without KV cache & inference π
β49May 24, 2024Updated last year
Alternatives and similar repositories for Meta-llama
Users that are interested in Meta-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Making of cuda kernelβ16May 27, 2025Updated 10 months ago
- Building GPT ...β18Dec 1, 2024Updated last year
- The repository will contain a list of projects which we will work on while reading the books of Natural Language Processing & Transformerβ¦β73Nov 12, 2023Updated 2 years ago
- Table detection with Florence.β15Jul 11, 2024Updated last year
- Direct Preference Optimization Implementationβ17Feb 1, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- GenAI Experimentationβ59Mar 12, 2026Updated last month
- A collection of fine-tuning notebooks!β31Oct 5, 2023Updated 2 years ago
- A curated list of research in Nepali Natural Language Processingβ96Feb 10, 2023Updated 3 years ago
- I will implement Fastai in each projects present in this repository.β65Jul 12, 2023Updated 2 years ago
- β11Aug 21, 2023Updated 2 years ago
- A simple Google AppEngine tool for monitoring a small number of webservers and notifying you of downtime using email or Prowl push notifiβ¦β44Jun 1, 2012Updated 13 years ago
- Frappe client written in Goβ13Sep 24, 2020Updated 5 years ago
- A portal all about competitive programming and problem solvingβ12Nov 1, 2019Updated 6 years ago
- Text simplification for a better world: Deep-Martin Transformer π€β22Sep 25, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Contains the summaries and notes on a variety of DL papers/blogsβ12Jul 30, 2024Updated last year
- β14Aug 15, 2024Updated last year
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.β21Jan 24, 2025Updated last year
- Code, notebooks, and other material for FuturePath AI's training course on Generative AIβ12Apr 24, 2025Updated 11 months ago
- All my experiments with the various transformers and various transformer frameworks availableβ14Apr 30, 2021Updated 4 years ago
- β15Jul 9, 2025Updated 9 months ago
- A crowdsourced list of shared tasksβ20Mar 1, 2024Updated 2 years ago
- Pytorch implementation for ICLR24:"Online GNN Evaluation Under Test-Time Graph Distribution Shifts"β16Mar 23, 2024Updated 2 years ago
- LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencβ¦β727Mar 13, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β46Jan 24, 2024Updated 2 years ago
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5β16Sep 19, 2024Updated last year
- #66DaysOfData challenge in Financial Machine Learning and NLPβ24Jun 14, 2025Updated 10 months ago
- Personal Website Dr. Juan Camilo Orduzβ17Updated this week
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β47Sep 5, 2024Updated last year
- β10Jul 21, 2023Updated 2 years ago
- run deepseek v3 on a single node. Drops unused experts from memory.β16Jan 26, 2025Updated last year
- β10Nov 6, 2024Updated last year
- β17Mar 12, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- High Performance Int8 GEMM Kernels for SM80 and later GPUs.β22Mar 11, 2025Updated last year
- Token-free Language Modeling with ByGPT5 & Friends!β12Jul 18, 2025Updated 9 months ago
- Repository with QPSICE models dedicated to Power Electronicsβ13Mar 20, 2024Updated 2 years ago
- A complete Retrieval-Augmented Generation (RAG) application that demonstrates modern AI capabilities for answering questions about Ultimaβ¦β49Oct 24, 2025Updated 5 months ago
- AI Based mock interviews for preparing for tech jobsβ54Apr 8, 2026Updated last week
- β10Nov 23, 2020Updated 5 years ago
- Arabic nested named entity recognitionβ46Mar 10, 2025Updated last year