Complete implementation of Llama2 with/without KV cache & inference π
β49May 24, 2024Updated 2 years ago
Alternatives and similar repositories for Meta-llama
Users that are interested in Meta-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python implementation of Avro Phoneticβ10Feb 25, 2025Updated last year
- synthetic data for mlβ25Jan 30, 2025Updated last year
- GenAI Experimentationβ59Mar 12, 2026Updated 2 months ago
- A collection of fine-tuning notebooks!β31Oct 5, 2023Updated 2 years ago
- I will implement Fastai in each projects present in this repository.β65Jul 12, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- β13Jan 7, 2022Updated 4 years ago
- triton ver of gqa flash attn, based on the tutorialβ12Aug 4, 2024Updated last year
- I am working on implementing Machine Learning Algorithms from scratch.β12Apr 12, 2021Updated 5 years ago
- Find, list, and inspect processes from Go (golang).β10Feb 4, 2018Updated 8 years ago
- β16Jul 28, 2024Updated last year
- β14May 9, 2024Updated 2 years ago
- β12May 15, 2025Updated last year
- This library provides you with an easy way to create and run Hive Agents.β19Nov 9, 2024Updated last year
- Text simplification for a better world: Deep-Martin Transformer π€β22Sep 25, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Contains the summaries and notes on a variety of DL papers/blogsβ12Jul 30, 2024Updated last year
- β14Aug 15, 2024Updated last year
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.β21Jan 24, 2025Updated last year
- All my experiments with the various transformers and various transformer frameworks availableβ14Apr 30, 2021Updated 5 years ago
- β15Jul 9, 2025Updated 10 months ago
- A crowdsourced list of shared tasksβ20Mar 1, 2024Updated 2 years ago
- LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencβ¦β729Mar 13, 2026Updated 2 months ago
- Get started in the world of data. The community of #66daysofdata collaborated to bring you a roadmap to get you started in to the varioβ¦β11Nov 24, 2020Updated 5 years ago
- Unsupervised Deep Embedding for Clustering Analysis (DEC)β28Oct 11, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Training HuggingFace models using fastaiβ11Jul 22, 2021Updated 4 years ago
- Fold/unfold markdown section. | ζε /ε±εΌ Markdown η« θγβ22Nov 24, 2025Updated 6 months ago
- LLM implementation one matrix multiplication at a timeβ13Aug 8, 2024Updated last year
- Telegram bot made with Python to get notified when visa slots are availableβ14Mar 17, 2026Updated 2 months ago
- #66DaysOfData challenge in Financial Machine Learning and NLPβ24Jun 14, 2025Updated 11 months ago
- Winning solution to the IEEE PELS MagNet challenge 2023 (1st Place Model Performance Category)β15Nov 4, 2024Updated last year
- run deepseek v3 on a single node. Drops unused experts from memory.β16Jan 26, 2025Updated last year
- β10Jul 21, 2023Updated 2 years ago
- β10Nov 6, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This is a demo code for RF pulses in 2022 ISMRMβ10May 6, 2022Updated 4 years ago
- An optimization algorithm for the design of pneumatic soft robots.β14Jul 30, 2025Updated 9 months ago
- Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuitsβ42Jan 8, 2026Updated 4 months ago
- Token-free Language Modeling with ByGPT5 & Friends!β12Jul 18, 2025Updated 10 months ago
- Repository with QPSICE models dedicated to Power Electronicsβ15Mar 20, 2024Updated 2 years ago
- RestAI's Frontendβ22Sep 4, 2025Updated 8 months ago
- Code for "Coherent Probabilistic Aggregate Queries on Long-horizon Forecasts", IJCAI 2022β18May 27, 2022Updated 4 years ago