RahulSChand / llama2.c-for-dummies
Step by step explanation/tutorial of llama2.c
☆210Updated last year
Related projects ⓘ
Alternatives and complementary repositories for llama2.c-for-dummies
- llama3.cuda is a pure C/CUDA implementation for Llama 3 model.☆309Updated 5 months ago
- Easy and Efficient Quantization for Transformers☆180Updated 4 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆115Updated 10 months ago
- Newsletter bot for 🤗 Daily Papers☆107Updated this week
- OSLO: Open Source for Large-scale Optimization☆174Updated last year
- Extension of Langchain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on...☆279Updated 10 months ago
- evolve llm training instruction, from english instruction to any language.☆113Updated last year
- Efficient fine-tuning for ko-llm models☆184Updated 8 months ago
- ☆193Updated this week
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆45Updated 8 months ago
- 1-Click is all you need.☆59Updated 6 months ago
- ONNX Runtime Server: The ONNX Runtime Server is a server that provides TCP and HTTP/HTTPS REST APIs for ONNX inference.☆127Updated 2 weeks ago
- batched loras☆336Updated last year
- The Universe of Evaluation. All about the evaluation for LLMs.☆219Updated 4 months ago
- 42dot LLM consists of a pre-trained language model, 42dot LLM-PLM, and a fine-tuned model, 42dot LLM-SFT, which is trained to respond to …☆122Updated 8 months ago
- QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference☆112Updated 8 months ago
- Inference Llama/Llama2 Modes in NumPy☆20Updated last year
- A performance library for machine learning applications.☆180Updated last year
- Large-scale language modeling tutorials with PyTorch☆287Updated 3 years ago
- Tune MPTs☆84Updated last year
- Korean SAT leader board☆151Updated this week
- A bagel, with everything.☆312Updated 7 months ago
- ☆26Updated last year
- Python Project Template☆67Updated 2 years ago
- showing various ways to serve Keras based stable diffusion☆109Updated last year
- [Google Meet] MLLM Arxiv Casual Talk☆55Updated last year
- Inference of Mamba models in pure C☆178Updated 8 months ago
- Comparison of Language Model Inference Engines☆190Updated 2 months ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆37Updated 3 weeks ago