sandlogic / SandLogic-Lexicons
SandLogic Lexicons
β16Updated 3 months ago
Alternatives and similar repositories for SandLogic-Lexicons:
Users that are interested in SandLogic-Lexicons are comparing it to the libraries listed below
- π€ Trade any tensors over the networkβ30Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated last month
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.β34Updated 8 months ago
- Mixed precision training from scratch with Tensors and CUDAβ21Updated 8 months ago
- Article about deploying machine learning models using grpc, pytorch and asyncioβ27Updated 2 years ago
- Complete implementation of Llama2 with/without KV cache & inference πβ47Updated 7 months ago
- Notes on quantization in neural networksβ63Updated last year
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultinβ¦β23Updated last year
- End-to-End LLM Guideβ99Updated 6 months ago
- Fast low-bit matmul kernels in Tritonβ187Updated last week
- Nsight Systems in Dockerβ19Updated last year
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.β78Updated this week
- Fast sparse deep learning on CPUsβ51Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challengeβ54Updated 9 months ago
- Triton implementation of GPT/LLAMAβ16Updated 4 months ago
- ML/DL Math and Method notesβ57Updated last year
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterizationβ100Updated 3 months ago
- The backend behind the LLM-Perf Leaderboardβ10Updated 8 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUsβ37Updated 2 months ago
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://devβ¦β56Updated this week
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transforβ¦β57Updated this week
- An SDK for Transformers + YOLO and other SSD family modelsβ55Updated 3 weeks ago
- zero-to-lightningβ28Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 6 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundryβ40Updated last year
- Build Agentic workflows with function callingβ26Updated this week
- A collection of hand on notebook for LLMs practitionerβ40Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMsβ88Updated this week
- Quantization of LLMs and benchmarking.β10Updated 9 months ago