uygarkurt / BERT-PyTorch
☆16Updated 3 weeks ago
Alternatives and similar repositories for BERT-PyTorch:
Users that are interested in BERT-PyTorch are comparing it to the libraries listed below
- ☆15Updated 11 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆37Updated 3 months ago
- Experimenting with small language models☆59Updated last year
- Prune transformer layers☆67Updated 8 months ago
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆42Updated 4 months ago
- ☆66Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated last month
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆55Updated last month
- GGUF Quantization of any LLM.☆35Updated 10 months ago
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.☆68Updated last year
- ☆48Updated 2 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆34Updated 9 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 8 months ago
- Set of scripts to finetune LLMs☆36Updated 10 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆50Updated 9 months ago
- My Gen AI research☆11Updated 7 months ago
- ☆24Updated last year
- ☆18Updated 10 months ago
- Implementation of BitNet-1.58 instruct tuning☆18Updated 9 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆53Updated this week
- ☆108Updated 5 months ago
- Collection of autoregressive model implementation☆78Updated 3 weeks ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆129Updated 8 months ago
- The application of multimodal RAG for Sustainable finance☆17Updated 6 months ago
- Code for NeurIPS LLM Efficiency Challenge☆54Updated 9 months ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆63Updated 2 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year