sahibpreetsingh12 / llm-learning

☆14

Alternatives and similar repositories for llm-learning

Users who are interested in llm-learning are comparing it to the repositories listed below.

docugami / DFM-benchmarks
Benchmarks for Business Document Foundation Models
☆10 Updated last year
shahrukhx01 / bert-probe
BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…
☆18 Updated 3 years ago
mesolitica / multimodal-LLM
Multi-Modal Language Modeling with Image, Audio and Text Integration, including multiple images and multiple audio clips in a single multi-turn conversation.
☆18 Updated last year
IBM / model-recycling
Ranking of fine-tuned HF models as base models.
☆35 Updated 3 months ago
v-prgmr / mergekit
Tools for merging pretrained large language models.
☆19 Updated last year
kumar-shridhar / Screws
SCREWS: A Modular Framework for Reasoning with Revisions
☆27 Updated last year
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33 Updated 2 months ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆39 Updated 2 years ago
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆64 Updated last year
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75 Updated 9 months ago
nbroad1881 / strideformer
Using short models to classify long texts
☆21 Updated 2 years ago
Knowledgator / FlashDeBERTa
Truly flash implementation of the DeBERTa disentangled attention mechanism.
☆62 Updated 2 months ago
darrow-labs / LegalLens
☆8 Updated last year
padas-lab-de / ir-rag-sigir24-persona-rag
☆47 Updated 10 months ago
kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆24 Updated 2 weeks ago
krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆19 Updated last year
Vedant-S / MLOps-Project
Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.
☆12 Updated 4 years ago
sanyalsunny111 / Early_Weight_Avg
[COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training
☆17 Updated 9 months ago
NielsRogge / awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
☆47 Updated last year
stanfordnlp / huggingface-models
Scripts for pushing models to Hugging Face repos
☆13 Updated 7 months ago
bhavsarpratik / semantic-search
[WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…
☆15 Updated 2 years ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆33 Updated last year
Hemanthkumar2112 / Reward-Modeling-RLHF-Finetune-and-RAG
Gemma2 (9B), Llama3-8B-Finetune-and-RAG: sample code base, implemented on the Kaggle platform
☆22 Updated 5 months ago
sayakpaul / CI-CD-for-Model-Training
This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.
☆21 Updated 3 years ago
cleanlab / multiannotator-benchmarks
Benchmarking algorithms for assessing quality of data labeled by multiple annotators
☆32 Updated 2 years ago
stanfordnlp / multi-distribution-retrieval
Code for our paper "Resources and Evaluations for Multi-Distribution Dense Information Retrieval"
☆15 Updated last year
lucy3 / whos_filtered
☆14 Updated 10 months ago
sayakpaul / GCP-ML-API-Demos
Contains Colab Notebooks that show cool use-cases of different GCP ML APIs.
☆10 Updated 4 years ago
philschmid / optimum-static-quantization
☆28 Updated 2 years ago
ashishpatel26 / Image-Search-Engine
Image Search Engine with HuggingFace Sentence Transformer
☆12 Updated last year