modal-labs / mistral-finetuningLinks

☆32

Alternatives and similar repositories for mistral-finetuning

Users that are interested in mistral-finetuning are comparing it to the libraries listed below

Sorting:

axeld5 / pali_reason
Testing paligemma2 finetuning on reasoning dataset
☆18Updated 6 months ago
rosewang2008 / backtracing
Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.
☆89Updated 11 months ago
huggingface / huggingface-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
☆82Updated this week
geronimi73 / phi2-finetune
☆87Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆118Updated last year
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆49Updated last year
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆48Updated 5 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆54Updated 5 months ago
zozoheir / tinyllm
Develop, evaluate and monitor LLM applications at scale
☆100Updated 7 months ago
neoxelox / dspy-inspector
DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.
☆36Updated last year
Technoculture / personal-graph
Simple Graph Memory for AI applications
☆87Updated last month
S1M0N38 / dspy-arxiv
Explore the use of DSPy for extracting features from PDFs 🔎
☆43Updated last year
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆62Updated 10 months ago
shoggoth13 / agents-deconstructed
☆57Updated last year
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆108Updated 3 months ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
miralab-ai / autoreason
☆40Updated 7 months ago
teknium1 / transformers-gptq-quant
☆47Updated last year
JeezAI / DSPy_matchmaking
A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…
☆59Updated last year
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 3 months ago
fsndzomga / baby_agi_dspy
a version of baby agi using dspy and typed predictors
☆17Updated last year
shivamsanju / ragswift
🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform
☆38Updated last year
hwchase17 / chain-of-verification
☆32Updated last year
reactorsh / ambrosia
clean up your LLM datasets
☆115Updated 2 years ago
matthelmer / DSPy-examples
Example code using the DSPy framework.
☆18Updated last year
jmanhype / dspy-self-discover-framework
Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…
☆62Updated 11 months ago
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 7 months ago
AlexBodner / How_Much_VRAM
☆101Updated 10 months ago
n4ze3m / vexasearch
A function to do all
☆36Updated last year
nateraw / replicate-examples
☆74Updated last year