modal-labs / mistral-finetuningLinks
☆34Updated 2 years ago
Alternatives and similar repositories for mistral-finetuning
Users that are interested in mistral-finetuning are comparing it to the libraries listed below
Sorting:
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆89Updated last month
- ☆86Updated last year
- Efficient vector database for hundred millions of embeddings.☆211Updated last year
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆30Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated 2 years ago
- ☆53Updated 11 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆66Updated last year
- Simple examples using Argilla tools to build AI☆57Updated last year
- ☆119Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆50Updated last week
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated 11 months ago
- Develop, evaluate and monitor LLM applications at scale☆98Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- ☆37Updated 2 years ago
- Score LLM pretraining data with classifiers☆55Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated 2 years ago
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Updated 2 years ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated last year
- ☆45Updated 2 years ago
- ☆30Updated last year
- ☆56Updated 6 months ago
- ☆33Updated 2 years ago
- ☆106Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs☆73Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 3 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆107Updated 3 months ago