modal-labs / mistral-finetuningLinks
☆31Updated last year
Alternatives and similar repositories for mistral-finetuning
Users that are interested in mistral-finetuning are comparing it to the libraries listed below
Sorting:
- Using modal.com to process FineWeb-edu data☆20Updated last month
- ☆31Updated last year
- ☆32Updated last year
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 5 months ago
- ☆43Updated 3 months ago
- ☆37Updated 2 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆35Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- Chrome Extension for YouTube. Acts as an assistant for the YouTube video you are watching☆23Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops!☆48Updated last year
- Cog wrapper for collabora/WhisperSpeech☆24Updated last year
- ☆33Updated 2 years ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆40Updated last year
- ☆57Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 6 months ago
- ☆30Updated 10 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆72Updated last week
- utilities for loading and running text embeddings with onnx☆44Updated 9 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆29Updated last year
- Apps that run on modal.com☆12Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- ☆24Updated 4 months ago
- a version of baby agi using dspy and typed predictors☆17Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆51Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 7 months ago
- ☆48Updated last year