unit8co / mistral-hackathon-finetuningLinks
☆20Updated 9 months ago
Alternatives and similar repositories for mistral-hackathon-finetuning
Users that are interested in mistral-hackathon-finetuning are comparing it to the libraries listed below
Sorting:
- ☆47Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- ☆86Updated 9 months ago
- LLM finetuned for generating symbolic music☆41Updated 9 months ago
- inference code for mixtral-8x7b-32kseqlen☆100Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 11 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated 3 weeks ago
- Writing Blog Posts with Generative Feedback Loops!☆48Updated last year
- Cerule - A Tiny Mighty Vision Model☆66Updated 9 months ago
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆20Updated 4 months ago
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod☆17Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 2 months ago
- Eh, simple and works.☆27Updated last year
- Set of scripts to finetune LLMs☆37Updated last year
- ☆34Updated 3 months ago
- ☆17Updated 4 months ago
- LLM reads a paper and produce a working prototype☆57Updated 2 months ago
- ☆47Updated 4 months ago
- ☆36Updated 4 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Chat Markup Language conversation library☆55Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆42Updated last year
- Introducing Alplex, an AI-powered virtual law office designed to assist you with legal issues based on Swiss laws☆25Updated 11 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 10 months ago
- Scripts to create your own moe models using mlx☆90Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 3 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆45Updated 2 months ago
- Ongoing research training transformer models at scale☆38Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆120Updated last year