samlhuillier / spider-sql-finetuneLinks
☆17Updated last year
Alternatives and similar repositories for spider-sql-finetune
Users that are interested in spider-sql-finetune are comparing it to the libraries listed below
Sorting:
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆37Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- ☆12Updated 11 months ago
- Verifiers for LLM Reinforcement Learning☆56Updated last month
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- ☆20Updated last year
- ☆20Updated 7 months ago
- ☆33Updated last month
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- ☆24Updated 8 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆14Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- ☆17Updated last year
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆39Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- FuseAI Project☆87Updated 4 months ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆15Updated 6 months ago
- Dedicated to building industrial foundation models for universal data intelligence across industries.☆54Updated 9 months ago
- ☆15Updated last year
- ☆28Updated last year
- Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inferen…☆19Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Rust bindings for CTranslate2☆14Updated last year
- Finetune any model on HF in less than 30 seconds☆58Updated 2 months ago
- Tools for merging pretrained large language models.☆19Updated 11 months ago
- A repository for research on medium sized language models.☆76Updated last year
- a tool for gerenate dataset from doc☆12Updated 2 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 10 months ago