CVxTz / distill-llmLinks
☆21Updated last year
Alternatives and similar repositories for distill-llm
Users that are interested in distill-llm are comparing it to the libraries listed below
Sorting:
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆117Updated this week
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆67Updated last year
- Efficient few-shot learning with cross-encoders.☆58Updated last year
- Evaluation of bm42 sparse indexing algorithm☆68Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆135Updated last year
- Fine-Tuning LLM and embedding models☆27Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆61Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 11 months ago
- Code, data, and model of paper "Text-to-SQL Error Correction with Language Models of Code" (ACL'23)☆31Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding☆129Updated last year
- Generalist and Lightweight Model for Text Classification☆161Updated 3 months ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆67Updated 7 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆212Updated last week
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Updated 2 years ago
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…☆25Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆179Updated last year
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆44Updated last year
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆228Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆69Updated 9 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆112Updated last year
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extraction…☆105Updated last year
- ☆77Updated 8 months ago
- Code for KaLM-Embedding models☆91Updated 2 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆28Updated last year
- Simply, faster, sentence-transformers☆143Updated last year
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness☆102Updated 8 months ago
- ☆50Updated last year
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆85Updated 2 months ago
- Universal text classifier for generative models☆24Updated last year