CVxTz / distill-llm
☆21Updated last year
Alternatives and similar repositories for distill-llm:
Users that are interested in distill-llm are comparing it to the libraries listed below
- Generalist and Lightweight Model for Text Classification☆121Updated last week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 4 months ago
- Chunk your text using gpt4o-mini more accurately☆44Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 6 months ago
- ☆33Updated last year
- Evaluation of bm42 sparse indexing algorithm☆65Updated 9 months ago
- ☆45Updated 6 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆82Updated 3 months ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Updated last year
- Efficient few-shot learning with cross-encoders.☆51Updated last year
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆65Updated 7 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆64Updated 3 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 5 months ago
- A new novel multi-modality (Vision) RAG architecture☆25Updated 6 months ago
- Code for KaLM-Embedding models☆75Updated last month
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held …☆41Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆74Updated 6 months ago
- GLiNER model in a FastAPI microservice.☆41Updated 4 months ago
- Analysis on the cost of encoder based models☆11Updated 2 months ago
- Universal text classifier for generative models☆24Updated 8 months ago
- This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João…☆63Updated 6 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 7 months ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆42Updated 9 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆45Updated last week
- ☆42Updated 2 months ago
- DSPY on action with OpenSource LLMs.☆70Updated last year
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆43Updated last year
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆101Updated last year
- A RAG that can scale 🧑🏻💻☆11Updated 10 months ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆67Updated last year