sebischair / FusionSentLinks
Repository of the ICNLSP 2024 paper "Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes"
☆17Updated 10 months ago
Alternatives and similar repositories for FusionSent
Users that are interested in FusionSent are comparing it to the libraries listed below
Sorting:
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆67Updated last year
- ☆48Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆32Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆44Updated last year
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Updated 4 months ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆72Updated last month
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 2 months ago
- ☆120Updated last year
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- Small finetuned LLMs for a diverse set of useful tasks☆127Updated 2 years ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆291Updated 8 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆69Updated last year
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆61Updated last year
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆171Updated 5 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆213Updated 2 months ago
- Completion After Prompt Probability. Make your LLM make a choice☆81Updated last year
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆336Updated 11 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆102Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆62Updated last year
- ☆55Updated 9 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104Updated 6 months ago
- Generalist and Lightweight Model for Text Classification☆165Updated 5 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆165Updated last year
- Guideline following Large Language Model for Information Extraction☆409Updated last year
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain…☆52Updated 2 years ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆136Updated 10 months ago