patronus-ai / copyright-evals
☆18Updated 6 months ago
Related projects: ⓘ
- ☆38Updated 5 months ago
- Do Multilingual Language Models Think Better in English?☆41Updated last year
- An OpenAI Completions API compatible server for NLP transformers models☆54Updated 10 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆101Updated last week
- ☆67Updated 6 months ago
- ☆41Updated last year
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆41Updated last year
- Annotation meets Large Language Models (ChatGPT, GPT-3 and alike).☆52Updated last year
- Track the progress of LLM context utilisation☆53Updated 2 months ago
- Writing Blog Posts with Generative Feedback Loops!☆41Updated 6 months ago
- Project of llm evaluation to Japanese tasks☆67Updated last week
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆73Updated 6 months ago
- Web UI & Backend for Data Annotations in Aya☆26Updated 6 months ago
- ☆47Updated 3 weeks ago
- ☆58Updated 3 weeks ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated 8 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆33Updated 6 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆33Updated 5 months ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆84Updated 6 months ago
- Evaluating LLMs with CommonGen-Lite☆83Updated 6 months ago
- Japanese LLaMa experiment☆50Updated 6 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆49Updated 3 weeks ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆71Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module☆38Updated last month
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆51Updated last month
- GPU accelerated client-side embeddings for vector search, RAG etc.☆63Updated 9 months ago
- ☆71Updated 3 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆32Updated last year
- Automated testing and benchmarking for code generation agents.☆17Updated last year
- ☆43Updated 7 months ago