patronus-ai / copyright-evals
☆18Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for copyright-evals
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.☆23Updated this week
- ☆68Updated 8 months ago
- Japanese LLaMa experiment☆52Updated 8 months ago
- Writing Blog Posts with Generative Feedback Loops!☆43Updated 8 months ago
- ☆48Updated 2 weeks ago
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆42Updated last year
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Updated 11 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 2 years ago
- Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.☆9Updated 11 months ago
- Automated testing and benchmarking for code generation agents.☆17Updated last year
- Track the progress of LLM context utilisation☆53Updated 4 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆50Updated this week
- ☆20Updated 9 months ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- Web UI & Backend for Data Annotations in Aya☆26Updated 8 months ago
- Do Multilingual Language Models Think Better in English?☆41Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆98Updated 10 months ago
- ☆38Updated 7 months ago
- ☆16Updated 2 years ago
- Project of llm evaluation to Japanese tasks☆77Updated this week
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆77Updated 8 months ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated 5 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆33Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated last month
- aiXplain enables python programmers to add AI functions to their software.☆27Updated this week
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆77Updated last month
- Hosting the JSON for the GPT4 Tokenizer☆65Updated last year