bethelmelesse / UnifiedCrawl
☆11Updated 2 months ago
Alternatives and similar repositories for UnifiedCrawl:
Users that are interested in UnifiedCrawl are comparing it to the libraries listed below
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated 10 months ago
- The first dense retrieval model that can be prompted like an LM☆64Updated 4 months ago
- ☆39Updated this week
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆29Updated 2 months ago
- XmodelLM☆39Updated 2 months ago
- utilities for loading and running text embeddings with onnx☆43Updated 5 months ago
- ☆52Updated 8 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆34Updated last week
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆101Updated this week
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Using modal.com to process FineWeb-edu data☆19Updated last month
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆17Updated 3 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆63Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 8 months ago
- Efficient few-shot learning with cross-encoders.☆44Updated 11 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆52Updated 11 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆20Updated last month
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆59Updated 5 months ago
- ☆48Updated 2 months ago
- Tokun to can tokens☆15Updated this week
- Supercharge huggingface transformers with model parallelism.☆76Updated 3 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated last month
- ☆46Updated 11 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 2 months ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated 6 months ago
- ☆42Updated last week
- Tools for formatting large language model prompts.☆12Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- Routing on Random Forest (RoRF)☆100Updated 4 months ago