cohere-ai / sandbox-multilingual
A demonstration of a multilingual semantic search engine you can be quickly built using Cohere's platform.
☆60Updated last year
Related projects ⓘ
Alternatives and complementary repositories for sandbox-multilingual
- Conversational AI tooling & personas built on Cohere's LLMs☆173Updated last year
- A sandbox repo for grounded question answering with Cohere and Google Search☆136Updated last year
- A demonstration of how a toy (but usable!) semantic search engine can be quickly built using Cohere's platform.☆115Updated last year
- Experiments with generating opensource language model assistants☆97Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- ☆48Updated last year
- ☆24Updated last year
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆217Updated last year
- ☆31Updated last year
- A library for squeakily cleaning and filtering language datasets.☆45Updated last year
- Interacting with Jarvislabs.ai for creating GPU/CPU powered instances on top of A100, A6000, RTX 5000.☆12Updated 3 months ago
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Updated 2 years ago
- Patch for MPT-7B which allows using and training a LoRA☆58Updated last year
- This repository contains all the code for collecting large scale amounts of code from GitHub.☆105Updated last year
- ☆22Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆28Updated 2 months ago
- ☆46Updated this week
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- Camel-Coder: Collaborative task completion with multiple agents. Role-based prompts, intervention mechanism, and thoughtful suggestions☆33Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆149Updated 4 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆68Updated last year
- ☆20Updated last year
- Code repository for the c-BTM paper☆105Updated last year
- Hosting the JSON for the GPT4 Tokenizer☆65Updated last year
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆113Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆304Updated last year