Tokun to can tokens
☆18Jun 19, 2025Updated 9 months ago
Alternatives and similar repositories for tokun
Users that are interested in tokun are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- Repository for "Training Language Models To Explain Their Own Computations"☆21Dec 22, 2025Updated 3 months ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆21Apr 8, 2025Updated 11 months ago
- Arabic edition of ALBERT pretrained language models☆16Apr 25, 2021Updated 4 years ago
- Gradio application using LLMs to generate csv/apkg to aid with memorizing topics in Anki☆25Mar 12, 2026Updated last week
- Showcasing the power of Ruby on Rails.☆12Jun 7, 2020Updated 5 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- ☆12Mar 28, 2023Updated 2 years ago
- let's you chat with website. crawls a website, embeds to vectors, stores to Chroma.☆26Sep 16, 2023Updated 2 years ago
- QLoRA for Masked Language Modeling☆23Sep 11, 2023Updated 2 years ago
- LTG-Bert☆34Jan 8, 2024Updated 2 years ago
- Slop Scoring to Stop Slop☆56Mar 16, 2026Updated last week
- My Gen AI research☆11Jun 3, 2024Updated last year
- Peer-to-Peer Databases for the Decentralized Web☆15Aug 24, 2023Updated 2 years ago
- ☆14Sep 18, 2024Updated last year
- Completely remade PDF viewer in Atom☆12Feb 8, 2020Updated 6 years ago
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- A vllm proxy server to add security and multi model management for vllm servers☆12May 30, 2024Updated last year
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal…☆20Dec 29, 2024Updated last year
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teaming☆15Feb 24, 2024Updated 2 years ago
- QALD-9-Plus Dataset for Knowledge Graph Question Answering☆29Jun 5, 2024Updated last year
- Example of Langchain-Elasticsearch integrations & RAG.☆12Sep 20, 2024Updated last year
- ☆18Feb 10, 2018Updated 8 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTor…☆15Feb 27, 2024Updated 2 years ago
- A dual-chatbot system for learning languages based on LangChain☆13Jun 25, 2023Updated 2 years ago
- API Hashing and String Decryption Reverse Engineering Workshop☆20Jul 26, 2023Updated 2 years ago
- How NOT to optimize something☆15May 30, 2018Updated 7 years ago
- 🚀 Minimal Solidity contract testing with Ganache and Jest☆14Apr 30, 2019Updated 6 years ago
- Personnal collection of pipes and filters I use for open-webui☆26Mar 10, 2026Updated last week
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- ☆10Mar 22, 2024Updated 2 years ago
- ☆21Jun 4, 2024Updated last year
- ☆13Sep 12, 2024Updated last year
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- ☆15Nov 9, 2022Updated 3 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago