☆44 Dec 28, 2022 Updated 3 years ago
Alternatives and similar repositories for interesting-text-datasets
Users that are interested in interesting-text-datasets are comparing it to the libraries listed below.
- Hidden Engrams: Long Term Memory for Transformer Model Inference ☆35 Jun 26, 2021 Updated 4 years ago
- clip retrieval benchmark ☆17 May 4, 2022 Updated 3 years ago
- Turn any collection of files into a dataset ☆45 Mar 10, 2023 Updated 3 years ago
- A re-implementation of Stable-Diffusion using better code practices with faster and lower-memory usage. ☆45 Feb 8, 2023 Updated 3 years ago
- Multidocument Summarization for Literature Review Shared Task 2022 ☆30 Oct 16, 2022 Updated 3 years ago
- Demonstration that finetuning a RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 Jun 21, 2023 Updated 2 years ago
- GPT2 Byte Pair Encoding implementation in Golang ☆25 Jul 9, 2025 Updated 9 months ago
- Synthetic Data Generation for Evaluation ☆14 Feb 21, 2025 Updated last year
- CLOOB training (JAX) and inference (JAX and PyTorch) ☆74 May 16, 2022 Updated 3 years ago
- PyTorch based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al. ☆12 May 1, 2020 Updated 5 years ago
- An adaptive training algorithm for residual network ☆17 Aug 22, 2020 Updated 5 years ago
- discord bot using AI to generate images based on discord messages ☆11 Oct 10, 2023 Updated 2 years ago
- Sentencepiece based BPE tokenizer for English and Japanese language text. ☆28 Apr 4, 2024 Updated 2 years ago
- ☆24 Jun 18, 2024 Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile ☆116 Mar 22, 2023 Updated 3 years ago
- A script for immunizing a Google account against the effects of 13 September, which will break some Google Drive links ☆14 Sep 13, 2021 Updated 4 years ago
- My explorations into editing the knowledge and memories of an attention network ☆35 Dec 8, 2022 Updated 3 years ago
- Platform and API Agnostic library for powering chatbots ☆23 Feb 27, 2023 Updated 3 years ago
- Implementation of a simple BPE tokenizer, but in Nim ☆22 Jul 2, 2023 Updated 2 years ago
- Dataset and scripts for HRDoc ☆41 Jun 21, 2023 Updated 2 years ago
- Generate visual podcasts about novels using open source models ☆27 Feb 15, 2023 Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f… ☆16 Apr 22, 2021 Updated 5 years ago
- The code for training a sparsity predictor on LLaMA. ☆18 May 12, 2024 Updated last year
- ☆65 Oct 4, 2023 Updated 2 years ago
- ☆30 Nov 25, 2021 Updated 4 years ago
- Twitter Auto-reply bot ☆13 Dec 10, 2014 Updated 11 years ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022 ☆13 Oct 20, 2022 Updated 3 years ago
- The goal is to pilot Microsoft Cognitive Services to unlock the strategic value of UN unstructured content by building on AI and semantic… ☆16 Jul 6, 2023 Updated 2 years ago
- Gather module dependencies of source code ☆13 Jul 21, 2023 Updated 2 years ago
- This is an implementation of CartoonGAN in PyTorch, including both ".py" and ".ipynb" versions. ☆12 Nov 28, 2019 Updated 6 years ago
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i… ☆14 Jan 2, 2022 Updated 4 years ago
- A tool for benchmarking image generation models. ☆33 Jan 13, 2023 Updated 3 years ago
- An open source implementation of CLIP. ☆33 Nov 7, 2022 Updated 3 years ago
- This repository will be a summary and outlook on all our open, medical, AI advancements. ☆30 Feb 24, 2023 Updated 3 years ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate. ☆38 Apr 22, 2026 Updated last week
- A Collection of Pydantic Models to Abstract IRL ☆39 Dec 10, 2025 Updated 4 months ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from Google's paper `On the Theoretica… ☆16 Sep 4, 2025 Updated 7 months ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence) ☆15 Oct 27, 2023 Updated 2 years ago
- Jupyter Widget wrapper for Lineup.js ☆12 Mar 15, 2023 Updated 3 years ago