Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
☆30Jul 12, 2021Updated 4 years ago
Alternatives and similar repositories for tokenizations
Users that are interested in tokenizations are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆195Oct 4, 2023Updated 2 years ago
- An Interactive Tool for Annotating Discourse Structure and Text Improvement☆16Sep 15, 2021Updated 4 years ago
- Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models☆16Sep 13, 2021Updated 4 years ago
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.☆17Nov 11, 2021Updated 4 years ago
- ☆17May 19, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- personalized-llms with allen institute☆13Jun 22, 2023Updated 2 years ago
- Deep Counterfactual Prediction with Categorical Backward Variables☆12Feb 8, 2023Updated 3 years ago
- ☆12Oct 4, 2021Updated 4 years ago
- Official implementation of the ACL 2022 paper "Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization"☆14Dec 26, 2022Updated 3 years ago
- Elegant and fast Material Design template for academics. Perfect 100/100 performance score.☆12Mar 21, 2025Updated last year
- ☆59May 4, 2022Updated 4 years ago
- Code for HypMix EMNLP 2021 (main)☆23Oct 4, 2021Updated 4 years ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- ☆24May 22, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- PyTorch utilities for ML, specifically speech☆13Jan 30, 2024Updated 2 years ago
- ☆10Jun 11, 2024Updated 2 years ago
- Python script to transform the Mobile Detect JSON database into an UA-based mobile detection VCL subroutine easily integrable in any Varn…☆14Nov 13, 2023Updated 2 years ago
- Code for "A Principled Framework for Multi-View Contrastive Learning"☆20Jul 10, 2025Updated 11 months ago
- ☆12May 21, 2019Updated 7 years ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer lear…☆40Dec 15, 2024Updated last year
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.☆17May 30, 2025Updated last year
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆35Mar 26, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This small project demonstrates how to integrate WordPress blog entries into queries for a RAG-based (Retriever-Augmented Generation) lan…☆11Apr 2, 2024Updated 2 years ago
- Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documentss"☆14Oct 8, 2024Updated last year
- Codes for the experiments in our EMNLP 2021 paper "Open Aspect Target Sentiment Classification with Natural Language Prompts"☆37Nov 4, 2021Updated 4 years ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- A CoroutineExecutor for asyncio, similar to nurseries and task groups☆13Aug 20, 2022Updated 3 years ago
- ☆23Sep 21, 2020Updated 5 years ago
- A Python micro framework for creating LLM-driven agents☆22May 20, 2025Updated last year
- Build ML pipelines with smart caching and remote execution. Develop locally, deploy to HPC clusters instantly. Track with Aim. 🎯☆13Feb 10, 2026Updated 4 months ago
- Conditional Random Fields implemented as Lasagne layer☆10Jul 22, 2016Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Low-rank Highway Networks☆13Mar 11, 2016Updated 10 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated last year
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 4 years ago
- ☆11Dec 19, 2023Updated 2 years ago
- Python package for parsing very large XML files☆11Oct 3, 2018Updated 7 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago