Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
☆30Jul 12, 2021Updated 4 years ago
Alternatives and similar repositories for tokenizations
Users that are interested in tokenizations are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆194Oct 4, 2023Updated 2 years ago
- An Interactive Tool for Annotating Discourse Structure and Text Improvement☆16Sep 15, 2021Updated 4 years ago
- Hierarchical Universal Modular ANotator☆12Apr 21, 2026Updated 2 weeks ago
- Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models☆16Sep 13, 2021Updated 4 years ago
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.☆17Nov 11, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17May 19, 2023Updated 2 years ago
- personalized-llms with allen institute☆14Jun 22, 2023Updated 2 years ago
- Deep Counterfactual Prediction with Categorical Backward Variables☆12Feb 8, 2023Updated 3 years ago
- ☆12Oct 4, 2021Updated 4 years ago
- 💫 A spaCy package for Yohei Tamura's Rust tokenizations library☆35Mar 27, 2026Updated last month
- Elegant and fast Material Design template for academics. Perfect 100/100 performance score.☆12Mar 21, 2025Updated last year
- ☆59May 4, 2022Updated 4 years ago
- Code for HypMix EMNLP 2021 (main)☆23Oct 4, 2021Updated 4 years ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24May 22, 2023Updated 2 years ago
- PyTorch utilities for ML, specifically speech☆13Jan 30, 2024Updated 2 years ago
- ☆45Jun 5, 2021Updated 4 years ago
- ☆10Jun 11, 2024Updated last year
- Python script to transform the Mobile Detect JSON database into an UA-based mobile detection VCL subroutine easily integrable in any Varn…☆14Nov 13, 2023Updated 2 years ago
- ☆12May 21, 2019Updated 6 years ago
- Temporal Graph Rewiring Method with Expander Graphs☆12Oct 18, 2024Updated last year
- A span-sharing joint extraction framework for harvesting aspect sentiment triplets☆20Mar 4, 2022Updated 4 years ago
- tensor rank learning in CP decomposition via convolutional neural network☆11Apr 19, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆33Mar 26, 2023Updated 3 years ago
- This small project demonstrates how to integrate WordPress blog entries into queries for a RAG-based (Retriever-Augmented Generation) lan…☆11Apr 2, 2024Updated 2 years ago
- Mike X Cohen lecturelets on Analyzing Neural Time Series Data: Theory and Practice http://mikexcohen.com/lectures.html☆14Dec 30, 2020Updated 5 years ago
- [ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning☆16May 24, 2025Updated 11 months ago
- This repository contains the source codes for the paper: "SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environm…☆16Oct 11, 2021Updated 4 years ago
- ☆23Sep 21, 2020Updated 5 years ago
- Build ML pipelines with smart caching and remote execution. Develop locally, deploy to HPC clusters instantly. Track with Aim. 🎯☆13Feb 10, 2026Updated 2 months ago
- Conditional Random Fields implemented as Lasagne layer☆10Jul 22, 2016Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- End to End training of Spatial Pyramid Networks☆13Apr 8, 2017Updated 9 years ago
- Low-rank Highway Networks☆13Mar 11, 2016Updated 10 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated last year
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- ☆11Dec 19, 2023Updated 2 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- Enhance robot task understanding ability through visual semantic graph☆10May 20, 2021Updated 4 years ago