Easy, efficient and Pythonic data loading of Parquet files for PyTorch-based libraries
☆24Sep 21, 2020Updated 5 years ago
Alternatives and similar repositories for PyParquetLoaders
Users that are interested in PyParquetLoaders are comparing it to the libraries listed below
Sorting:
- Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere…☆11Oct 16, 2022Updated 3 years ago
- Traditional operating systems are reactive - they wait for user input or system events before taking action. SwarmOS breaks this paradigm…☆15Dec 6, 2024Updated last year
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!☆21Oct 29, 2022Updated 3 years ago
- 🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents☆39Dec 22, 2025Updated 3 months ago
- ☆27Mar 29, 2025Updated 11 months ago
- A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…☆11Dec 11, 2023Updated 2 years ago
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 7 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- The corresponding code from our paper "Social Commonsense Reasoning with Multi-Head Knowledge Attention (EMNLP 2020)". Do not hesitate to…☆11Jun 12, 2022Updated 3 years ago
- Code and dataset for the EMNLP 2024 paper: GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory☆48Sep 26, 2024Updated last year
- Historical shortest-path distance querying index by pruned landmark labeling☆10May 24, 2014Updated 11 years ago
- Prompt-based software development☆23Aug 25, 2024Updated last year
- A small library that wraps Keras models to pickle them.☆14Jul 17, 2018Updated 7 years ago
- Baseline models for the paper: "Modeling Naive Psychology of Characters in Simple Commonsense Stories" by Hannah Rashkin, Antoine Bosselu…☆16Feb 23, 2021Updated 5 years ago
- HatAsm is a powerful assembler and disassembler that provides support for all common architectures.☆18Jan 5, 2026Updated 2 months ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- Data on verb transitivity in English and script to extract transitivity information from Google's syntactic ngrams corpus☆11Oct 1, 2018Updated 7 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- IJCNN 2021: Inductive Learning on Commonsense Knowledge Graph Completion (Depreciated)☆15Nov 13, 2023Updated 2 years ago
- Cowic is a C++ library to compress formatted log like Apache access log.☆11May 3, 2015Updated 10 years ago
- ☆13Mar 27, 2020Updated 5 years ago
- Deterministic Acyclic Finite State Automaton implementation for morphological analysis☆18Dec 17, 2020Updated 5 years ago
- ☆43Aug 6, 2024Updated last year
- Locality sensitive hash functions for Tensorflow 2.0.☆12Feb 18, 2022Updated 4 years ago
- Comprehensive Information Integration Modeling Framework for Video Titling☆11Aug 27, 2020Updated 5 years ago
- A full-featured, hackable Next.js AI chatbot built by Vercel but running solely on a VPS, no outside APIs except for LLMs☆12Apr 16, 2024Updated last year
- CIKM 2019 E-Commerce AI Challenge - 超大规模推荐之用户兴趣高效检索☆11Jul 14, 2021Updated 4 years ago
- 短视频内容理解与推荐竞赛☆12Feb 18, 2019Updated 7 years ago
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search☆11Jun 21, 2023Updated 2 years ago
- Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems☆14Nov 11, 2023Updated 2 years ago
- ☆15Sep 24, 2022Updated 3 years ago
- A collection of pre-build wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and othersz!☆43Oct 27, 2025Updated 4 months ago
- Benchmark for Answering Existential First Order Queries with Single Free Variable (NeurIPS dataset and benchmark 2021)☆20May 3, 2023Updated 2 years ago
- Click-Through Rate Estimation for Rare Events in Online Advertising☆14Oct 17, 2018Updated 7 years ago
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"☆19May 2, 2025Updated 10 months ago
- Detect and remove unused dependencies for Python projects☆18Apr 5, 2025Updated 11 months ago
- This is an implementation of CartoonGAN in pytorch, including both ".py" and ".ipynb" version.☆12Nov 28, 2019Updated 6 years ago
- Ranking and Selecting Multi-Hop Knowledge Paths to Better Predict Human Needs (NAACL 2019)☆16Mar 22, 2021Updated 5 years ago