blculyn/The-spoken-L1-corpus

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/blculyn/The-spoken-L1-corpus)

blculyn / The-spoken-L1-corpus

The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus to the spoken L2 corpus. It comprises L1-L1 conversational interactions between L1 speakers of Chinese and a native Chinese speaker in informal settings. This corpus contains 228,306 words of transcribed intera…

☆23

Alternatives and similar repositories for The-spoken-L1-corpus

Users that are interested in The-spoken-L1-corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

frederick-wang / jianmu
View on GitHub
A simple desktop app development framework combining Python, Vue.js, Element Plus and Electron.
☆11Feb 9, 2023Updated 3 years ago
frederick-wang / pyreactivity
View on GitHub
Providing a reactivity system similar to Vue.js for Python.
☆16Sep 28, 2024Updated last year
mengzaiqiao / awesome-natural-language-reasoning
View on GitHub
A collection of research papers related to Natural Language Reasoning
☆10May 27, 2022Updated 4 years ago
CocoTan1020 / MLF-BERT
View on GitHub
基于多层级语言特征融合的中文文本可读性分级模型
☆12Feb 27, 2024Updated 2 years ago
blcuicall / YACLC
View on GitHub
Yet Another Chinese Learner Corpus
☆77Jan 10, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rycolab / parsing-as-tagging
View on GitHub
☆21Nov 19, 2023Updated 2 years ago
vimqa / vimqa
View on GitHub
VIMQA dataset
☆15Jul 6, 2022Updated 4 years ago
krangelie / bias-in-german-nlg
View on GitHub
Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.
☆16Sep 25, 2024Updated last year
iris2hu / L2C-rater
View on GitHub
Automated Essay Scoring Method for Chinese Second Language Writing
☆33Mar 17, 2022Updated 4 years ago
kietnv / VietnameseDatasets
View on GitHub
We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM…
☆22Jun 19, 2021Updated 5 years ago
Arborator / arborator-server
View on GitHub
The Arborator software is aimed at collaboratively annotating dependency corpora.
☆26Nov 5, 2019Updated 6 years ago
mmontone / garnet
View on GitHub
Garnet - a graphical toolkit for Lisp
☆21Dec 30, 2021Updated 4 years ago
kornai / 4lang
View on GitHub
Concept dictionary
☆41Apr 4, 2024Updated 2 years ago
YuhuYang / QuanSyn
View on GitHub
QuanSyn: A Python Package for Quantitative Syntax Analysis.
☆40Jul 3, 2026Updated 2 weeks ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
paulbricman / k-probes
View on GitHub
Promoting critical thinking through machine-generated prompts.
☆19Sep 21, 2021Updated 4 years ago
CIRCSE / LT4HALA
View on GitHub
Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)
☆38May 19, 2026Updated 2 months ago
Doraemonzzz / xmixers
View on GitHub
Xmixers: A collection of SOTA efficient token/channel mixers
☆28Sep 4, 2025Updated 10 months ago
nttcslab-nlp / word_align
View on GitHub
A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT
☆26Jan 27, 2021Updated 5 years ago
manakanemu / ctoj
View on GitHub
这是一个中日汉字文字转换网站
☆39Mar 4, 2021Updated 5 years ago
emalach / LinearLM
View on GitHub
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21Jul 29, 2024Updated last year
yeluo1994 / DSBI
View on GitHub
Double-Sided Braille Image Dataset
☆29Nov 18, 2020Updated 5 years ago
Jihuai-wpy / bert-ancient-chinese
View on GitHub
Pretrained BERT for Ancient (Classical) Chinese, with an expanded vocabulary for rare characters.
☆48Feb 20, 2023Updated 3 years ago
iris2hu / nlp-tasks-examples-icip
View on GitHub
This is a code example repo for the NLP course offered by the Institute of Chinese Information Processing of BNU.
☆57May 2, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CanCLID / awesome-cantonese-nlp
View on GitHub
A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP
☆95Oct 17, 2021Updated 4 years ago
thoppe / deep-phonics
View on GitHub
Deep learning spelling patterns with a recurrent neural network
☆11Jun 5, 2017Updated 9 years ago
nex-agi / NexHTML
View on GitHub
HTML Agent based on NexAU
☆16Nov 20, 2025Updated 8 months ago
iris2hu / ancient_chinese_sense_annotation
View on GitHub
Ancient Chinese Corpus with Word Sense Annotation
☆73May 29, 2024Updated 2 years ago
helloanoop / kgraph-whitepaper
View on GitHub
kgraph whitepaper
☆19Nov 16, 2021Updated 4 years ago
jaaack-wang / ChineseNLPCorpus
View on GitHub
中文自然语言处理数据集，平时做做实验的材料。欢迎补充提交合并。
☆38Dec 3, 2021Updated 4 years ago
turboMaCk / any-set
View on GitHub
Elm Set built on top of AnyDict
☆10Aug 12, 2024Updated last year
jialiang / EPSON-True-Regular-Script-Medium
View on GitHub
☆14Jan 20, 2023Updated 3 years ago
milesaturpin / cot-unfaithfulness
View on GitHub
☆57Oct 23, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
L-M-Sherlock / anki_revlog_analysis
View on GitHub
Anki 复习数据处理与分析
☆18May 18, 2021Updated 5 years ago
FeiSun / ContentExtraction
View on GitHub
Content Extraction via Text Density (SIGIR11)
☆24Sep 21, 2015Updated 10 years ago
lotusfa / IPA-Translator
View on GitHub
☆57May 4, 2026Updated 2 months ago
JENebel / const_for
View on GitHub
For loops in const
☆13Jul 3, 2026Updated 2 weeks ago
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
shenshen-hungry / Ancient-Chinese-Segmentation
View on GitHub
A tool for ancient Chinese segmentation.
☆54Apr 27, 2019Updated 7 years ago
jermp / tongrams_estimation
View on GitHub
A C++ library implementing fast language models estimation using the 1-Sort algorithm.
☆16May 18, 2023Updated 3 years ago