corpus-data
Here are 155 public repositories matching this topic...
Data from a corpus of written Hawaiian
-
Updated
Jun 27, 2016
Kumpulan dokumen korpus dalam bahasa Indonesia berisi kasus uji deteksi plagiarisme eksternal dengan standar PAN CLEF (http://www.uni-weimar.de/medien/webis/events/pan-11).
-
Updated
Aug 8, 2016 - Python
Estonian TimeML Annotated Corpus \ Eesti keele TimeML märgendatud korpus
-
Updated
Nov 1, 2016 - Python
Build an n-way multilingual corpus
-
Updated
Jan 30, 2017 - Python
Vietnamese Wikipedia Corpus
-
Updated
May 18, 2017 - Python
A collection of research dataset files used for testing Archivematica integration and functionality in the JISC Research Data Shared Service (RDSS).
-
Updated
Jun 30, 2017 - HTML
A linguistic corpus of Czech native learners acquiring Italian language
-
Updated
Jul 10, 2017
A crawler for Coursera
-
Updated
Jul 17, 2017 - Python
Emacs Lisp corpus. Code collected from many-many projects for you to query it!
-
Updated
Oct 15, 2017 - Emacs Lisp
This repository contains some Karelian data with which Markus Juutinen and Niko Partanen have been involved with
-
Updated
Jan 3, 2018
A baseline results towards constructing readability corpus ARC-WMI, a new Arabic collection of written medicine information annotated with readability levels.
-
Updated
Feb 23, 2018
-
Updated
Apr 6, 2018 - R
Web app and tools for quantitative analysis of metaphor in corpora
-
Updated
Apr 16, 2018 - Jupyter Notebook
Lemma frequency list created from parsed monolingual OPUS XML files, made with Python and ❤️.
-
Updated
May 2, 2018 - Python
This corpus contails detail data scraped from daraz.com.np. This corpus is shared to assist Nepali Researchers in their projects.
-
Updated
Jun 1, 2018
-
Updated
Jul 20, 2018 - Jupyter Notebook
golden arabic corpus build for test Assem's arabicstemmer and other arabic stemmers
-
Updated
Aug 24, 2018 - Python
Improve this page
Add a description, image, and links to the corpus-data topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the corpus-data topic, visit your repo's landing page and select "manage topics."