Main Page

From Icelandic Parsed Historical Corpus (IcePaHC)
Revision as of 09:02, 1 July 2010 by Anton (Talk | contribs)

This is the wiki for the Icelandic Parsed Historical Corpus (IcePaHC). It mainly documents the annotation standard for those constructing and using the corpus. The annotation scheme is designed to be largely compatible with the Penn historical corpora, and the guidelines here are written as a supplement to the Penn guidelines; see Beatrice Santorini's guidelines for further information.

How to get a copy?

The treebank is still under construction, but preview versions will be released regularly as work progresses. The first preview, version 0.1, will be released on July 1st 2010 and will be available for download from this site under a free and open source license. Until then, you can follow the development on GitHub.

Citation for the version 0.1 preview release (of July 1st 2010)

Wallenberg, Joel, Anton Karl Ingason, Einar Freyr Sigurðsson and Eiríkur Rögnvaldsson. 2010. 
Icelandic Parsed Historical Corpus (IcePaHC). 
Version 0.1. http://www.linguist.is/wiki

Annotation guidelines:

Search PPCME/PPCEME documentation (ling.upenn.edu/~beatrice/annotation) <html> <form method="get" action="http://www.google.com/search">

  <a href="http://www.google.com/">  
  <img src="http://www.google.com/logos/Logo_40wht.gif" border="0" alt="Google"></a>  
  <input type="text" name="q" maxlength="255" />  
  <input type="submit" value="Google Search" />  
  <input style="visibility:hidden" type="radio" name="sitesearch" value="http://ling.upenn.edu/~beatrice/annotation/" checked="checked" />

</form> </html>

General information

Annotation team stuff:

Resources

Treebank team:

Grants

The project is funded in part by the following grants:

  • From the Icelandic Research Fund (RANNÍS), grant "Viable Language Technology beyond English – Icelandic as a test case".
  • From the National Science Foundation (NSF), post-doc-something (Joel, ...)