Anton Karl Ingason

  • Increase font size
  • Default font size
  • Decrease font size

The Free Icelandic-English Dictionary for Natural Language Processing

Print
Citation
Anton Karl Ingason. 2009.
The Free Icelandic-English Dictionary for Natural Language Processing. Version 1.0.
http://www.hi.is/~antoni/

About
The first version of this dictionary is really just a small list of Icelandic-English translations in a format which is easy to use for Natural Language Processing / Computational Linguistics. There are about 6000 word ids in version 1.0. A word id has the format lemma_posTag. The Part-of-Speech (PoS) tag indicates the word class ('n' for noun, 's' for verb, etc.) and in the case of a noun there is a second character for gender so 'nk' means noun-masculine, 'nv' means noun-feminine and 'nh' means noun-neuter.

In many cases more than one possible translation is given. The vocabulary is a mix of words that are generally common in Icelandic and words which are frequent in news texts. Because of the small size and simple presentation this is by no means a substitute for a 'real' Icelandic-English dictionary, but it can be useful if you don't have a licence for such a resource or if you just want to do some small scale experiments which require a mapping between the two languages. A few translations are prefixed with '*' which means a potentially correct translation for an incorrectly lemmatized word. Those entries are only relevant if you are using the Lemmald lemmatizer.

Errors
Are there errors in the dictionary? Yes. Please let me know by email (see left side of the page) if you find some and those will be corrected before the next version is realeased.

Licence
The Free Icelandic-English Dictionary for Natural Language Processing is distributed under a LGPL licence which means you can use it for most purposes, including commercial use, but read the licence for details about how it works (it is included in the distribution).

Download
Here you can download The Free Icelandic-English Dictionary for Natural Language Processing, Version 1.0 (zip archive).

Last Updated on Tuesday, 15 September 2009 00:34