Tagset

From Icelandic Parsed Historical Corpus (IcePaHC)
Revision as of 21:46, 1 May 2010 by Einarfs (Talk | contribs) (Verbs)


This is the head-level tagset used in the Icelandic Treebank. The tagset is based on the IFD Tagset. Each head is assigned a tag (N-NSDIC in the example below):

<synttree>[NP[N-NSDIC[barni]]]</synttree>

In the example, barni is the dative form of the Icelandic word for 'child'. The first part of the tag, before the dash, always represents the head type (roughly the word class), e.g. N for noun. The extension of the above tag, NSDIC, means: neuter, singular, dative, indefinite (no suffixed article), common noun. The head type before the dash (the "predashial" part) can be one or more characters, while the subfeatures after the dash (the "postdashial" part) are always one character per feature.
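The splitting rule described above can be illustrated with a minimal Python sketch. The function name `split_tag` is hypothetical and not part of any IcePaHC tooling; the sketch only assumes that the head type ends at the first dash and that every character after it is one subfeature.

```python
def split_tag(tag):
    """Split a tag into its head type and a list of one-character subfeatures.

    Assumption: the first dash separates the head type from the subfeatures.
    Tags without a dash are returned with an empty subfeature list.
    """
    head, _, features = tag.partition("-")
    return head, list(features)


if __name__ == "__main__":
    # The example tag from the text: head type N, five subfeatures.
    print(split_tag("N-NSDIC"))
```

For the example tag, the head type is `N` and the subfeatures are `N`, `S`, `D`, `I` and `C`, in that order.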

Tags with postdashial subfeatures

Nouns

# Category/Feature Symbol – semantics
1 Word class N noun, NPR proper noun
2 Number S plural (singular is unmarked)
3 Case N nominative, A accusative, D dative, G genitive
Examples N-G noun, singular, genitive; NS-N noun, plural, nominative; NPR-A proper noun, singular, accusative; NPRS-D proper noun, plural, dative
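As a rough illustration of how the noun table reads, the following Python sketch decodes the four noun head types and the four case symbols listed above. The names `NOUN_HEADS`, `CASES` and `describe_noun_tag` are hypothetical, and the sketch assumes that singular is simply the absence of S, as the examples suggest.

```python
# Head types from the noun table, mapped to (word class, number).
NOUN_HEADS = {
    "N": ("noun", "singular"),
    "NS": ("noun", "plural"),
    "NPR": ("proper noun", "singular"),
    "NPRS": ("proper noun", "plural"),
}

# Case symbols from the noun table.
CASES = {
    "N": "nominative",
    "A": "accusative",
    "D": "dative",
    "G": "genitive",
}


def describe_noun_tag(tag):
    """Return a readable description of a noun tag such as 'NS-N'."""
    head, _, case = tag.partition("-")
    if head not in NOUN_HEADS:
        raise ValueError(f"not a noun head type: {head!r}")
    if case not in CASES:
        raise ValueError(f"unknown case symbol: {case!r}")
    word_class, number = NOUN_HEADS[head]
    return f"{word_class}, {number}, {CASES[case]}"


if __name__ == "__main__":
    for tag in ("N-G", "NS-N", "NPR-A", "NPRS-D"):
        print(tag, "=", describe_noun_tag(tag))
```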

Adjectives

# Category/Feature Symbol – semantics
1 Word class ADJ adjective, ADJR adjective, comparative, ADJS adjective, superlative
2 Case N nominative, A accusative, D dative, G genitive
Examples ADJ-N adjective, nominative; ADJR-D adjective, comparative, dative; ADJS-G adjective, superlative, genitive

Pronouns

# Category/Feature Symbol – semantics
1 Word class PRO pronoun
2 Case N nominative, A accusative, D dative, G genitive
Examples PRO-A pronoun, accusative; PRO-D pronoun, dative

Article (determiner)

# Category/Feature Symbol – semantics
1 Word class D determiner
2 Case N nominative, A accusative, D dative, G genitive
Examples D-D determiner, dative; D-G determiner, genitive

Numbers

# Category/Feature Symbol – semantics
1 Word class NUM numeral
2 Case N nominative, A accusative, D dative, G genitive
Examples NUM-N numeral, nominative; NUM-G numeral, genitive
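Adjectives, pronouns, determiners and numerals all follow the same pattern in the tables above: a head type followed by a single case symbol. A minimal Python sketch of a shared decoder might look like this; `CASE_ONLY_HEADS` and `describe_case_tag` are hypothetical names chosen for illustration.

```python
# Head types whose only subfeature is case, taken from the tables above.
CASE_ONLY_HEADS = {
    "ADJ": "adjective",
    "ADJR": "adjective, comparative",
    "ADJS": "adjective, superlative",
    "PRO": "pronoun",
    "D": "determiner",
    "NUM": "numeral",
}

CASES = {
    "N": "nominative",
    "A": "accusative",
    "D": "dative",
    "G": "genitive",
}


def describe_case_tag(tag):
    """Describe a tag of the form HEAD-CASE, e.g. 'ADJS-G' or 'D-D'."""
    head, _, case = tag.partition("-")
    if head not in CASE_ONLY_HEADS:
        raise ValueError(f"unknown head type: {head!r}")
    if case not in CASES:
        raise ValueError(f"unknown case symbol: {case!r}")
    return f"{CASE_ONLY_HEADS[head]}, {CASES[case]}"


if __name__ == "__main__":
    for tag in ("ADJ-N", "PRO-A", "D-D", "NUM-G"):
        print(tag, "=", describe_case_tag(tag))
```

Note that the symbol D is ambiguous on its own: before the dash it is the determiner head type, after the dash it is the dative case. Splitting on the dash first keeps the two readings apart.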

Verbs

(see table below for passive participle (VAN) and past participle (VBN))

# Category/Feature Symbol – semantics
1 Word class VB verb; BE VERA (BE verb); DO GERA (DO verb); HV HAFA (HAVE verb); MD modal verb; RD VERÐA (WILL BE/BECOME)
2 Tense P present, D past
3 Mood ZZZ imperative, I indicative, S subjunctive
Examples


# Category/Feature Symbol – semantics
1 Word class VAN–verb, past participle
2 Tense D–past
3 Voice A–active, M–middle
4 Gender M–masculine, F–feminine, N–neuter
5 Number S–singular, P–plural
6 Case N–nominative, A–accusative, D–dative, G–genitive

Prepositions

# Category/Feature Symbol – semantics
1 Word class P–preposition
2 Case governed A–governs accusative, D–governs dative, G–governs genitive

Adverbs

# Category/Feature Symbol – semantics
1 Word class ADV–adverb
2 Category N–normal, I–exclamation, A–governs accusative, D–governs dative (like nærri því 'almost', see ADVP), G–governs genitive
3 Degree C–comparative, S–superlative

Simple Tags

  • CONJ - conjunction
  • FOREIGN - foreign word
  • NEG - negation (ekki, eigi, ei)
  • TO - infinitival marker, 'to'
  • X - unanalyzed word
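
Because the simple tags carry no subfeatures, they can be handled with a plain lookup. The following Python sketch is only an illustration; `SIMPLE_TAGS` and `describe_simple_tag` are hypothetical names.

```python
# The simple tags listed above, with their meanings.
SIMPLE_TAGS = {
    "CONJ": "conjunction",
    "FOREIGN": "foreign word",
    "NEG": "negation",
    "TO": "infinitival marker",
    "X": "unanalyzed word",
}


def describe_simple_tag(tag):
    """Return the meaning of a simple tag, or None if the tag is not in the list."""
    return SIMPLE_TAGS.get(tag)


if __name__ == "__main__":
    print(describe_simple_tag("NEG"))
```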