Difference between revisions of "Tagset"

From Icelandic Parsed Historical Corpus (IcePaHC)
(Verbs)
 
(9 intermediate revisions by 2 users not shown)
Line 1: Line 1:
This is the head level tagset used in the Icelandic Treebank. The tagset is based on the [[IFD Tagset]]. Each head is assigned a tag (N-NSDIC in the example below):
+
This is the head level tagset used in the IcePaHC.
 
+
<synttree>[NP[N-NSDIC[barni]]]</synttree>
+
 
+
In the example ''barni'' is a dative form of the Icelandic word for 'child'. The first part of the tag, before the dash, always represents the [[Head Types|head type]] (roughly 'word class'), e.g. N for '''N'''oun. The extension of the above tag, NSDIC, means: '''N'''euter, '''S'''ingular, '''D'''ative, '''I'''ndefinite (no suffixed article), '''C'''ommon Noun. The predashial [[Head Types|head type]] can be one or more characters while the postdashial subfeatures are always one character per feature.
+
  
 
==Tags with postdashial subfeatures==
 
==Tags with postdashial subfeatures==
Line 60: Line 56:
 
|1  ||  Word class ||      '''NUM''' numeral
 
|1  ||  Word class ||      '''NUM''' numeral
 
|-
 
|-
|3 ||  Case    ||    '''N''' nominative, '''A''' accusative, '''D''' dative, '''G''' genitive   
+
|2 ||  Case    ||    '''N''' nominative, '''A''' accusative, '''D''' dative, '''G''' genitive   
 
|-
 
|-
 
|  ||  Examples    ||      '''NUM-N''' numeral, nominative; '''NUM-G''' numeral, genitive
 
|  ||  Examples    ||      '''NUM-N''' numeral, nominative; '''NUM-G''' numeral, genitive
Line 74: Line 70:
 
|2  ||  Tense  ||    '''P''' present, '''D''' past
 
|2  ||  Tense  ||    '''P''' present, '''D''' past
 
|-
 
|-
|3  ||  Mood      ||      '''ZZZ''' imperative, '''I''' indicative, '''S''' subjunctive
+
|3  ||  Mood      ||      '''I''' imperative (no tense), '''I''' indicative, '''S''' subjunctive
 
|-
 
|-
|  ||  Examples    ||      '''VB''' verb, infinitive; '''MD''' modal, infinitive; '''VBPI''' verb, present, indicative; '''BEDS''' VERA, past, subjunctive
+
|  ||  Examples    ||      '''VBI''' verb, imperative; '''VB''' verb, infinitive; '''MD''' modal, infinitive; '''VBPI''' verb, present, indicative; '''BEDS''' VERA, past, subjunctive
 
|}
 
|}
  
Line 83: Line 79:
 
!# || Category/Feature || Symbol – semantics
 
!# || Category/Feature || Symbol – semantics
 
|-
 
|-
|1  ||  Word class  ||    passive participle: '''VAN''', '''BAN''', '''DAN''', '''HAN'''
+
|1  ||  Word class  ||    passive participle: '''[[VAN]]''', '''BAN''', '''DAN''', '''HAN'''
 
|-
 
|-
 
|2  ||  Case  ||          '''N''' nominative, '''A''' accusative, '''D''' dative, '''G''' genitive
 
|2  ||  Case  ||          '''N''' nominative, '''A''' accusative, '''D''' dative, '''G''' genitive
 +
|-
 
|  ||  Examples    ||      '''VAN-A''' verb, passive participle, accusative; '''DAN-D''' GERA, passive participle, dative; '''HAN-G''' HAFA, passive participle, genitive
 
|  ||  Examples    ||      '''VAN-A''' verb, passive participle, accusative; '''DAN-D''' GERA, passive participle, dative; '''HAN-G''' HAFA, passive participle, genitive
 
|}
 
|}
Line 95: Line 92:
 
|-
 
|-
 
|2  ||  Case  ||          '''N''' nominative, '''A''' accusative, '''D''' dative, '''G''' genitive
 
|2  ||  Case  ||          '''N''' nominative, '''A''' accusative, '''D''' dative, '''G''' genitive
 +
|-
 
|  ||  Examples    ||      '''VBN-A''' verb, past participle, accusative; '''DON-D''' GERA, past participle, dative; '''HVN-G''' HAFA, past participle, genitive
 
|  ||  Examples    ||      '''VBN-A''' verb, past participle, accusative; '''DON-D''' GERA, past participle, dative; '''HVN-G''' HAFA, past participle, genitive
 
|}
 
|}
Line 102: Line 100:
 
!# || Category/Feature || Symbol – semantics
 
!# || Category/Feature || Symbol – semantics
 
|-
 
|-
|1 ||    Word class  ||    '''P'''–preposition
+
|1 ||    Word class  ||    '''P''' preposition
|-
+
|2  ||  Case governed  ||  '''A'''–governs accusative, '''D'''–governs dative, '''G'''–governs genitive
+
 
|}
 
|}
  
Line 111: Line 107:
 
!# || Category/Feature || Symbol – semantics
 
!# || Category/Feature || Symbol – semantics
 
|-
 
|-
|1 ||    Word class  ||    '''ADV'''–adverb
+
|1 ||    Word class  ||    '''ADV'''–adverb, '''ADVR''' comparative, '''ADVS''' superlative
|-
+
|2  ||  Category  ||      '''N'''–normal, '''I'''–exclamation, '''A'''–governs accusative, '''D'''–governs dative (like ''nærri því'' 'almost', see [[ADVP]]), '''G'''–governs genitive
+
|-
+
|3  ||  Degree  ||        '''C'''–comparative, '''S'''–superlative
+
 
|}
 
|}
  
 
==Simple Tags==
 
==Simple Tags==
  
 +
*[[C]] - complementizer
 
*[[CONJ]] - conjunction
 
*[[CONJ]] - conjunction
*[[FOREIGN]] - foreign word
+
*[[FW]] - foreign word
 
*[[NEG]] - negation, ''ekki'', ''eigi'', ''ei''
 
*[[NEG]] - negation, ''ekki'', ''eigi'', ''ei''
 
*[[TO]] - infinitival marker ''að'', 'to'.
 
*[[TO]] - infinitival marker ''að'', 'to'.
 
*X - unanalyzed word
 
*X - unanalyzed word

Latest revision as of 20:57, 9 August 2010

This is the head level tagset used in the IcePaHC.

Tags with postdashial subfeatures

Nouns

# Category/Feature Symbol – semantics
1 Word class N noun, NPR proper noun
2 Number S plural (singular is unmarked)
3 Case N nominative, A accusative, D dative, G genitive
Examples N-G noun, singular, genitive; NS-N noun, plural, nominative; NPR-A proper noun, singular, accusative; NPRS-D proper noun, plural, dative
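
The noun table above can be read as a small pattern: word class, an optional S for plural, a dash, and one case letter. The following is a minimal sketch of a decoder built only from that table; the function name `describe_noun_tag` and the regular expression are illustrative assumptions, not part of IcePaHC or its tools.

```python
import re

# Assumed pattern, derived only from the noun table above:
# word class (N or NPR), optional S for plural, a dash, one case letter.
NOUN_TAG = re.compile(r"^(NPR|N)(S?)-([NADG])$")

WORD_CLASSES = {"N": "noun", "NPR": "proper noun"}
CASES = {"N": "nominative", "A": "accusative", "D": "dative", "G": "genitive"}


def describe_noun_tag(tag):
    """Return a readable gloss for a noun tag, or None if it does not match."""
    match = NOUN_TAG.match(tag)
    if match is None:
        return None
    word_class, plural, case = match.groups()
    # Singular is unmarked; only plural carries the S.
    number = "plural" if plural else "singular"
    return f"{WORD_CLASSES[word_class]}, {number}, {CASES[case]}"


if __name__ == "__main__":
    for tag in ["N-G", "NS-N", "NPR-A", "NPRS-D"]:
        print(tag, "=", describe_noun_tag(tag))
```

Tags of other word classes, such as NUM-N, do not match this pattern, so the function returns None for them.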

Adjectives

# Category/Feature Symbol – semantics
1 Word class ADJ adjective, ADJR adjective, comparative, ADJS adjective, superlative
2 Case N nominative, A accusative, D dative, G genitive
Examples ADJ-N adjective, nominative; ADJR-D adjective, comparative, dative; ADJS-G adjective, superlative, genitive

Pronouns

# Category/Feature Symbol – semantics
1 Word class PRO pronoun
2 Case N nominative, A accusative, D dative, G genitive
Examples PRO-A pronoun, accusative; PRO-D pronoun, dative

Article (determiner)

# Category/Feature Symbol – semantics
1 Word class D determiner
2 Case N nominative, A accusative, D dative, G genitive
Examples D-D determiner, dative; D-G determiner, genitive

Numbers

# Category/Feature Symbol – semantics
1 Word class NUM numeral
2 Case N nominative, A accusative, D dative, G genitive
Examples NUM-N numeral, nominative; NUM-G numeral, genitive

Verbs

(see the tables below for passive participle (VAN) and past participle (VBN))

# Category/Feature Symbol – semantics
1 Word class VB verb; BE VERA (BE verb); DO GERA (DO verb); HV HAFA (HAVE verb); MD modal verb; RD VERÐA (WILL BE/BECOME)
2 Tense P present, D past
3 Mood I imperative (used without a tense letter), I indicative (follows the tense letter), S subjunctive
Examples VBI verb, imperative; VB verb, infinitive; MD modal, infinitive; VBPI verb, present, indicative; BEDS VERA, past, subjunctive
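
Because I marks both imperative and indicative, a verb tag can only be decoded by checking whether a tense letter precedes it. The following is a rough sketch of that logic, based only on the verb table and its examples; the name `describe_verb_tag` and the pattern are hypothetical, and participle tags are deliberately not covered.

```python
import re

# Assumed pattern, derived from the verb table above:
# word class, then either tense + mood, or a bare I (imperative), or nothing (infinitive).
VERB_TAG = re.compile(r"^(VB|BE|DO|HV|MD|RD)(?:([PD])([IS])|(I))?$")

VERB_CLASSES = {
    "VB": "verb",
    "BE": "VERA",
    "DO": "GERA",
    "HV": "HAFA",
    "MD": "modal",
    "RD": "VERÐA",
}
TENSES = {"P": "present", "D": "past"}
MOODS = {"I": "indicative", "S": "subjunctive"}


def describe_verb_tag(tag):
    """Return a readable gloss for a verb tag, or None if it does not match."""
    match = VERB_TAG.match(tag)
    if match is None:
        return None
    word_class, tense, mood, imperative = match.groups()
    parts = [VERB_CLASSES[word_class]]
    if imperative:
        # I with no tense letter before it means imperative.
        parts.append("imperative")
    elif tense:
        # I after a tense letter means indicative.
        parts += [TENSES[tense], MOODS[mood]]
    else:
        # A bare word class is the infinitive.
        parts.append("infinitive")
    return ", ".join(parts)


if __name__ == "__main__":
    for tag in ["VBI", "VB", "MD", "VBPI", "BEDS"]:
        print(tag, "=", describe_verb_tag(tag))
```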


# Category/Feature Symbol – semantics
1 Word class passive participle: VAN, BAN, DAN, HAN
2 Case N nominative, A accusative, D dative, G genitive
Examples VAN-A verb, passive participle, accusative; DAN-D GERA, passive participle, dative; HAN-G HAFA, passive participle, genitive

# Category/Feature Symbol – semantics
1 Word class past participle: VBN, BEN, DON, HVN, RDN
2 Case N nominative, A accusative, D dative, G genitive
Examples VBN-A verb, past participle, accusative; DON-D GERA, past participle, dative; HVN-G HAFA, past participle, genitive
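
The two participle tables share one shape: a three-letter word class followed by a dash and a case letter. A minimal lookup sketch follows. The glosses for BAN, BEN and RDN are inferred from the verb word classes BE and RD rather than from the examples, and the sketch assumes the case suffix is always present, which the tables do not state. The name `describe_participle_tag` is hypothetical.

```python
# Word classes from the two participle tables above. Glosses for BAN, BEN
# and RDN are assumptions inferred from the verb classes BE and RD.
PARTICIPLE_CLASSES = {
    "VAN": ("verb", "passive participle"),
    "BAN": ("VERA", "passive participle"),
    "DAN": ("GERA", "passive participle"),
    "HAN": ("HAFA", "passive participle"),
    "VBN": ("verb", "past participle"),
    "BEN": ("VERA", "past participle"),
    "DON": ("GERA", "past participle"),
    "HVN": ("HAFA", "past participle"),
    "RDN": ("VERÐA", "past participle"),
}
CASES = {"N": "nominative", "A": "accusative", "D": "dative", "G": "genitive"}


def describe_participle_tag(tag):
    """Return a readable gloss for a participle tag, or None if it is not one."""
    head, _, case = tag.partition("-")
    # Assumption: a case letter after the dash is required.
    if head not in PARTICIPLE_CLASSES or case not in CASES:
        return None
    verb, kind = PARTICIPLE_CLASSES[head]
    return f"{verb}, {kind}, {CASES[case]}"


if __name__ == "__main__":
    for tag in ["VAN-A", "DAN-D", "HAN-G", "VBN-A", "DON-D", "HVN-G"]:
        print(tag, "=", describe_participle_tag(tag))
```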

Prepositions

# Category/Feature Symbol – semantics
1 Word class P preposition

Adverbs

# Category/Feature Symbol – semantics
1 Word class ADV adverb, ADVR comparative, ADVS superlative

Simple Tags

  • C - complementizer
  • CONJ - conjunction
  • FW - foreign word
  • NEG - negation, ekki, eigi, ei
  • TO - infinitival marker að, 'to'.
  • X - unanalyzed word