Difference between revisions of "Tagset"

From Icelandic Parsed Historical Corpus (IcePaHC)
(Article (determiner))
 
(13 intermediate revisions by 2 users not shown)
Line 1: Line 1:
This is the head level tagset used in the Icelandic Treebank. The tagset is based on the [[IFD Tagset]]. Each head is assigned a tag (N-NSDIC in the example below):
+
This is the head level tagset used in the IcePaHC.
 
+
<synttree>[NP[N-NSDIC[barni]]]</synttree>
+
 
+
In the example ''barni'' is a dative form of the Icelandic word for 'child'. The first part of the tag, before the dash, always represents the [[Head Types|head type]] (roughly 'word class'), e.g. N for '''N'''oun. The extension of the above tag, NSDIC, means: '''N'''euter, '''S'''ingular, '''D'''ative, '''I'''ndefinite (no suffixed article), '''C'''ommon Noun. The predashial [[Head Types|head type]] can be one or more characters while the postdashial subfeatures are always one character per feature.
+
  
 
==Tags with postdashial subfeatures==
 
==Tags with postdashial subfeatures==
Line 58: Line 54:
 
!# || Category/Feature || Symbol – semantics
 
!# || Category/Feature || Symbol – semantics
 
|-
 
|-
|1  ||  Word class ||      '''NUM'''–numeral
+
|1  ||  Word class ||      '''NUM''' numeral
 
|-
 
|-
|2   || Category    ||    '''P'''-(grammatical cardinal numeral, i.e. not an ordinal???), '''F'''-percentage (fraction), '''O'''-other
+
|2  ||   Case     ||   '''N''' nominative, '''A''' accusative, '''D''' dative, '''G''' genitive 
|-
+
|3  ||  Gender      ||    '''M'''–masculine, '''F'''–feminine, '''N'''–neuter
+
|-
+
|4  ||  Number      ||    '''S'''–singular, '''P'''–plural
+
 
|-
 
|-
||| Case        || '''N'''–nominative, '''A'''–accusative, '''D'''-dative, '''G'''–genitive
+
| ||   Examples    ||     '''NUM-N''' numeral, nominative; '''NUM-G''' numeral, genitive
 
|}
 
|}
  
 
====Verbs====
 
====Verbs====
(see table below for past participle)
+
(see table below for passive participle (VAN) and past participle (VBN))
 
{|  
 
{|  
 
!# || Category/Feature || Symbol – semantics
 
!# || Category/Feature || Symbol – semantics
 
|-
 
|-
|1  ||  Word class  ||    '''VBP'''–verb present tense, '''VBD'''-verb past tense, '''VB'''-infinitive, '''VBI'''-imperative, '''VBN'''-perfect participle, '''VAG'''-present participle
+
|1  ||  Word class  ||    '''VB''' verb; '''BE''' VERA (BE verb); '''DO''' GERA (DO verb); '''HV''' HAFA (HAVE verb); '''MD''' modal verb; '''RD''' VERÐA (WILL BE/BECOME)
 
|-
 
|-
|2   ||  Mood      ||       '''T'''–infinitive, '''M'''–imperative, '''I'''–indicative, '''S'''–subjunctive, '''U'''–supine, '''P'''–present participle, '''D'''-past participle
+
|2 ||   Tense ||   '''P''' present, '''D''' past
 
|-
 
|-
|3  ||   Voice ||   '''A'''–active, '''M'''–middle
+
|3  ||  Mood      ||       '''I''' imperative (no tense), '''I''' indicative, '''S''' subjunctive
 
|-
 
|-
|4 ||   Person  ||   '''1'''–1st person, '''2'''–2nd person, '''3'''–3rd person,
+
|  ||   Examples    ||     '''VBI''' verb, imperative; '''VB''' verb, infinitive; '''MD''' modal, infinitive; '''VBPI''' verb, present, indicative; '''BEDS''' VERA, past, subjunctive
|-
+
|5  ||  Number  ||    '''S'''–singular, '''P'''–plural
+
|-
+
|6  ||  Tense  ||    '''P'''–present, '''D'''–past
+
 
|}
 
|}
  
Line 91: Line 79:
 
!# || Category/Feature || Symbol – semantics
 
!# || Category/Feature || Symbol – semantics
 
|-
 
|-
|1  ||  Word class  ||    '''VAN'''-verb, past participle
+
|1  ||  Word class  ||    passive participle: '''[[VAN]]''', '''BAN''', '''DAN''', '''HAN'''
 
|-
 
|-
|2   ||  Tense      ||   '''D'''–past
+
|2   ||  Case  ||         '''N''' nominative, '''A''' accusative, '''D''' dative, '''G''' genitive
 
|-
 
|-
||| Voice        ||   '''A'''–active, '''M'''–middle
+
| ||   Examples    ||     '''VAN-A''' verb, passive participle, accusative; '''DAN-D''' GERA, passive participle, dative; '''HAN-G''' HAFA, passive participle, genitive
 +
|}
 +
 
 +
{|
 +
!# || Category/Feature || Symbol – semantics
 
|-
 
|-
|||  Gender    ||     '''M'''–masculine, '''F'''–feminine, '''N'''–neuter
+
|||   Word class ||   past participle: '''VBN''', '''BEN''', '''DON''', '''HVN''', '''RDN'''
 
|-
 
|-
|5   ||  Number   ||       '''S'''–singular, '''P'''–plural
+
|2   ||  Case   ||         '''N''' nominative, '''A''' accusative, '''D''' dative, '''G''' genitive
 
|-
 
|-
||| Case   ||         '''N'''–nominative, '''A'''–accusative, '''D'''–dative, '''G'''–genitive
+
| ||  Examples    ||     '''VBN-A''' verb, past participle, accusative; '''DON-D''' GERA, past participle, dative; '''HVN-G''' HAFA, past participle, genitive
 
|}
 
|}
  
Line 108: Line 100:
 
!# || Category/Feature || Symbol – semantics
 
!# || Category/Feature || Symbol – semantics
 
|-
 
|-
|1 ||    Word class  ||    '''P'''–preposition
+
|1 ||    Word class  ||    '''P''' preposition
|-
+
|2  ||  Case governed  ||  '''A'''–governs accusative, '''D'''–governs dative, '''G'''–governs genitive
+
 
|}
 
|}
  
Line 117: Line 107:
 
!# || Category/Feature || Symbol – semantics
 
!# || Category/Feature || Symbol – semantics
 
|-
 
|-
|1 ||    Word class  ||    '''ADV'''–adverb
+
|1 ||    Word class  ||    '''ADV'''–adverb, '''ADVR''' comparative, '''ADVS''' superlative
|-
+
|2  ||  Category  ||      '''N'''–normal, '''I'''–exclamation, '''A'''–governs accusative, '''D'''–governs dative (like ''nærri því'' 'almost', see [[ADVP]]), '''G'''–governs genitive
+
|-
+
|3  ||  Degree  ||        '''C'''–comparative, '''S'''–superlative
+
 
|}
 
|}
  
 
==Simple Tags==
 
==Simple Tags==
  
 +
*[[C]] - complementizer
 
*[[CONJ]] - conjunction
 
*[[CONJ]] - conjunction
*[[FOREIGN]] - foreign word
+
*[[FW]] - foreign word
 
*[[NEG]] - negation, ''ekki'', ''eigi'', ''ei''
 
*[[NEG]] - negation, ''ekki'', ''eigi'', ''ei''
 
*[[TO]] - infinitival marker ''að'', 'to'.
 
*[[TO]] - infinitival marker ''að'', 'to'.
 
*X - unanalyzed word
 
*X - unanalyzed word

Latest revision as of 20:57, 9 August 2010

This is the head level tagset used in the IcePaHC.

Tags with postdashial subfeatures

Nouns

# Category/Feature Symbol – semantics
1 Word class N noun, NPR proper noun
2 Number S plural (singular is unmarked)
3 Case N nominative, A accusative, D dative, G genitive
Examples N-G noun, singular, genitive; NS-N noun, plural, nominative; NPR-A proper noun, singular, accusative; NPRS-D proper noun, plural, dative
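
The noun tag layout in the table above can be decoded mechanically. The following is a minimal sketch and not part of any IcePaHC tooling; the function name `parse_noun_tag` and the keys of the returned dictionary are assumptions made for illustration only.

```python
import re

# Case symbols as listed in the noun table.
CASES = {"N": "nominative", "A": "accusative", "D": "dative", "G": "genitive"}

# Word class (NPR or N), optional plural marker S, a dash, one case symbol.
NOUN_TAG = re.compile(r"^(NPR|N)(S?)-([NADG])$")


def parse_noun_tag(tag):
    """Split a noun tag such as 'NPRS-D' into its features (hypothetical helper)."""
    match = NOUN_TAG.match(tag)
    if match is None:
        raise ValueError(f"not a noun tag: {tag!r}")
    word_class, plural, case = match.groups()
    return {
        "word_class": "proper noun" if word_class == "NPR" else "noun",
        # Singular has no symbol of its own; only plural is marked with S.
        "number": "plural" if plural else "singular",
        "case": CASES[case],
    }


print(parse_noun_tag("NS-N"))
```

Because the number feature sits before the dash and the case after it, a single anchored pattern is enough to separate the two.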

Adjectives

# Category/Feature Symbol – semantics
1 Word class ADJ adjective, ADJR adjective, comparative, ADJS adjective, superlative
2 Case N nominative, A accusative, D dative, G genitive
Examples ADJ-N adjective, nominative; ADJR-D adjective, comparative, dative; ADJS-G adjective, superlative, genitive

Pronouns

# Category/Feature Symbol – semantics
1 Word class PRO pronoun
2 Case N nominative, A accusative, D dative, G genitive
Examples PRO-A pronoun, accusative; PRO-D pronoun, dative

Article (determiner)

# Category/Feature Symbol – semantics
1 Word class D determiner
2 Case N nominative, A accusative, D dative, G genitive
Examples D-D determiner, dative; D-G determiner, genitive

Numbers

# Category/Feature Symbol – semantics
1 Word class NUM numeral
2 Case N nominative, A accusative, D dative, G genitive
Examples NUM-N numeral, nominative; NUM-G numeral, genitive

Verbs

(see table below for passive participle (VAN) and past participle (VBN))

# Category/Feature Symbol – semantics
1 Word class VB verb; BE VERA (BE verb); DO GERA (DO verb); HV HAFA (HAVE verb); MD modal verb; RD VERÐA (WILL BE/BECOME)
2 Tense P present, D past
3 Mood I imperative (no tense), I indicative, S subjunctive
Examples VBI verb, imperative; VB verb, infinitive; MD modal, infinitive; VBPI verb, present, indicative; BEDS VERA, past, subjunctive
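
Verb tags carry no dash: tense and mood symbols are appended directly to the word class. The sketch below decodes only the combinations illustrated in the examples above (bare infinitive, imperative, and tense plus mood). The name `parse_verb_tag` and the result layout are assumptions, and rejecting other combinations is a choice of this sketch, not a statement about the corpus.

```python
import re

# Word classes, tenses, and moods as listed in the verb table.
BASES = {"VB": "verb", "BE": "VERA", "DO": "GERA",
         "HV": "HAFA", "MD": "modal verb", "RD": "VERÐA"}
TENSES = {"P": "present", "D": "past"}
MOODS = {"I": "indicative", "S": "subjunctive"}

# Two-letter word class, optional tense symbol, optional mood symbol.
VERB_TAG = re.compile(r"^(VB|BE|DO|HV|MD|RD)([PD]?)([IS]?)$")


def parse_verb_tag(tag):
    """Decode a finite or infinitive verb tag (hypothetical helper)."""
    match = VERB_TAG.match(tag)
    if match is None:
        raise ValueError(f"not a verb tag covered here: {tag!r}")
    base, tense, mood = match.groups()
    if not tense and not mood:
        # Bare word class, e.g. VB or MD: infinitive.
        return {"verb": BASES[base], "tense": None, "mood": "infinitive"}
    if not tense and mood == "I":
        # I without a tense symbol: imperative.
        return {"verb": BASES[base], "tense": None, "mood": "imperative"}
    if tense and mood:
        return {"verb": BASES[base], "tense": TENSES[tense], "mood": MOODS[mood]}
    # Combinations not shown in the table's examples are rejected by this sketch.
    raise ValueError(f"combination not illustrated in the table: {tag!r}")


print(parse_verb_tag("BEDS"))
```

The symbol I is shared by imperative and indicative, so the presence of a tense symbol is what tells the two apart.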


# Category/Feature Symbol – semantics
1 Word class passive participle: VAN, BAN, DAN, HAN
2 Case N nominative, A accusative, D dative, G genitive
Examples VAN-A verb, passive participle, accusative; DAN-D GERA, passive participle, dative; HAN-G HAFA, passive participle, genitive

# Category/Feature Symbol – semantics
1 Word class past participle: VBN, BEN, DON, HVN, RDN
2 Case N nominative, A accusative, D dative, G genitive
Examples VBN-A verb, past participle, accusative; DON-D GERA, past participle, dative; HVN-G HAFA, past participle, genitive
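
The two participle tables share one shape: a three-letter word class followed by a dash and a case symbol. As a rough sketch, the tags can be classified with a set lookup. The name `parse_participle_tag` is hypothetical, and accepting a tag with no dash (reported with no case) is an assumption of this sketch rather than something the tables state.

```python
# Case symbols and participle word classes as listed in the tables above.
CASES = {"N": "nominative", "A": "accusative", "D": "dative", "G": "genitive"}
PASSIVE_PARTICIPLES = {"VAN", "BAN", "DAN", "HAN"}
PAST_PARTICIPLES = {"VBN", "BEN", "DON", "HVN", "RDN"}


def parse_participle_tag(tag):
    """Classify a participle tag such as 'DAN-D' (hypothetical helper)."""
    word_class, dash, case = tag.partition("-")
    if word_class in PASSIVE_PARTICIPLES:
        kind = "passive participle"
    elif word_class in PAST_PARTICIPLES:
        kind = "past participle"
    else:
        raise ValueError(f"not a participle tag: {tag!r}")
    if dash and case not in CASES:
        raise ValueError(f"unknown case symbol in {tag!r}")
    return {
        "word_class": word_class,
        "type": kind,
        # Assumption: a tag without a dash is accepted and carries no case.
        "case": CASES[case] if dash else None,
    }


print(parse_participle_tag("DAN-D"))
```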

Prepositions

# Category/Feature Symbol – semantics
1 Word class P preposition

Adverbs

# Category/Feature Symbol – semantics
1 Word class ADV adverb, ADVR adverb, comparative, ADVS adverb, superlative

Simple Tags

  • C - complementizer
  • CONJ - conjunction
  • FW - foreign word
  • NEG - negation, ekki, eigi, ei
  • TO - infinitival marker að, 'to'.
  • X - unanalyzed word