Difference between revisions of "Tagset"

From Icelandic Parsed Historical Corpus (IcePaHC)
Jump to: navigation, search
(Adjectives)
(Tags with postdashial subfeatures)
Line 6: Line 6:
  
 
==Tags with postdashial subfeatures==
 
==Tags with postdashial subfeatures==
===Nouns===
+
====Nouns====
 
{|
 
{|
 
!Char# || Category/Feature || Symbol – semantics
 
!Char# || Category/Feature || Symbol – semantics
Line 23: Line 23:
 
|}
 
|}
  
===Adjectives===
+
====Adjectives====
 
{|
 
{|
 
!Char# || Category/Feature || Symbol – semantics
 
!Char# || Category/Feature || Symbol – semantics
Line 40: Line 40:
 
|}
 
|}
  
===Pronouns===
+
====Pronouns====
 
{|  
 
{|  
 
!Char# || Category/Feature || Symbol – semantics
 
!Char# || Category/Feature || Symbol – semantics
Line 55: Line 55:
 
|}
 
|}
  
===Article (determiner)===
+
====Article (determiner)====
 
{|  
 
{|  
 
!Char# || Category/Feature || Symbol – semantics
 
!Char# || Category/Feature || Symbol – semantics
Line 68: Line 68:
 
|}
 
|}
  
===Numbers===
+
====Numbers====
 
{|  
 
{|  
 
!Char# || Category/Feature || Symbol – semantics
 
!Char# || Category/Feature || Symbol – semantics
Line 83: Line 83:
 
|}
 
|}
  
===Verbs===
+
====Verbs====
 
{|  
 
{|  
 
!Char# || Category/Feature || Symbol – semantics
 
!Char# || Category/Feature || Symbol – semantics
Line 117: Line 117:
 
|}
 
|}
  
===Prepositions===
+
====Prepositions====
 
{|  
 
{|  
 
!Char# || Category/Feature || Symbol – semantics
 
!Char# || Category/Feature || Symbol – semantics
Line 126: Line 126:
 
|}
 
|}
  
===Adverbs===
+
====Adverbs====
 
{|  
 
{|  
 
!Char# || Category/Feature || Symbol – semantics
 
!Char# || Category/Feature || Symbol – semantics

Revision as of 21:03, 28 January 2010

This is the head level tagset used in the Icelandic Treebank. The tagset is based on the IFD Tagset. Each head is assigned a tag (N-NSDIC in the example below):

<synttree>[NP[N-NSDIC[barni]]]</synttree>

In the example barni is a dative form of the Icelandic word for 'child'. The first part, before the dash, always represents the word class, e.g. N for Noun. The extension of the above tag, NSDIC, means: Neuter, Singular, Dative, Indefinite (no suffixed article), Common Noun. The predashial word class can be one or more characters while the postdashial subfeatures are always one character per feature.

Tags with postdashial subfeatures

Nouns

Char# Category/Feature Symbol – semantics
1 Word class N–noun
2 Gender M–masculine, F–feminine, N–neuter, X–unspecified
3 Number S–singular, P–plural
4 Case N–nominative, A–accusative, D–dative, G–genitive
5 Article I-without suffixed article (indefnite), D–with suffixed definite article (definite)
6 Proper/Common C-common noun, P–person name, L-place name, O–other proper name

Adjectives

Char# Category/Feature Symbol – semantics
1 Word class ADJ–adjective
2 Gender M–masculine, F–feminine, N–neuter, X-unspecified
3 Number S–singular, P–plural
4 Case N–nominative, A–accusative, D–dative, G–genitive
5 Declension S–strong declension, W–weak declension, X–indeclineable
6 Degree P–positive, C–comparative, S–superlative

Pronouns

Char# Category/Feature Symbol – semantics
1 Word class PRO–pronoun
2 Subcategory D–demonstrative, B–indefinite demonstrative (Icel. 'óákveðið ábendingarfornafn'), Q–possessive, X–indefinite (Icel. 'óákveðið'), P–personal, W–interrogative, R–relative
3 Gender/Person M–masculine, F–feminine, N–neuter/1–1st person, 2–2nd person
4 Number S–singular, P–plural
5 Case N–nominative, A–accusative, D–dative, G–genitive

Article (determiner)

Char# Category/Feature Symbol – semantics
1 Word class D–article (determiner)
2 Gender M-masculine, F–feminine, N–neuter
3 Number S–singular, P–plural
4 Case N–nominative, A–accusative, D–dative, G–genitive

Numbers

Char# Category/Feature Symbol – semantics
1 Word class NUM–numeral
2 Category P-(málfr. frumtala, þ.e. ekki raðtala???), F-percentage (fraction), O-other
3 Gender M–masculine, F–feminine, N–neuter
4 Number S–singular, P–plural
5 Case N–nominative, A–accusative, D-dative, G–genitive

Verbs

Char# Category/Feature Symbol – semantics
1 Word class V–verb (except for past participle)
2 Mood T–infinitive, M–imperative, I–indicative, S–subjunctive, U–supine, P–present participle, D-past participle
3 Voice A–active, M–middle
4 Person 1–1st person, 2–2nd person, 3–3rd person,
5 Number S–singular, P–plural
6 Tense P–present, D–past


Char# Category/Feature Symbol – semantics
1 Word class V–verb (past participle)
2 Tense D–past
3 Voice A–active, M–middle
4 Gender M–masculine, F–feminine, N–neuter
5 Number S–singular, P–plural
6 Case N–nominative, A–accusative, D–dative, G–genitive

Prepositions

Char# Category/Feature Symbol – semantics
1 Word class P–preposition
2 Case governed A–governs accusative, D–governs dative, G–governs genitive

Adverbs

Char# Category/Feature Symbol – semantics
1 Word class ADV–adverb
2 Category N–normal, I–exclamation
3 Degree C–comparative, S–superlative

Simple Tags

Char# Category/Feature Symbol – semantics
1 Word class CONJ–conjunction
2 Category I–sign of infinitive, R–relative conjunction,


Char# Category/Feature Symbol – semantics
1 Word class FOREIGN–foreign word


Char# Category/Feature Symbol – semantics
1 Word class X–unanalyzed word