Difference between revisions of "Tagset"
From Icelandic Parsed Historical Corpus (IcePaHC)
(→Adjectives) |
(→Tags with postdashial subfeatures) |
||
Line 6: | Line 6: | ||
==Tags with postdashial subfeatures== | ==Tags with postdashial subfeatures== | ||
− | ===Nouns=== | + | ====Nouns==== |
{| | {| | ||
!Char# || Category/Feature || Symbol – semantics | !Char# || Category/Feature || Symbol – semantics | ||
Line 23: | Line 23: | ||
|} | |} | ||
− | ===Adjectives=== | + | ====Adjectives==== |
{| | {| | ||
!Char# || Category/Feature || Symbol – semantics | !Char# || Category/Feature || Symbol – semantics | ||
Line 40: | Line 40: | ||
|} | |} | ||
− | ===Pronouns=== | + | ====Pronouns==== |
{| | {| | ||
!Char# || Category/Feature || Symbol – semantics | !Char# || Category/Feature || Symbol – semantics | ||
Line 55: | Line 55: | ||
|} | |} | ||
− | ===Article (determiner)=== | + | ====Article (determiner)==== |
{| | {| | ||
!Char# || Category/Feature || Symbol – semantics | !Char# || Category/Feature || Symbol – semantics | ||
Line 68: | Line 68: | ||
|} | |} | ||
− | ===Numbers=== | + | ====Numbers==== |
{| | {| | ||
!Char# || Category/Feature || Symbol – semantics | !Char# || Category/Feature || Symbol – semantics | ||
Line 83: | Line 83: | ||
|} | |} | ||
− | ===Verbs=== | + | ====Verbs==== |
{| | {| | ||
!Char# || Category/Feature || Symbol – semantics | !Char# || Category/Feature || Symbol – semantics | ||
Line 117: | Line 117: | ||
|} | |} | ||
− | ===Prepositions=== | + | ====Prepositions==== |
{| | {| | ||
!Char# || Category/Feature || Symbol – semantics | !Char# || Category/Feature || Symbol – semantics | ||
Line 126: | Line 126: | ||
|} | |} | ||
− | ===Adverbs=== | + | ====Adverbs==== |
{| | {| | ||
!Char# || Category/Feature || Symbol – semantics | !Char# || Category/Feature || Symbol – semantics |
Revision as of 21:03, 28 January 2010
This is the head level tagset used in the Icelandic Treebank. The tagset is based on the IFD Tagset. Each head is assigned a tag (N-NSDIC in the example below):
<synttree>[NP[N-NSDIC[barni]]]</synttree>
In the example barni is a dative form of the Icelandic word for 'child'. The first part, before the dash, always represents the word class, e.g. N for Noun. The extension of the above tag, NSDIC, means: Neuter, Singular, Dative, Indefinite (no suffixed article), Common Noun. The predashial word class can be one or more characters while the postdashial subfeatures are always one character per feature.
Contents
Tags with postdashial subfeatures
Nouns
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | N–noun |
2 | Gender | M–masculine, F–feminine, N–neuter, X–unspecified |
3 | Number | S–singular, P–plural |
4 | Case | N–nominative, A–accusative, D–dative, G–genitive |
5 | Article | I-without suffixed article (indefnite), D–with suffixed definite article (definite) |
6 | Proper/Common | C-common noun, P–person name, L-place name, O–other proper name |
Adjectives
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | ADJ–adjective |
2 | Gender | M–masculine, F–feminine, N–neuter, X-unspecified |
3 | Number | S–singular, P–plural |
4 | Case | N–nominative, A–accusative, D–dative, G–genitive |
5 | Declension | S–strong declension, W–weak declension, X–indeclineable |
6 | Degree | P–positive, C–comparative, S–superlative |
Pronouns
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | PRO–pronoun |
2 | Subcategory | D–demonstrative, B–indefinite demonstrative (Icel. 'óákveðið ábendingarfornafn'), Q–possessive, X–indefinite (Icel. 'óákveðið'), P–personal, W–interrogative, R–relative |
3 | Gender/Person | M–masculine, F–feminine, N–neuter/1–1st person, 2–2nd person |
4 | Number | S–singular, P–plural |
5 | Case | N–nominative, A–accusative, D–dative, G–genitive |
Article (determiner)
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | D–article (determiner) |
2 | Gender | M-masculine, F–feminine, N–neuter |
3 | Number | S–singular, P–plural |
4 | Case | N–nominative, A–accusative, D–dative, G–genitive |
Numbers
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | NUM–numeral |
2 | Category | P-(málfr. frumtala, þ.e. ekki raðtala???), F-percentage (fraction), O-other |
3 | Gender | M–masculine, F–feminine, N–neuter |
4 | Number | S–singular, P–plural |
5 | Case | N–nominative, A–accusative, D-dative, G–genitive |
Verbs
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | V–verb (except for past participle) |
2 | Mood | T–infinitive, M–imperative, I–indicative, S–subjunctive, U–supine, P–present participle, D-past participle |
3 | Voice | A–active, M–middle |
4 | Person | 1–1st person, 2–2nd person, 3–3rd person, |
5 | Number | S–singular, P–plural |
6 | Tense | P–present, D–past |
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | V–verb (past participle) |
2 | Tense | D–past |
3 | Voice | A–active, M–middle |
4 | Gender | M–masculine, F–feminine, N–neuter |
5 | Number | S–singular, P–plural |
6 | Case | N–nominative, A–accusative, D–dative, G–genitive |
Prepositions
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | P–preposition |
2 | Case governed | A–governs accusative, D–governs dative, G–governs genitive |
Adverbs
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | ADV–adverb |
2 | Category | N–normal, I–exclamation |
3 | Degree | C–comparative, S–superlative |
Simple Tags
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | CONJ–conjunction |
2 | Category | I–sign of infinitive, R–relative conjunction, |
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | FOREIGN–foreign word |
Char# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | X–unanalyzed word |