Difference between revisions of "Tagset"
From Icelandic Parsed Historical Corpus (IcePaHC)
(→Tags with postdashial subfeatures) |
(→Simple Tags) |
||
Line 140: | Line 140: | ||
{| | {| | ||
− | ! | + | !# || Category/Feature || Symbol – semantics |
|- | |- | ||
|1 || Word class || '''CONJ'''–conjunction | |1 || Word class || '''CONJ'''–conjunction | ||
Line 149: | Line 149: | ||
{| | {| | ||
− | ! | + | !# || Category/Feature || Symbol – semantics |
|- | |- | ||
|1 || Word class || '''FOREIGN'''–foreign word | |1 || Word class || '''FOREIGN'''–foreign word | ||
Line 156: | Line 156: | ||
{| | {| | ||
− | ! | + | !# || Category/Feature || Symbol – semantics |
|- | |- | ||
|1 || Word class || '''X'''–unanalyzed word | |1 || Word class || '''X'''–unanalyzed word | ||
|} | |} |
Revision as of 21:05, 28 January 2010
This is the head level tagset used in the Icelandic Treebank. The tagset is based on the IFD Tagset. Each head is assigned a tag (N-NSDIC in the example below):
<synttree>[NP[N-NSDIC[barni]]]</synttree>
In the example barni is a dative form of the Icelandic word for 'child'. The first part, before the dash, always represents the word class, e.g. N for Noun. The extension of the above tag, NSDIC, means: Neuter, Singular, Dative, Indefinite (no suffixed article), Common Noun. The predashial word class can be one or more characters while the postdashial subfeatures are always one character per feature.
Contents
Tags with postdashial subfeatures
Nouns
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | N–noun |
2 | Gender | M–masculine, F–feminine, N–neuter, X–unspecified |
3 | Number | S–singular, P–plural |
4 | Case | N–nominative, A–accusative, D–dative, G–genitive |
5 | Article | I-without suffixed article (indefnite), D–with suffixed definite article (definite) |
6 | Proper/Common | C-common noun, P–person name, L-place name, O–other proper name |
Adjectives
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | ADJ–adjective |
2 | Gender | M–masculine, F–feminine, N–neuter, X-unspecified |
3 | Number | S–singular, P–plural |
4 | Case | N–nominative, A–accusative, D–dative, G–genitive |
5 | Declension | S–strong declension, W–weak declension, X–indeclineable |
6 | Degree | P–positive, C–comparative, S–superlative |
Pronouns
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | PRO–pronoun |
2 | Subcategory | D–demonstrative, B–indefinite demonstrative (Icel. 'óákveðið ábendingarfornafn'), Q–possessive, X–indefinite (Icel. 'óákveðið'), P–personal, W–interrogative, R–relative |
3 | Gender/Person | M–masculine, F–feminine, N–neuter/1–1st person, 2–2nd person |
4 | Number | S–singular, P–plural |
5 | Case | N–nominative, A–accusative, D–dative, G–genitive |
Article (determiner)
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | D–article (determiner) |
2 | Gender | M-masculine, F–feminine, N–neuter |
3 | Number | S–singular, P–plural |
4 | Case | N–nominative, A–accusative, D–dative, G–genitive |
Numbers
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | NUM–numeral |
2 | Category | P-(málfr. frumtala, þ.e. ekki raðtala???), F-percentage (fraction), O-other |
3 | Gender | M–masculine, F–feminine, N–neuter |
4 | Number | S–singular, P–plural |
5 | Case | N–nominative, A–accusative, D-dative, G–genitive |
Verbs
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | V–verb (except for past participle) |
2 | Mood | T–infinitive, M–imperative, I–indicative, S–subjunctive, U–supine, P–present participle, D-past participle |
3 | Voice | A–active, M–middle |
4 | Person | 1–1st person, 2–2nd person, 3–3rd person, |
5 | Number | S–singular, P–plural |
6 | Tense | P–present, D–past |
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | V–verb (past participle) |
2 | Tense | D–past |
3 | Voice | A–active, M–middle |
4 | Gender | M–masculine, F–feminine, N–neuter |
5 | Number | S–singular, P–plural |
6 | Case | N–nominative, A–accusative, D–dative, G–genitive |
Prepositions
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | P–preposition |
2 | Case governed | A–governs accusative, D–governs dative, G–governs genitive |
Adverbs
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | ADV–adverb |
2 | Category | N–normal, I–exclamation |
3 | Degree | C–comparative, S–superlative |
Simple Tags
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | CONJ–conjunction |
2 | Category | I–sign of infinitive, R–relative conjunction, |
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | FOREIGN–foreign word |
# | Category/Feature | Symbol – semantics |
---|---|---|
1 | Word class | X–unanalyzed word |