The object pronoun ÞAÐ is always tagged PRO. The subject pronoun ÞAÐ is tagged as PRO in any syntactic context where it *never* disappears under subject-finite-verb inversion. In Icelandic, when the finite verb fronts over the subject under V-to-C movement in matrix clauses or embedded topicalizations, truly non-expletive ÞAÐ will still surface in subject position, but truly expletive ÞAÐ will disappear. In any syntactic context in which ÞAÐ would disappear under subject-verb inversion, it is tagged ES (in accordance with Caitlin Light's Early New High German corpus). Such contexts include, at least: weather expressions, impersonals, and existentials.
ÞAÐ is tagged ES even in contexts where there is inter-speaker or inter-text (or diachronic) variation with regard to whether it disappears.
See Expletives also, for *exp*, which is the empty category corresponding to "(ES ÞAÐ)". Note that when ÞAÐ disappears under verb-movement, "(NP-SBJ *exp*)" is only inserted in the sentence if there is no other possible subject (e.g. it is not inserted in subject-postposition constructions).
ÞAÐ meaning AÐ
When ÞAÐ means AÐ it is tagged C, lemmatized að, and parsed accordingly. Examples of this can be found in Oddur Gottskálksson's translation of the New Testament.
( (IP-MAT-SPE (CODE VS:VI_69J) (NP-SBJ (PRO-N vér-ég)) (VBPI (VBPI trúum-trúa) (CONJ og-og) (VBPI kennum-kenna)) (CP-THT-SPE (C það-að) (IP-SUB-SPE (NP-SBJ (PRO-N þú-þú)) (BEPI ert-vera) (NP-PRD (NPR-N Kristur-kristur) (, ,-,) (NP-PRN (N-N sonur-sonur) (NP-POS (NPR-G Guðs-guður) (VAG-G lifanda-lifandi)))))) (. .-.)))
Since this most frequently occurs in the Bible translation in our corpus, decisions about tagging ÞAÐ as C are sometimes supported by a comparison with other translations of the Bible. For example, the above quote from John 6:69 is as follows in King James Bible:
- And we believe and are sure that thou art that Christ, the Son of the living God.