ITU Validation Set

The ITU Validation Set is a dependency treebank of 300 sentences drawn from 3 different genres. Details of the data set are available here.

There are 2 versions available:

ITU Validation Set V1: Multi Word Expressions are expressed as single units in sentences. (download)

ITU Validation Set V2: Multi Word Expressions are expressed as separate units in sentences (connected by dependencies labeled "MWE"). (download)

METU-Sabancı Turkish Dependency Treebank

This is a new version in which Multi Word Expressions are split into multiple units and manually tagged. After the split, the newly introduced dependencies between MWE tokens are labeled with an "MWE" dependency label, and the remaining dependencies within each sentence are rearranged accordingly.
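As a rough illustration of what such a split involves, the sketch below expands single-unit MWE tokens into separate tokens and renumbers the heads of the remaining dependencies. It is not the procedure used to build the treebank, which was tagged manually. It assumes, purely for illustration, that the parts of an MWE are joined with an underscore in the token form, that each non-final part attaches to the following part with the "MWE" label, and that the final part inherits the original head and label. The function name `split_mwes` and the dictionary fields are hypothetical.

```python
def split_mwes(tokens, sep="_"):
    """Split single-unit MWE tokens into separate tokens.

    tokens: list of dicts with 1-based 'id', 'form', 'head' (0 = root), 'deprel'.
    Assumption (not from the treebank documentation): MWE parts are joined
    by `sep`, and each non-final part depends on the next part via "MWE".
    """
    # Pass 1: split forms and record the new id of each token's last part.
    new_last_id = {0: 0}  # the artificial root keeps id 0
    parts_per_token = []
    next_id = 1
    for tok in tokens:
        # Fall back to the raw form if splitting leaves nothing (e.g. form == "_").
        parts = [p for p in tok["form"].split(sep) if p] or [tok["form"]]
        parts_per_token.append(parts)
        next_id += len(parts)
        new_last_id[tok["id"]] = next_id - 1

    # Pass 2: emit the new tokens with renumbered ids and remapped heads.
    out = []
    for tok, parts in zip(tokens, parts_per_token):
        last = new_last_id[tok["id"]]
        first = last - len(parts) + 1
        for offset, part in enumerate(parts):
            new_id = first + offset
            if new_id < last:
                # Non-final MWE part: attach to the following part.
                out.append({"id": new_id, "form": part,
                            "head": new_id + 1, "deprel": "MWE"})
            else:
                # Final part: inherit the original dependency, with the
                # head remapped to the last part of the original head token.
                out.append({"id": new_id, "form": part,
                            "head": new_last_id[tok["head"]],
                            "deprel": tok["deprel"]})
    return out


# Placeholder sentence (not taken from the treebank): "w1_w2" is a two-part MWE.
sentence = [
    {"id": 1, "form": "w1_w2", "head": 2, "deprel": "SUBJECT"},
    {"id": 2, "form": "w3", "head": 0, "deprel": "ROOT"},
]
for tok in split_mwes(sentence):
    print(tok["id"], tok["form"], tok["head"], tok["deprel"])
```

Computing the id mapping in a first pass keeps the head remapping simple: every original head is looked up once, regardless of how many tokens were expanded before it.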

This new version of the treebank is built upon the original treebank, which may be obtained from

The treebank may be downloaded from here.

