Part-of-speech tagging from the CoNLL 2000 data. Extended with morphological features (http://code.google.com/p/miralium/source/browse/trunk/examples/pos_features.py). This N Y N T Th Thi This s is his This DT.
Format:
SequenceTagging
User:
favre
Size:
12M
Status:
processed
Access:
open
Download:
Login required!
#trainEx
8936
#testEx
2012
#trainTok
211727
#testTok
47377
#featureCols
12
#labels
44
exitCode: 0
success: true
test:
labels: # $ '' ( ) , . : CC CD DT EX FW IN JJ JJR JJS MD NN NNP NNPS NNS PDT POS PRP PRP$ RB RBR RBS RP TO UH VB VBD VBG VBN VBP VBZ WDT WP WP$ WRB ``
numExamples: 2012
numFeatureColumns: 12
numLabels: 43
numTokens: 47377
success: true
time: 1
train:
labels: # $ '' ( ) , . : CC CD DT EX FW IN JJ JJR JJS MD NN NNP NNPS NNS PDT POS PRP PRP$ RB RBR RBS RP SYM TO UH VB VBD VBG VBN VBP VBZ WDT WP WP$ WRB ``
numExamples: 8936
numFeatureColumns: 12
numLabels: 44
numTokens: 211727
success: true
time: 5
Comments:
Post comment:
Must be logged in to post comments.