Databasewebkb-4-universities-wisconsin-test
The 4 Universities Data Set. From the description: "This data set contains WWW-pages collected from computer science departments of various universities in January 1997 by the World Wide Knowledge Base(Web->Kb) project of the CMU text learning group. The 8,282 pages were manually classified into the following categories: student (1641), faculty (1124), staff (137), department (182), course (930), project (504), other (3764)." This MLcomp dataset includes the pages from cornell, texas, washington and misc in the training set and uses the wisconsin pages as the test set. The MIME headers and HTML tags were removed. See: [ http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-20/www/data/ ]
DocumentClassification
gijs
20M
processed
open
Login required!
7019
1263
7


Run a program on this dataset Arrow_right


Existing runs on webkb-4-universities-wisconsin-test 1-6 of 6   Action_refresh_blue
ID Program Dataset Tuned hyper. User Updated Status Total time Memory Error >>
Run #8257 document-classification-majority webkb-4-universities-wisconsin-test no internal 2y255d ago done 1s 28M 0.254
Run #8260 libtextcat-2.2-wrap webkb-4-universities-wisconsin-test no internal 2y255d ago done 7m40s 44M 0.686
Run #8263 dictionary-method-language webkb-4-universities-wisconsin-test no internal 2y255d ago done 36s 1173M 0.749
Run #8251 icsiboost-bigram webkb-4-universities-wisconsin-test no internal 2y255d ago failed 3m24s 37M
Run #8253 icsiboost webkb-4-universities-wisconsin-test no internal 2y255d ago failed 5m7s 37M
Run #8255 boostexter-bigram webkb-4-universities-wisconsin-test no internal 2y255d ago failed 8m10s 37M


Processing details Arrow_right


Comments:


Must be logged in to post comments.