From: The TREC 2004 genomics track categorization task: classifying full text biomedical documents
Data Set              Positive Samples  Negative Samples  Total Samples
Training (year 2002)  375               5462              5837
Test (year 2003)      420               5623              6043