Identifying Trx-fold Protein


  1. Chang Wang, Stephen D. Scott, Jun Zhang, Qingping Tao, Dmitri E. Fomenko, and Vadim N. Gladyshev. A Study in Modeling Low-Conservation Protein Superfamilies. Technical report UNL-CSE-2004-0003, University of Nebraska, 2004. [pdf] (Includes a full description of the source of the data. The data set provided here is referred to as "MIL (Motif-based alignment)".)
  2. Qingping Tao, Stephen D. Scott, and N. V. Vinodchandran. SVM-Based Generalized Multiple-Instance Learning via Approximate Box Counting. In Proceedings of the Twenty-First International Conference on Machine Learning (ICML 2004), pages 779-806, Banff, Alberta, Canada, July 2004. [pdf]
  1. Database file: primary.des.db
  2. Specification file: primary.des.spec
  3. Partitions for jackknife test
    a. positive examples: pos.svm
    b. negative examples: neg.svm.*
            1    3    1 1 1  2 2 2  3 3 3
<number of classes>
<class label 1>    <class label 2>    ... ...
<dummy, no meaning>
<number of dimensions>
<minimum value for dimension 1>    <maximum value for dimension 1>
<minimum value for dimension 2>    <maximum value for dimension 2>
... ...
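
The specification layout above can be read with a few lines of Python. The following is a minimal sketch, not the tool used to produce the data set. It assumes that each field sits on its own line exactly as in the template, that tokens are separated by whitespace, and that the dummy line is not empty. The names `Spec` and `parse_spec` and the sample values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Spec:
    class_labels: List[str]            # one label per class, kept as strings
    bounds: List[Tuple[float, float]]  # (minimum, maximum) for each dimension


def parse_spec(text: str) -> Spec:
    """Parse a specification file laid out as in the template above.

    Assumption: blank lines carry no information and are skipped, so the
    dummy line must contain at least one token.
    """
    lines = [ln.split() for ln in text.splitlines() if ln.strip()]

    num_classes = int(lines[0][0])
    class_labels = lines[1]
    if len(class_labels) != num_classes:
        raise ValueError("number of class labels does not match class count")

    # lines[2] is the dummy line; its content is ignored.
    num_dims = int(lines[3][0])
    bound_lines = lines[4:4 + num_dims]
    if len(bound_lines) != num_dims:
        raise ValueError("fewer min/max lines than declared dimensions")

    # Each remaining line must hold exactly two numbers: minimum, then maximum.
    bounds = [(float(lo), float(hi)) for lo, hi in bound_lines]
    return Spec(class_labels=class_labels, bounds=bounds)


# Hypothetical sample content, not taken from primary.des.spec.
example = """2
0 1
0
3
0.0 1.0
-5.0 5.0
0.0 10.0
"""

spec = parse_spec(example)
print(spec.class_labels)
print(spec.bounds)
```

To read the real file, pass its contents to the same function, for example `parse_spec(open("primary.des.spec").read())`. Class labels are kept as strings because the template does not state whether they are numeric.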