Additional file 1 of Pipeline for the identification and classification of ion channels in parasitic flatworms

Table S1. Sequence counts per ion channel family obtained from the KEGG and SwissProt databases and included in the training and test datasets.
Table S2. Accession numbers of ion channels selected for support vector machine model training.
Table S3. The number of sequences in the testing dataset before and after BLASTp analyses.
Table S4. The number of identified test data sequences from humans and C. elegans within each group and divided into known ion channel and non-ion channel datasets.
Table S5. Cross-validation, training and testing accuracies of each model.
Table S6. Final tables of confusion matrices for the “Classifier” and “Dipeptide” models.
Table S7. Summary of flatworm ion channels predicted using the MuSICC identification and classification pipeline with high and medium confidence.
Table S8. Complete set of flatworm ion channels predicted using the MuSICC identification and classification pipeline.
(XLS 2960 kb)