Additional file 1: Figure S1. of Improved annotation with de novo transcriptome assembly in four social amoeba species

TAGC plot for D. fasciculatum (A) and D. lacteum (B) before and after filtering. Each coloured blob represents a different taxon; transcripts with no match are shaded in grey. After the initial filtering, the remaining unannotated (grey) transcripts were further filtered on high GC content and low read coverage. The plot shows a major blob of transcripts annotated as Dictyostelium fasciculatum, with high coverage and lower GC content. Contamination from E. coli, Pseudomonas fluorescens and other species is also highlighted in different colours. These contaminants clearly form separate blobs with lower read coverage and high GC content. However, some other transcripts match Dictyostelium discoideum, which reflects the presence of some novel unannotated transcripts in the new assembly. Figure S2. A comparison of read counts for the assembled transcripts. In the boxplots, the coloured boxes span the range between the 1st and 3rd quartiles of the data, the horizontal bar is the median, and points shown beyond the whiskers fall outside 95% of the data. Table S3. Transrate good contigs. Table S4. Oligonucleotide sequences. Table S5. Alignment with DNA sequence of PCR product. (DOCX 787 kb)
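The secondary filter described for Figure S1, which removes unannotated transcripts that combine high GC content with low read coverage, can be illustrated with a minimal sketch. It is not the authors' pipeline. The names `Transcript` and `filter_unannotated` and the cut-offs `max_gc` and `min_coverage` are hypothetical, since the caption gives no thresholds, and requiring both conditions before dropping a transcript is also an assumption.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Transcript:
    """Hypothetical record for one unannotated (grey) transcript."""
    name: str
    sequence: str
    coverage: float  # mean read coverage


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a sequence (0.0 if empty)."""
    seq = sequence.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def filter_unannotated(
    transcripts: List[Transcript],
    max_gc: float = 0.5,         # assumed cut-off, not from the source
    min_coverage: float = 10.0,  # assumed cut-off, not from the source
) -> List[Transcript]:
    """Drop transcripts that are both GC-rich and poorly covered.

    Assumption: a transcript is removed only when it has high GC
    content AND low coverage, the profile of the contaminant blobs.
    """
    kept = []
    for t in transcripts:
        high_gc = gc_fraction(t.sequence) > max_gc
        low_cov = t.coverage < min_coverage
        if not (high_gc and low_cov):
            kept.append(t)
    return kept


example = [
    Transcript("t1", "ATATATGC", 50.0),   # low GC, high coverage
    Transcript("t2", "GCGCGCGA", 2.0),    # high GC, low coverage
    Transcript("t3", "GCGCGCGC", 100.0),  # high GC, high coverage
]
kept_names = [t.name for t in filter_unannotated(example)]
```

With these illustrative cut-offs, only the transcript that is both GC-rich and poorly covered is discarded. Real thresholds would need to be chosen from the GC and coverage distributions seen in the plot.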