Microbiome composition and implications for ballast water classification using machine learning

2019 
Abstract: Ballast water is a vector for global translocation of microorganisms and should be monitored to protect human and environmental health. This study uses high-throughput sequencing (HTS) and machine learning to examine the bacterial and fungal microbiomes of ballast water and to identify associations between 16S and 18S rRNA genes and the fungal ITS region. These sequencing regions were examined using the SILVA v132 and UNITE reference databases. The highest correlation was found between the communities in Silva_16S and UNITE_ITS (0.74). There was a higher proportion of positive inter-kingdom correlations than positive intra-kingdom correlations (p = 0.032). Understanding the reasons for this difference requires additional research under more controlled conditions. Finally, a machine learning model was used to examine the classification accuracy when using each sequencing region and reference database to identify ballast residence time and ballast sample location. There was significantly higher accuracy using SILVA (0.843) compared to UNITE (0.614) (p