Galaxy-compatible Tool for Rapid Aptamer Clustering and HT-SELEX Data Analysis
Abstract
Aim: Implementing deep sequencing for the analysis of DNA aptamer selection results requires specialized bioinformatic software. Analysis steps include searching for homologous sequences, clustering, and comparing cluster enrichment across different samples. These procedures allow deeper characterization of selected sequences by target affinity, non-specific amplification, and off-target binding, thus highlighting the most promising variants or motifs. Materials and Methods: Sequencing results from systematic evolution of ligands by exponential enrichment (SELEX) for 40-nucleotide aptamers against extracellular CD47 protein were used as datasets for comparative clustering. A modified fast clustering script was developed based on FASTAptamer-Cluster and adapted as a Galaxy tool. The algorithm was modified to terminate the edit distance calculation as soon as the threshold value is reached; the non-matching pair of sequences is then assigned an edit distance exceeding the threshold. Results and Discussion: We have developed a set of Galaxy-compatible applications for rapid clustering of sequencing results and further comparative analysis of clusters. Our clustering algorithm is specifically optimized for finding the highly homologous sequences that usually form aptamer clusters and provides an average 8.4-fold increase in speed. Conclusion: Our modified clustering algorithm substantially surpasses existing alternatives in speed, thus simplifying the analysis of large datasets, while its Galaxy version allows easy integration into standard workflows for preprocessing and analysis of deep sequencing results.
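The early-termination idea described in Materials and Methods can be illustrated with a minimal Python sketch. It is not the authors' script: the function names `bounded_edit_distance` and `greedy_cluster`, the use of Levenshtein distance, and the seed-by-abundance clustering loop are assumptions made for illustration, loosely following the general approach of seeding each cluster with the most abundant remaining sequence. The distance calculation stops as soon as every entry in the current row exceeds the threshold and then reports an exceeding value (`max_dist + 1`) for the non-matching pair.

```python
from typing import Dict, List


def bounded_edit_distance(a: str, b: str, max_dist: int) -> int:
    """Levenshtein distance, or max_dist + 1 once it must exceed max_dist.

    Hypothetical helper for illustration; not taken from the published tool.
    """
    # A length difference larger than the threshold already rules out a match.
    if abs(len(a) - len(b)) > max_dist:
        return max_dist + 1
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        # Row minima never decrease, so the final distance is >= min(current).
        if min(current) > max_dist:
            return max_dist + 1  # exceeding value for a non-matching pair
        previous = current
    return min(previous[-1], max_dist + 1)


def greedy_cluster(counts: Dict[str, int], max_dist: int) -> List[List[str]]:
    """Seed each cluster with the most abundant unassigned sequence (assumed scheme)."""
    remaining = sorted(counts, key=lambda s: (-counts[s], s))
    clusters: List[List[str]] = []
    while remaining:
        seed = remaining[0]
        members = [seed]
        unassigned = []
        for seq in remaining[1:]:
            if bounded_edit_distance(seed, seq, max_dist) <= max_dist:
                members.append(seq)
            else:
                unassigned.append(seq)
        clusters.append(members)
        remaining = unassigned
    return clusters


# Toy read counts (made-up sequences, not data from the study).
toy_counts = {"ACGTACGT": 10, "ACGTACGA": 3, "TTTTCCCC": 5, "TTTTCCCG": 1}
toy_clusters = greedy_cluster(toy_counts, max_dist=1)
```

Because most sequence pairs in a selection pool are far apart, the cutoff lets the comparison abandon them after a few rows instead of filling the whole dynamic-programming table, which is where a speed gain of this kind would be expected to come from.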
Article Details
How to Cite
Skrylnik, N. A. (2017). Galaxy-compatible Tool for Rapid Aptamer Clustering and HT-SELEX Data Analysis. Asian Journal of Pharmaceutics (AJP), 11(02). https://doi.org/10.22377/ajp.v11i02.1265
Section
ORIGINAL ARTICLES
This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0), which requires that reusers give credit to the creator. It allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, for noncommercial purposes only.