Many assemblers, each based on a different algorithm, are available for de novo transcriptome assembly. Read filtering, one of the key stages of the workflow, likewise has several approaches and algorithms. To date, however, very little work has examined how the degree of filtering influences de novo transcriptome assembly. In this paper, we analyzed transcripts obtained with two of the most widely used programs (rnaSPAdes and Trinity) after applying various approaches to read filtering. We showed key differences between the two assemblies and identified the parameters that were sensitive to the degree of filtering and to the length of the input reads. We also proposed an effective two-stage filtering algorithm that retains as much of the input data as possible while ensuring that all reads meet the required quality after filtering and cropping.
RNA-seq, rnaSPAdes, Trinity, de novo transcriptome assembly, read filtering
