Efficiently grouping similar DNA sequences to remove duplicates (2019)(blog.liudaniel.com) |
Efficiently grouping similar DNA sequences to remove duplicates (2019)(blog.liudaniel.com) |
This is not really explained in the blog post, but the "naive" method is O(N^2) brute force search, and the "combos" method is recursively going through all combinations of UMIs within a certain edit distance. There are also some other variants that are evaluated.
If you have any questions, feel free to ask me!