MATLAB: Gene contains only ATGC

atgcgenesnucleotides

I have a list of genes (strings of letters) and I want delete the ones that has letters other than ATGC (N for example). how can I do it?
Thank you!

Best Answer

You just want to remove the invalid letters? The code below will do that. If you want to check if a gene string is invalid you can check that with any(~L).
gene='ATTCAGN';
L=ismember(gene,'ATGC');
gene=gene(L);
Related Question