The AFFYREAD structure has been changed for the Bioinformatics Toolbox V3.0 - which is the toolbox release in conjunction with MATLAB 7.5 (R2007b).
The new GroupNumber column gives the same information as the ProbeSetNumber subfield did, except that the GroupNumber indexing is 1-based while the ProbeSetNumber was 0-based (in accordance with Affymetrix standards).
ProbePairNumber, however, was removed due to the perceived redundancy in the information it provided; the probe pair numbers are simply the row indices of the matrices contained in the ProbeSets field of the returned struct (each row represents the readings of one probe pair).
While the ProbePairNumber column was removed, it is possible to compute the probe pair numbers from the ProbeSets matrices, and if desired, insert them back into the returned struct. Shown below is code that would perform this:
for n = 1:length(affy_struct.ProbeSets)
CurPairs = affy_struct.ProbeSets(n).ProbePairs;
CurPairNumbers = (1:length(CurPairs));
CurPairNumbers = CurPairNumbers(:);
CurPairs = [CurPairs(:,1) CurPairNumbers CurPairs(:,2:end)];
affy_struct.ProbeSets(n).ProbePairs = CurPairs;
end
where affy_struct is the struct obtained by reading a CDF file with AFFYREAD.
You may also then insert 'ProbePairNumber' as one of the names in affy_struct.ProbeSetColumnNames by executing the following code:
CurNames = affy_struct.ProbeSetColumnNames;
CurNames = {CurNames{1} 'ProbePairNumber' CurNames{2:end}};
affy_struct.ProbeSetColumnNames = CurNames;
Best Answer