MATLAB: Splitting a table into smaller ones based on two columns

datafindindexingsorttable

Hello,

The code I use produces a master table with 8 columns and around 800 rows. I would like to sort the data in the following steps:

Take first line of data from master table and look at the values in column six and eight.
Find all other rows in the table with a column six value within 0.5 of the line one column six value and a column eight value within 2 of the line one column eight value.
Create new table with this data.
Delete sorted data from master table.
Repeat 1-> 4 until no data left.
Be left with multiple tables.

Is there a way to do this?

Thank you for your help.

Best Answer

That can easily be done in a loop but it doesn't sound like a good idea. Breaking apart well-organized tabular data into sub-tables is like moving into a new house by unpacking each box in the driveway and carrying in each item from the box individually rather than just carying in the box. Keep the data together whenever possible.

Instead, each row of the table can be assigned a subgroup number and then you can use those row numbers to pull out data as needed.

Here's a functional demo with comments to illustrate this method. 'rowGroup' is used to identify subtable rows.

% Create demo data
T = array2table(rand(20,8).*2); 
T{:,8} = T{:,8} * 5; 
% Identify the group number of each row based on
% col 6 & 8 values and their given tolerance levels.
rowGroup = zeros(size(T,1),1); % This will store the group number for each row
while any(rowGroup==0)
    % find next unassigned row, starting at the top
    rowNum = find(rowGroup==0,1,'first'); 
    % find all rows in col 6 that are within tolerance 
    group1idx = abs(T{rowNum,6} - T{:,6}) <= 0.5; 
    % find all rows in col 8 that are within tolerance 
    group2idx = abs(T{rowNum,8} - T{:,8}) <= 2.0; 
    % identify the rows that fit into this group
    rowGroup(group1idx & group2idx & rowGroup==0) = max(rowGroup)+1;    
end
% rowGroup is a column vector of row numbers that identify the subgroups. 
% Your values will differ due to using random data
% >> rowGroup(1:5)
% ans =
%      1 
%      2
%      3
%      4

%      4
% now you can access sub-groups of data like this 
T(rowGroup==1,:) % for group 1

To see the number of subgroups and the number of rows within each subgroup,

subgroupSummary = table((min(rowGroup):max(rowGroup))', ...
    histcounts(rowGroup,min(rowGroup):max(rowGroup)+1)', ...
    'VariableNames', {'Group', 'nRows'})

Related Solutions

MATLAB: How to efficiently make the for loop of different dot structure compact

I think, this latest work-around work better than the original version.. However, I welcome any new suggestion.

sixEarly = (reshape ((result (:,1:96)),[12,8])).';
eightRoster = (reshape ((result (:,97:176)),[10,8])).';
sixLate = (reshape ((result (:,177:end)),[12,8])).';
% Do all the transformation under the function TABLECREATION
Tab_sixLate =tableCreation (at,sixLate);
Tab_sixEarly =tableCreation (at,sixEarly);
Tab_eightRoster =tableCreation (at,eightRoster);
function Table_Converted =tableCreation (at,NonConvertedCell)
Table_Converted = cell2table( NonConvertedCell ) ;
ListPat = (Table_Converted{:,1}).';
ListCondType = Table_Converted{1,:};
GetPatName = cellfun(@(v) v(1), ListPat(1,:));
GetCondType = cellfun(@(v) v(1), ListCondType(1,:));
% But still, I had to use two for loop here, mmm
for i=1:length  (GetPatName)
      Table_Converted.Properties.RowNames{i} = char ((at.sub.ID ((GetPatName(i)),1)));
  end
for i =1:length  (GetCondType)
Table_Converted.Properties.VariableNames{i} = char ((at.sub.Shiftday ((GetCondType(i)),1)));
end
end

MATLAB: How to make heatmap y-axis based on column names, not column values

data = [0 280 1170 0; 62 0 0 0];
VariableNames = {'colA', 'colB', 'colC', 'colD'};
rowid= {'rowA', 'rowB'};
hHM=heatmap(rowid,VariableNames,data.','ColorbarVisible','off');

seems to do the trick.

Looks more difficult (but probably still possible with judicious selection of parameters and data from the table) after in the table than with the raw data.

Best Answer

Related Solutions

MATLAB: How to efficiently make the for loop of different dot structure compact

MATLAB: How to make heatmap y-axis based on column names, not column values

Related Question