MATLAB: How to store the detected object in video in a folder in each frame?

I want to store the detected object that is inside the yellow box and store each object with his label name in a separate file like object 1 in folder 1 object 2 in folder 2 only I want to save the detected object in the frame. Image:

Code: if true %% Motion-Based Multiple Object Tracking % This example shows how to perform automatic detection and motion-based % tracking of moving objects in a video from a stationary camera. % % Copyright 2014 The MathWorks, Inc.

Detection of moving objects and motion-based tracking are important components of many computer vision applications, including activity recognition, traffic monitoring, and automotive safety. The problem of motion-based object tracking can be divided into two parts:

detecting moving objects in each frame
associating the detections corresponding to the same object over time

The detection of moving objects uses a background subtraction algorithm based on Gaussian mixture models. Morphological operations are applied to the resulting foreground mask to eliminate noise. Finally, blob analysis detects groups of connected pixels, which are likely to correspond to moving objects.

The association of detections to the same object is based solely on motion. The motion of each track is estimated by a Kalman filter. The filter is used to predict the track's location in each frame, and determine the likelihood of each detection being assigned to each track.

Track maintenance becomes an important aspect of this example. In any given frame, some detections may be assigned to tracks, while other detections and tracks may remain unassigned.The assigned tracks are updated using the corresponding detections. The unassigned tracks are marked invisible. An unassigned detection begins a new track.

Each track keeps count of the number of consecutive frames, where it remained unassigned. If the count exceeds a specified threshold, the example assumes that the object left the field of view and it deletes the track.

This example is a function with the main body at the top and helper routines in the form of nested functions below.



    function multiObjectTracking()

    % Create System objects used for reading video, detecting moving objects,
    % and displaying the results.
    obj = setupSystemObjects();

    tracks = initializeTracks(); % Create an empty array of tracks.

    nextId = 1; % ID of the next track

    % Detect moving objects, and track them across video frames.
    while ~isDone(obj.reader)
        frame = readFrame();
        [centroids, bboxes, mask] = detectObjects(frame);
        predictNewLocationsOfTracks();
        [assignments, unassignedTracks, unassignedDetections] = ...
            detectionToTrackAssignment();

        updateAssignedTracks();
        updateUnassignedTracks();
        deleteLostTracks();
        createNewTracks();

        displayTrackingResults();
    end

    %%Create System Objects
    % Create System objects used for reading the video frames, detecting
    % foreground objects, and displaying results.

        function obj = setupSystemObjects()
            % Initialize Video I/O
            % Create objects for reading a video from a file, drawing the tracked
            % objects in each frame, and playing the video.

            % Create a video file reader.
            obj.reader = vision.VideoFileReader('Video.avi');

            % Create two video players, one to display the video,
            % and one to display the foreground mask.
            obj.videoPlayer = vision.VideoPlayer('Position', [20, 400, 700, 400]);
            obj.maskPlayer = vision.VideoPlayer('Position', [740, 400, 700, 400]);

            % Create System objects for foreground detection and blob analysis

            % The foreground detector is used to segment moving objects from
            % the background. It outputs a binary mask, where the pixel value
            % of 1 corresponds to the foreground and the value of 0 corresponds
            % to the background. 

            obj.detector = vision.ForegroundDetector('NumGaussians', 3, ...
                'NumTrainingFrames', 40, 'MinimumBackgroundRatio', 0.7);

            % Connected groups of foreground pixels are likely to correspond to moving
            % objects.  The blob analysis System object is used to find such groups
            % (called 'blobs' or 'connected components'), and compute their
            % characteristics, such as area, centroid, and the bounding box.

            obj.blobAnalyser = vision.BlobAnalysis('BoundingBoxOutputPort', true, ...
                'AreaOutputPort', true, 'CentroidOutputPort', true, ...
                'MinimumBlobArea', 400);
        end

    %%Initialize Tracks
    % The |initializeTracks| function creates an array of tracks, where each
    % track is a structure representing a moving object in the video. The
    % purpose of the structure is to maintain the state of a tracked object.
    % The state consists of information used for detection to track assignment,
    % track termination, and display. 
    %

    % The structure contains the following fields:
    %
    % * |id| :                  the integer ID of the track
    % * |bbox| :                the current bounding box of the object; used
    %                           for display
    % * |kalmanFilter| :        a Kalman filter object used for motion-based
    %                           tracking
    % * |age| :                 the number of frames since the track was first
    %                           detected
    % * |totalVisibleCount| :   the total number of frames in which the track
    %                           was detected (visible)
    % * |consecutiveInvisibleCount| : the number of consecutive frames for 
    %                                  which the track was not detected (invisible).
    %
    % Noisy detections tend to result in short-lived tracks. For this reason,
    % the example only displays an object after it was tracked for some number
    % of frames. This happens when |totalVisibleCount| exceeds a specified 
    % threshold.    
    %
    % When no detections are associated with a track for several consecutive
    % frames, the example assumes that the object has left the field of view 
    % and deletes the track. This happens when |consecutiveInvisibleCount|
    % exceeds a specified threshold. A track may also get deleted as noise if 
    % it was tracked for a short time, and marked invisible for most of the of 
    % the frames.        

        function tracks = initializeTracks()
            % create an empty array of tracks
            tracks = struct(...
                'id', {}, ...
                'bbox', {}, ...
                'kalmanFilter', {}, ...
                'age', {}, ...
                'totalVisibleCount', {}, ...
                'consecutiveInvisibleCount', {});
        end

    %%Read a Video Frame
    % Read the next video frame from the video file.
        function frame = readFrame()
            frame = obj.reader.step();
        end

    %%Detect Objects
    % The |detectObjects| function returns the centroids and the bounding boxes
    % of the detected objects. It also returns the binary mask, which has the 
    % same size as the input frame. Pixels with a value of 1 correspond to the
    % foreground, and pixels with a value of 0 correspond to the background.   
    %
    % The function performs motion segmentation using the foreground detector. 
    % It then performs morphological operations on the resulting binary mask to
    % remove noisy pixels and to fill the holes in the remaining blobs.  

        function [centroids, bboxes, mask] = detectObjects(frame)

            % Detect foreground.
            mask = obj.detector.step(frame);

            % Apply morphological operations to remove noise and fill in holes.
            mask = imopen(mask, strel('rectangle', [3,3]));
            mask = imclose(mask, strel('rectangle', [15, 15])); 
            mask = imfill(mask, 'holes');

            % Perform blob analysis to find connected components.
            [~, centroids, bboxes] = obj.blobAnalyser.step(mask);
        end

    %%Predict New Locations of Existing Tracks
    % Use the Kalman filter to predict the centroid of each track in the
    % current frame, and update its bounding box accordingly.

        function predictNewLocationsOfTracks()
            for i = 1:length(tracks)
                bbox = tracks(i).bbox;

                % Predict the current location of the track.
                predictedCentroid = predict(tracks(i).kalmanFilter);

                % Shift the bounding box so that its center is at 
                % the predicted location.
                predictedCentroid = int32(predictedCentroid) - bbox(3:4) / 2;
                tracks(i).bbox = [predictedCentroid, bbox(3:4)];
            end
        end

    %%Assign Detections to Tracks
    % Assigning object detections in the current frame to existing tracks is
    % done by minimizing cost. The cost is defined as the negative
    % log-likelihood of a detection corresponding to a track.  
    %
    % The algorithm involves two steps: 
    %
    % Step 1: Compute the cost of assigning every detection to each track using
    % the |distance| method of the |vision.KalmanFilter| System object(TM). The 
    % cost takes into account the Euclidean distance between the predicted
    % centroid of the track and the centroid of the detection. It also includes
    % the confidence of the prediction, which is maintained by the Kalman
    % filter. The results are stored in an MxN matrix, where M is the number of
    % tracks, and N is the number of detections.   
    %
    % Step 2: Solve the assignment problem represented by the cost matrix using
    % the |assignDetectionsToTracks| function. The function takes the cost 
    % matrix and the cost of not assigning any detections to a track.  
    %
    % The value for the cost of not assigning a detection to a track depends on
    % the range of values returned by the |distance| method of the 
    % |vision.KalmanFilter|. This value must be tuned experimentally. Setting 
    % it too low increases the likelihood of creating a new track, and may
    % result in track fragmentation. Setting it too high may result in a single 
    % track corresponding to a series of separate moving objects.   
    %
    % The |assignDetectionsToTracks| function uses the Munkres' version of the
    % Hungarian algorithm to compute an assignment which minimizes the total
    % cost. It returns an M x 2 matrix containing the corresponding indices of
    % assigned tracks and detections in its two columns. It also returns the
    % indices of tracks and detections that remained unassigned. 

        function [assignments, unassignedTracks, unassignedDetections] = ...
                detectionToTrackAssignment()

            nTracks = length(tracks);
            nDetections = size(centroids, 1);

            % Compute the cost of assigning each detection to each track.
            cost = zeros(nTracks, nDetections);
            for i = 1:nTracks
                cost(i, :) = distance(tracks(i).kalmanFilter, centroids);
            end

            % Solve the assignment problem.
            costOfNonAssignment = 20;
            [assignments, unassignedTracks, unassignedDetections] = ...
                assignDetectionsToTracks(cost, costOfNonAssignment);
        end

    %%Update Assigned Tracks
    % The |updateAssignedTracks| function updates each assigned track with the
    % corresponding detection. It calls the |correct| method of
    % |vision.KalmanFilter| to correct the location estimate. Next, it stores
    % the new bounding box, and increases the age of the track and the total
    % visible count by 1. Finally, the function sets the invisible count to 0. 

        function updateAssignedTracks()
            numAssignedTracks = size(assignments, 1);
            for i = 1:numAssignedTracks
                trackIdx = assignments(i, 1);
                detectionIdx = assignments(i, 2);
                centroid = centroids(detectionIdx, :);
                bbox = bboxes(detectionIdx, :);

                % Correct the estimate of the object's location
                % using the new detection.
                correct(tracks(trackIdx).kalmanFilter, centroid);

                % Replace predicted bounding box with detected
                % bounding box.
                tracks(trackIdx).bbox = bbox;

                % Update track's age.
                tracks(trackIdx).age = tracks(trackIdx).age + 1;

                % Update visibility.
                tracks(trackIdx).totalVisibleCount = ...
                    tracks(trackIdx).totalVisibleCount + 1;
                tracks(trackIdx).consecutiveInvisibleCount = 0;
            end
        end

    %%Update Unassigned Tracks
    % Mark each unassigned track as invisible, and increase its age by 1.

        function updateUnassignedTracks()
            for i = 1:length(unassignedTracks)
                ind = unassignedTracks(i);
                tracks(ind).age = tracks(ind).age + 1;
                tracks(ind).consecutiveInvisibleCount = ...
                    tracks(ind).consecutiveInvisibleCount + 1;
            end
        end

    %%Delete Lost Tracks
    % The |deleteLostTracks| function deletes tracks that have been invisible
    % for too many consecutive frames. It also deletes recently created tracks
    % that have been invisible for too many frames overall. 

        function deleteLostTracks()
            if isempty(tracks)
                return;
            end

            invisibleForTooLong = 20;
            ageThreshold = 8;

            % Compute the fraction of the track's age for which it was visible.
            ages = [tracks(:).age];
            totalVisibleCounts = [tracks(:).totalVisibleCount];
            visibility = totalVisibleCounts ./ ages;

            % Find the indices of 'lost' tracks.
            lostInds = (ages < ageThreshold & visibility < 0.6) | ...
                [tracks(:).consecutiveInvisibleCount] >= invisibleForTooLong;

            % Delete lost tracks.
            tracks = tracks(~lostInds);
        end

    %%Create New Tracks
    % Create new tracks from unassigned detections. Assume that any unassigned
    % detection is a start of a new track. In practice, you can use other cues
    % to eliminate noisy detections, such as size, location, or appearance.

        function createNewTracks()
            centroids = centroids(unassignedDetections, :);
            bboxes = bboxes(unassignedDetections, :);

            for i = 1:size(centroids, 1)

                centroid = centroids(i,:);
                bbox = bboxes(i, :);

                % Create a Kalman filter object.
                kalmanFilter = configureKalmanFilter('ConstantVelocity', ...
<

            

            
            
            

    
        
	
    
        Best Answer
    

    
								
								Somewhere you need to use bboxes to crop the image with imcrop(). Then save the cropped image with imwrite().
						    
								
    
        
    

Related Solutions
MATLAB: How to remove shadow in our program ,

								
								If your video is RGB, you can try converting the video frame to YCbCr using rgb2ycbr function, and pass only the Cb and the Cr components to the foreground detector. This should reduce the false detections caused by shadows, but you may also start missing real objects that only differ from the background in intensity.
						    
						
MATLAB: How would i track the speed of the centroid of the moving objects

								
								If it is acceptable to compute velocities post hoc, I suggest the following edits to this code:
1. Add tracks as an output so you can process them from another script afterwards:
function tracks = multiObjectTracking()
...
2. In the initializeTracks() function, add a field in the tracks structure to store position over time (I suggest using an animatedline object). Also add a field to keep track of which tracks are active or not:
tracks = struct(...
    ...
    'active',{}, ...
    'position',{});
3. In the createNewTracks() function, instantiate an animated line when you create a new track and set the track to active:
newTrack = struct(...
    ...
    'active',true, ...
    'position',animatedline()); %you can add optional parameters to the animatedline here if you want
4. In the deleteLostTracks() function, instead of deleting tracks, simply set them to inactive:
Replace
lostInds = ...;
tracks = tracks(~lostInds);
with
lostInds = find(...);
for ii=1:length(lostInds)
    tracks(lostInds(ii)).active = false;
end
5. In the displayTrackingResults() function, add a condition to the reliableTrackInds to check active state:
reliableTrackInds = ... & [tracks(:).active];
Then, add the centroid location to the animatedline object for each active track (I suggest after "mask = insertObjectAnnotation..."):
x = double(bboxes(:,1)+bboxes(:,3)/2);
y = double(bboxes(:,2)+bboxes(:,4)/2);
for ii=1:length(reliableTracks)
    addpoints(tracks(reliableTracks(ii).id).position,x(ii),y(ii));
end
6. The last step is to make sure the function predictNewLocationsOfTracks(), detectionToTrackAssignment(), updateAssignedTracks(), and updateUnassignedTracks() ignore the inactive tracks. There are many potential ways to do this; I'll leave this part to you.
7. After the code has been edited, you should be able to run:
tracks = multiObjectTracking();
[x,y] = getpoints(tracks(4).position);
to get the centroid path for the 4th track, for example. The velocity can be computed easily enough with this information.
Hope this helps.


        
            
            
            Related Question
Undefined variable “vision” or class “vision.OpticalFlow”. Error in opticalflow_hornschunk (line 21) OpticalFlow = vision.OpticalFlow(‘ReferenceFrameDelay’,3) This is the error message i am getting please resolve… thanks in advance :)
Implementing Parfor With Tricky Indexes
Trouble with Indexing and Parfor